Unit selection speech synthesis for text-to-speech systems

Speech is the means of communication in the vocal form, used to express one’s emotions, thoughts and feelings. Research in the field of speech generation has been ongoing for several decades, and it has evidently made significant progress with the introduction of systems like Siri, Alexa and Google...

Bibliographic Details
Main Author:	Gupta, Vanya
Other Authors:	Lin Weisi
Format:	Final Year Project (FYP)
Language:	English
Published:	2017
Subjects:	DRNTU::Engineering::Computer science and engineering
Online Access:	http://hdl.handle.net/10356/70470

_version_	1811693592621613056
author	Gupta, Vanya
author2	Lin Weisi
author_facet	Lin Weisi Gupta, Vanya
author_sort	Gupta, Vanya
collection	NTU
description	Speech is the means of communication in the vocal form, used to express one’s emotions, thoughts and feelings. Research in the field of speech generation has been ongoing for several decades, and it has evidently made significant progress with the introduction of systems like Siri, Alexa and Google Assistant. With the rise of conversational interaction between humans and computers, it becomes crucial to make speech technology realistic, reliable and intelligent enough to be useful to the masses. Several techniques have been developed and explored, which have helped incorporate these systems into everyday life, for example as automated telephone responses, announcements at train or metro stations, or as an aid to those who are blind or who have lost the ability to speak. Despite the complexities and challenges involved, it comes as no surprise that this field has received considerable attention and resources over the last few decades, with the main goal of creating systems that mimic human understanding of speech. This report focuses on the concatenative synthesis approach to building a text-to-speech system while maintaining speech intelligibility and quality at appropriate levels.
first_indexed	2024-10-01T06:54:08Z
format	Final Year Project (FYP)
id	ntu-10356/70470
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T06:54:08Z
publishDate	2017
record_format	dspace
spelling	ntu-10356/70470 2023-03-03T20:28:08Z Unit selection speech synthesis for text-to-speech systems Gupta, Vanya Lin Weisi School of Computer Science and Engineering A*STAR Institute for Infocomm Research (I2R) Huang Dong-Yan DRNTU::Engineering::Computer science and engineering Speech is the means of communication in the vocal form, used to express one’s emotions, thoughts and feelings. Research in the field of speech generation has been ongoing for several decades, and it has evidently made significant progress with the introduction of systems like Siri, Alexa and Google Assistant. With the rise of conversational interaction between humans and computers, it becomes crucial to make speech technology realistic, reliable and intelligent enough to be useful to the masses. Several techniques have been developed and explored, which have helped incorporate these systems into everyday life, for example as automated telephone responses, announcements at train or metro stations, or as an aid to those who are blind or who have lost the ability to speak. Despite the complexities and challenges involved, it comes as no surprise that this field has received considerable attention and resources over the last few decades, with the main goal of creating systems that mimic human understanding of speech. This report focuses on the concatenative synthesis approach to building a text-to-speech system while maintaining speech intelligibility and quality at appropriate levels. Bachelor of Engineering (Computer Science) 2017-04-25T01:06:18Z 2017-04-25T01:06:18Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/70470 en Nanyang Technological University 44 p. application/pdf
spellingShingle	DRNTU::Engineering::Computer science and engineering Gupta, Vanya Unit selection speech synthesis for text-to-speech systems
title	Unit selection speech synthesis for text-to-speech systems
title_full	Unit selection speech synthesis for text-to-speech systems
title_fullStr	Unit selection speech synthesis for text-to-speech systems
title_full_unstemmed	Unit selection speech synthesis for text-to-speech systems
title_short	Unit selection speech synthesis for text-to-speech systems
title_sort	unit selection speech synthesis for text to speech systems
topic	DRNTU::Engineering::Computer science and engineering
url	http://hdl.handle.net/10356/70470
work_keys_str_mv	AT guptavanya unitselectionspeechsynthesisfortexttospeechsystems
