Exploring Efficient Neural Architectures for Linguistic–Acoustic Mapping in Text-To-Speech

Conversion from text to speech relies on the accurate mapping from linguistic to acoustic symbol sequences, for which current practice employs recurrent statistical models such as recurrent neural networks. Despite the good performance of such models (in terms of low distortion in the generated spee...

Full description

Bibliographic Details
Main Authors:	Santiago Pascual, Joan Serrà, Antonio Bonafonte
Format:	Article
Language:	English
Published:	MDPI AG 2019-08-01
Series:	Applied Sciences
Subjects:	recurrent neural networks self-attention quasi-recurrent neural networks deep learning acoustic model speech synthesis text-to-speech
Online Access:	https://www.mdpi.com/2076-3417/9/16/3391

Internet

https://www.mdpi.com/2076-3417/9/16/3391

Exploring Efficient Neural Architectures for Linguistic–Acoustic Mapping in Text-To-Speech

Internet

Similar Items