DIA-TTS: Deep-Inherited Attention-Based Text-to-Speech Synthesizer

Text-to-speech (TTS) synthesizers are widely used as vital assistive tools in various fields. Traditional sequence-to-sequence (seq2seq) TTS systems such as Tacotron2 use a single soft attention mechanism for encoder and decoder alignment tasks, which is the biggest shortcoming that incorrectly or r...

Bibliographic Details
Main Authors:	Junxiao Yu, Zhengyuan Xu, Xu He, Jian Wang, Bin Liu, Rui Feng, Songsheng Zhu, Wei Wang, Jianqing Li
Format:	Article
Language:	English
Published:	MDPI AG 2022-12-01
Series:	Entropy
Subjects:	natural language processing; text-to-speech; deep learning; information theory; deep neural network; local-sensitive attention
Online Access:	https://www.mdpi.com/1099-4300/25/1/41
