Research on a Mongolian Text to Speech Model Based on Ghost and ILPCnet
The core challenge of speech synthesis technology is how to convert text information into an audible audio form to meet the needs of users. In recent years, the quality of speech synthesis based on end-to-end speech synthesis models has been significantly improved. However, due to the characteristic...
Main Authors: | Qing-Dao-Er-Ji Ren, Lele Wang, Wenjing Zhang, Leixiao Li |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-01-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/14/2/625 |
Similar Items
-
Quality differences in male and female vocoded speech.
by: Christopher, Deborah Kaye.
Published: (2024) -
Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU
by: Keisuke Matsubara, et al.
Published: (2021-01-01) -
Practical approaches to speed coding/
by: 453791 Papamichalis, Panos E.
Published: (1987) -
Speech Processing Research Program
by: Lim, Jae S., et al.
Published: (2010) -
Speech intelligibility changes the temporal evolution of neural speech tracking
by: Ya-Ping Chen, et al.
Published: (2023-03-01)