Research on a Mongolian Text to Speech Model Based on Ghost and ILPCnet

The core challenge of speech synthesis technology is how to convert text information into an audible audio form to meet the needs of users. In recent years, the quality of speech synthesis based on end-to-end speech synthesis models has been significantly improved. However, due to the characteristic...

Bibliographic Details
Main Authors:	Qing-Dao-Er-Ji Ren, Lele Wang, Wenjing Zhang, Leixiao Li
Format:	Article
Language:	English
Published:	MDPI AG 2024-01-01
Series:	Applied Sciences
Subjects:	Mongolian; speech synthesis; non-autoregressive; Ghost module; vocoder
Online Access:	https://www.mdpi.com/2076-3417/14/2/625

Similar Items

Quality differences in male and female vocoded speech.
by: Christopher, Deborah Kaye.
Published: (2024)

Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU
by: Keisuke Matsubara, et al.
Published: (2021-01-01)

Practical approaches to speech coding /
by: Papamichalis, Panos E.
Published: (1987)

Speech Processing Research Program
by: Lim, Jae S., et al.
Published: (2010)

Speech intelligibility changes the temporal evolution of neural speech tracking
by: Ya-Ping Chen, et al.
Published: (2023-03-01)

Contributions of Temporal Modulation Cues in Temporal Amplitude Envelope of Speech to Urgency Perception
by: Masashi Unoki, et al.
Published: (2023-05-01)

SelfRemaster: Self-Supervised Speech Restoration for Historical Audio Resources
by: Takaaki Saeki, et al.
Published: (2023-01-01)

Lexical Effects on the Perceived Clarity of Noise-Vocoded Speech in Younger and Older Listeners
by: Terrin N. Tamati, et al.
Published: (2022-04-01)

Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
by: Axel Roebel, et al.
Published: (2022-02-01)

Contribution of Common Modulation Spectral Features to Vocal-Emotion Recognition of Noise-Vocoded Speech in Noisy Reverberant Environments
by: Taiyang Guo, et al.
Published: (2022-10-01)

An Improved Noise Reduction Technique for Enhancing the Intelligibility of Sinewave Vocoded Speech: Implication in Cochlear Implants
by: Venkateswarlu Poluboina, et al.
Published: (2023-01-01)

An auditory perspective on phonological development in infancy
by: Monica Hegde, et al.
Published: (2024-01-01)

Modality-Specific Perceptual Learning of Vocoded Auditory versus Lipread Speech: Different Effects of Prior Information
by: Lynne E. Bernstein, et al.
Published: (2023-06-01)

Self-Modulated Ghost Imaging in Dynamic Scattering Media
by: Ying Yu, et al.
Published: (2023-11-01)

Detection of Non-native Speaker Status from Backwards and Vocoded Content-masked Speech
by: Arkadiusz Rojczyk, et al.
Published: (2021-01-01)

Identification of Minimal Pairs of Japanese Pitch Accent in Noise-Vocoded Speech
by: Yukiko Sugiyama
Published: (2022-05-01)

Data Transmission over GSM Voice Channel
by: A. A. Panov
Published: (2012-03-01)

Neural correlates of individual differences in predicting ambiguous sounds comprehension level
by: Yi Lin, et al.
Published: (2022-05-01)

The Relationship between Turkic and Mongolian and Errors in Detection of Turkic and Mongolian Loan Words in Persian
by: Mehdi Rezaei
Published: (2019-05-01)

Three factors are critical in order to synthesize intelligible noise-vocoded Japanese speech
by: Takuya Kishida, et al.
Published: (2016-04-01)

Recognition of foreign-accented vocoded speech by native English listeners
by: Yang Jing, et al.
Published: (2023-01-01)

THE REPRESENTATION OF THE GHOST ARCHETYPE IN THE CANTERVILLE GHOST BY OSCAR WILDE
by: Safaryan Agata Vladimirovna
Published: (2015-03-01)

True ghost stories /
by: Dowswell, Paul, author, et al.
Published: (2002)

Fundamentals of voice-quality engineering in wireless networks /
by: Perry, Avi
Published: (2007)

Present Situation of Mongolian Development and its Future Prospects
by: L Tsedendamba
Published: (2011-03-01)

Upregulation of cognitive control networks in older adults’ speech comprehension
by: Julia Erb, et al.
Published: (2013-12-01)

Text-to-speech synthesis /
by: Taylor, Paul
Published: (2009)

The Contribution of Cognitive Factors to Individual Differences in Understanding Noise-Vocoded Speech in Young and Older Adults
by: Stephanie Rosemann, et al.
Published: (2017-06-01)

Optimized pointwise convolution operation by Ghost blocks
by: Xinzheng Xu, et al.
Published: (2023-03-01)

Comparative Study for Multi-Speaker Mongolian TTS with a New Corpus
by: Kailin Liang, et al.
Published: (2023-03-01)

The ‘Mongol World’ and Mongolian-Russian Relations: Factors of Influence Revisited
by: Demberel Kolyagiyn
Published: (2021-12-01)

Effects of age on long term memory for degraded speech
by: Christiane Thiel, et al.
Published: (2016-09-01)

Divination with Khulil as Practiced by Mongolians
by: Anna D. Tsendina
Published: (2021-10-01)

Ghost stories from the past /
by: Bond, Ruskin
Published: (2008)

Mongolian People’s Republic: Stages of Language Policy Revisited
by: Karina I. Bikmaeva
Published: (2022-08-01)

100 GHOSTS : a gallery of harmless haunts /
by: Horner, Doogie, author
Published: (2013)

EMPHASIS, ONOMATOPOEIA/EXCLAMATION WORDS IN MONGOLIAN
by: Tuncer GÜLENSOY
Published: (2019-06-01)

The sources of O.M. Kovalevsky's Mongolian-Russian-French Dictionary
by: V.L. Uspensky
Published: (2018-12-01)

Effect of spectral degradation on speech intelligibility and cortical representation
by: Hyo Jung Choi, et al.
Published: (2024-04-01)