Multi Speaker Natural Speech Synthesis Using Generative Flows
Modern speech synthesis systems generate natural speech and have high performance. Models using generative flows, among others, have shown impressive results, allowing you to form a variety of speech pronunciation from a given text. However, they are focused on synthesizing the voice of one given sp...
Main Author: | Dmitry Obukhov |
---|---|
Format: | Article |
Language: | Russian |
Published: |
The Fund for Promotion of Internet media, IT education, human development «League Internet Media»
2021-12-01
|
Series: | Современные информационные технологии и IT-образование |
Subjects: | |
Online Access: | http://sitito.cs.msu.ru/index.php/SITITO/article/view/807 |
Similar Items
-
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
by: Yi Zhao, et al.
Published: (2018-01-01) -
SuperFormer: Enhanced Multi-Speaker Speech Separation Network Combining Channel and Spatial Adaptability
by: Yanji Jiang, et al.
Published: (2022-07-01) -
Speaker-turn aware diarization for speech-based cognitive assessments
by: Sean Shensheng Xu, et al.
Published: (2024-01-01) -
Comparison of Modern Deep Learning Models for Speaker Verification
by: Vitalii Brydinskyi, et al.
Published: (2024-02-01) -
Effective Zero-Shot Multi-Speaker Text-to-Speech Technique Using Information Perturbation and a Speaker Encoder
by: Chae-Woon Bang, et al.
Published: (2023-12-01)