An Emotion Speech Synthesis Method Based on VITS
People and things can be connected through the Internet of Things (IoT), and speech synthesis is one of the key technologies. At this stage, end-to-end speech synthesis systems are capable of synthesizing relatively realistic human voices, but the current commonly used parallel text-to-speech suffer...
Main Authors: | Wei Zhao, Zheng Yang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-02-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/4/2225 |
Similar Items
-
MaxMViT-MLP: Multiaxis and Multiscale Vision Transformers Fusion Network for Speech Emotion Recognition
by: Kah Liang Ong, et al.
Published: (2024-01-01) -
Lexical Dependent Emotion Detection Using Synthetic Speech Reference
by: Reza Lotfian, et al.
Published: (2019-01-01) -
Collective Emotional Intelligence and Group Dynamics Interplay: Can It Be Tangible and Measurable?
by: Eleni Fotopoulou, et al.
Published: (2022-01-01) -
Writ Large on Your Face: Observing Emotions Using Automatic Facial Analysis
by: Dieckmann Anja, et al.
Published: (2014-05-01) -
IoT-Enabled WBAN and Machine Learning for Speech Emotion Recognition in Patients
by: Damilola D. Olatinwo, et al.
Published: (2023-03-01)