An Emotion Speech Synthesis Method Based on VITS

People and things can be connected through the Internet of Things (IoT), and speech synthesis is one of the key technologies. At this stage, end-to-end speech synthesis systems are capable of synthesizing relatively realistic human voices, but the current commonly used parallel text-to-speech suffer...

Bibliographic Details
Main Authors:	Wei Zhao, Zheng Yang
Format:	Article
Language:	English
Published:	MDPI AG 2023-02-01
Series:	Applied Sciences
Subjects:	IoT; Emo-VITS; emotional speech synthesis; emotion feature fusion
Online Access:	https://www.mdpi.com/2076-3417/13/4/2225

Similar Items

MaxMViT-MLP: Multiaxis and Multiscale Vision Transformers Fusion Network for Speech Emotion Recognition
by: Kah Liang Ong, et al.
Published: (2024-01-01)

Lexical Dependent Emotion Detection Using Synthetic Speech Reference
by: Reza Lotfian, et al.
Published: (2019-01-01)

Collective Emotional Intelligence and Group Dynamics Interplay: Can It Be Tangible and Measurable?
by: Eleni Fotopoulou, et al.
Published: (2022-01-01)

Writ Large on Your Face: Observing Emotions Using Automatic Facial Analysis
by: Anja Dieckmann, et al.
Published: (2014-05-01)

IoT-Enabled WBAN and Machine Learning for Speech Emotion Recognition in Patients
by: Damilola D. Olatinwo, et al.
Published: (2023-03-01)

A Feature Fusion Model with Data Augmentation for Speech Emotion Recognition
by: Zhongwen Tu, et al.
Published: (2023-03-01)

Emotion Perception in Members of Norwegian Mensa
by: Jens Egeland
Published: (2019-01-01)

Improved Speaker-Independent Emotion Recognition from Speech Using Two-Stage Feature Reduction
by: Hasrul Mohd Nazid, et al.
Published: (2015-04-01)

Development, Validation and Application of an Inventory on Emo-Sensory Intelligence
by: Reza Pishghadam, et al.
Published: (2020-12-01)

Expression of basic emotions in Estonian parametric text-to-speech synthesis
by: Kairi Tamuri, et al.
Published: (2015-12-01)

A Facial and Vocal Expression Based Comprehensive Framework for Real-Time Student Stress Monitoring in an IoT-Fog-Cloud Environment
by: Madanjit Singh, et al.
Published: (2022-01-01)

Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis
by: Jesin James, et al.
Published: (2023-03-01)

Determination of Formant Features in Czech and Slovak for GMM Emotional Speech Classifier
by: J. Pribil, et al.
Published: (2013-04-01)

Response: Commentary: Emotion Perception in Members of Norwegian Mensa
by: Jens Egeland, et al.
Published: (2019-10-01)

Speech emotion classification using fractal dimension-based features
by: Gintautas Tamulevičius, et al.
Published: (2019-09-01)

Exploration of an Independent Training Framework for Speech Emotion Recognition
by: Shunming Zhong, et al.
Published: (2020-01-01)

Knowledge enhancement for speech emotion recognition via multi-level acoustic feature
by: Huan Zhao, et al.
Published: (2024-12-01)

A Comparison of Machine Learning Algorithms and Feature Sets for Automatic Vocal Emotion Recognition in Speech
by: Cem Doğdu, et al.
Published: (2022-10-01)

Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources
by: Huda Barakat, et al.
Published: (2024-02-01)

ZSE-VITS: A Zero-Shot Expressive Voice Cloning Method Based on VITS
by: Jiaxin Li, et al.
Published: (2023-02-01)

Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data
by: Jialin Zhang, et al.
Published: (2023-05-01)

Design of a Multi-Condition Emotional Speech Synthesizer
by: Sung-Woo Byun, et al.
Published: (2021-01-01)

Speech Emotion Recognition through Hybrid Features and Convolutional Neural Network
by: Ala Saleh Alluhaidan, et al.
Published: (2023-04-01)

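Khảo sát độc lực của virus viêm gan vịt phân lập từ đàn vịt tỉnh Hậu Giang [Investigation of the virulence of duck hepatitis virus isolated from duck flocks in Hậu Giang province]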
by: Phạm Công Uẩn, et al.
Published: (2018-02-01)

A Novel Heterogeneous Parallel Convolution Bi-LSTM for Speech Emotion Recognition
by: Huiyun Zhang, et al.
Published: (2021-10-01)

Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01)

An artificial intelligence-based classifier for musical emotion expression in media education
by: Jue Lian
Published: (2023-07-01)

Research on Speech Emotion Recognition Based on Teager Energy Operator Coefficients and Inverted MFCC Feature Fusion
by: Feifan Wang, et al.
Published: (2023-08-01)

EmoSocio: An open access sociometry-enriched Emotional Intelligence model
by: Eleni Fotopoulou, et al.
Published: (2021-11-01)

Design of the Speech Emotion Recognition Model
by: Hanping Ke, et al.
Published: (2023-07-01)

Human–Computer Interaction with a Real-Time Speech Emotion Recognition with Ensembling Techniques 1D Convolution Neural Network and Attention
by: Waleed Alsabhan
Published: (2023-01-01)

A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food Recognition
by: Guorui Sheng, et al.
Published: (2024-01-01)

Speech emotion recognition based on emotion perception
by: Gang Liu, et al.
Published: (2023-05-01)

Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition
by: Turgut Ozseven, et al.
Published: (2024-04-01)

Intelligence, emotional intelligence, and emo-sensory intelligence: Which one is a better predictor of university students’ academic success?
by: Reza Pishghadam, et al.
Published: (2022-08-01)

Robust Multi-Scenario Speech-Based Emotion Recognition System
by: Fangfang Zhu-Zhou, et al.
Published: (2022-03-01)

Comparative Study on Feature Selection and Fusion Schemes for Emotion Recognition from Speech
by: Santiago Planet, et al.
Published: (2012-09-01)

Emotion research on education public opinion based on text analysis and deep learning
by: Shulin Niu
Published: (2022-10-01)

Improving speech emotion recognition via gender classification
by: Ali Harimi, et al.
Published: (2017-05-01)

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
by: Gintautas Tamulevičius, et al.
Published: (2020-10-01)