A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face

Multimodal emotion recognition (MER) refers to the identification and understanding of human emotional states by combining different signals, including—but not limited to—text, speech, and face cues. MER plays a crucial role in the human–computer interaction (HCI) domain. With the recent progression...

Bibliographic Details
Main Authors: Hailun Lian, Cheng Lu, Sunan Li, Yan Zhao, Chuangao Tang, Yuan Zong
Format: Article
Language: English
Published: MDPI AG 2023-10-01
Series: Entropy
Online Access: https://www.mdpi.com/1099-4300/25/10/1440