Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions

Automatic emotion recognition (AER) systems are burgeoning and systems based on either audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have shown to improve overall AER accuracy and to also provide some robustness against artifacts and missing data. Collecting...

Descripció completa

Dades bibliogràfiques
Autors principals: Shruti Kshirsagar, Anurag Pendyala, Tiago H. Falk
Format: Article
Idioma:English
Publicat: Frontiers Media S.A. 2023-03-01
Col·lecció:Frontiers in Computer Science
Matèries:
Accés en línia:https://www.frontiersin.org/articles/10.3389/fcomp.2023.1039261/full

Ítems similars