Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions
Automatic emotion recognition (AER) systems are burgeoning and systems based on either audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have shown to improve overall AER accuracy and to also provide some robustness against artifacts and missing data. Collecting...
Autors principals: | Shruti Kshirsagar, Anurag Pendyala, Tiago H. Falk |
---|---|
Format: | Article |
Idioma: | English |
Publicat: |
Frontiers Media S.A.
2023-03-01
|
Col·lecció: | Frontiers in Computer Science |
Matèries: | |
Accés en línia: | https://www.frontiersin.org/articles/10.3389/fcomp.2023.1039261/full |
Ítems similars
-
Cross-Language Speech Emotion Recognition Using Bag-of-Word Representations, Domain Adaptation, and Data Augmentation
per: Shruti Kshirsagar, et al.
Publicat: (2022-08-01) -
Multimodal Emotion Recognition Fusion Analysis Adapting BERT With Heterogeneous Feature Unification
per: Sanghyun Lee, et al.
Publicat: (2021-01-01) -
Multimodal Emotion Detection via Attention-Based Fusion of Extracted Facial and Speech Features
per: Dilnoza Mamieva, et al.
Publicat: (2023-06-01) -
Augmenting Multimodal Content Representation with Transformers for Misinformation Detection
per: Jenq-Haur Wang, et al.
Publicat: (2024-10-01) -
Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
per: Jinan Aljawazeri, et al.
Publicat: (2024-03-01)