Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions
Automatic emotion recognition (AER) systems are burgeoning and systems based on either audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have shown to improve overall AER accuracy and to also provide some robustness against artifacts and missing data. Collecting...
المؤلفون الرئيسيون: | Shruti Kshirsagar, Anurag Pendyala, Tiago H. Falk |
---|---|
التنسيق: | مقال |
اللغة: | English |
منشور في: |
Frontiers Media S.A.
2023-03-01
|
سلاسل: | Frontiers in Computer Science |
الموضوعات: | |
الوصول للمادة أونلاين: | https://www.frontiersin.org/articles/10.3389/fcomp.2023.1039261/full |
مواد مشابهة
-
Cross-Language Speech Emotion Recognition Using Bag-of-Word Representations, Domain Adaptation, and Data Augmentation
حسب: Shruti Kshirsagar, وآخرون
منشور في: (2022-08-01) -
Multimodal Emotion Recognition Fusion Analysis Adapting BERT With Heterogeneous Feature Unification
حسب: Sanghyun Lee, وآخرون
منشور في: (2021-01-01) -
Multimodal Emotion Detection via Attention-Based Fusion of Extracted Facial and Speech Features
حسب: Dilnoza Mamieva, وآخرون
منشور في: (2023-06-01) -
Augmenting Multimodal Content Representation with Transformers for Misinformation Detection
حسب: Jenq-Haur Wang, وآخرون
منشور في: (2024-10-01) -
Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
حسب: Jinan Aljawazeri, وآخرون
منشور في: (2024-03-01)