Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions
Automatic emotion recognition (AER) systems are burgeoning and systems based on either audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have shown to improve overall AER accuracy and to also provide some robustness against artifacts and missing data. Collecting...
Những tác giả chính: | Shruti Kshirsagar, Anurag Pendyala, Tiago H. Falk |
---|---|
Định dạng: | Bài viết |
Ngôn ngữ: | English |
Được phát hành: |
Frontiers Media S.A.
2023-03-01
|
Loạt: | Frontiers in Computer Science |
Những chủ đề: | |
Truy cập trực tuyến: | https://www.frontiersin.org/articles/10.3389/fcomp.2023.1039261/full |
Những quyển sách tương tự
-
Cross-Language Speech Emotion Recognition Using Bag-of-Word Representations, Domain Adaptation, and Data Augmentation
Bằng: Shruti Kshirsagar, et al.
Được phát hành: (2022-08-01) -
Multimodal Emotion Recognition Fusion Analysis Adapting BERT With Heterogeneous Feature Unification
Bằng: Sanghyun Lee, et al.
Được phát hành: (2021-01-01) -
Multimodal Emotion Detection via Attention-Based Fusion of Extracted Facial and Speech Features
Bằng: Dilnoza Mamieva, et al.
Được phát hành: (2023-06-01) -
Augmenting Multimodal Content Representation with Transformers for Misinformation Detection
Bằng: Jenq-Haur Wang, et al.
Được phát hành: (2024-10-01) -
Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
Bằng: Jinan Aljawazeri, et al.
Được phát hành: (2024-03-01)