Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions

Automatic emotion recognition (AER) systems are burgeoning, and systems based on audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have been shown to improve overall AER accuracy and to provide some robustness against artifacts and missing data. Collecting...


Bibliographic Details
Main Authors: Shruti Kshirsagar, Anurag Pendyala, Tiago H. Falk
Format: Article
Language: English
Published: Frontiers Media S.A. 2023-03-01
Series: Frontiers in Computer Science
Online Access: https://www.frontiersin.org/articles/10.3389/fcomp.2023.1039261/full
