Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition

This article addresses observation duplication and lack of whole picture problems for ensemble learning with the attention model integrated convolutional recurrent neural network (ACRNN) in imbalanced speech emotion recognition. Firstly, we introduce Bagging with ACRNN and the observation duplicatio...

Full description

Bibliographic Details
Main Authors: Xusheng Ai, Victor S. Sheng, Wei Fang, Charles X. Ling, Chunhua Li
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9247946/
_version_ 1831746453947547648
author Xusheng Ai
Victor S. Sheng
Wei Fang
Charles X. Ling
Chunhua Li
author_facet Xusheng Ai
Victor S. Sheng
Wei Fang
Charles X. Ling
Chunhua Li
author_sort Xusheng Ai
collection DOAJ
description This article addresses observation duplication and lack of whole picture problems for ensemble learning with the attention model integrated convolutional recurrent neural network (ACRNN) in imbalanced speech emotion recognition. Firstly, we introduce Bagging with ACRNN and the observation duplication problem. Then Redagging is devised and proved to address the observation duplication problem by generating bootstrap samples from permutations of observations. Moreover, Augagging is proposed to get oversampling learner to participate in majority voting for addressing the lack of whole picture problem. Finally, Extensive experiments on IEMOCAP and Emo-DB samples demonstrate the superiority of our proposed methods (i.e., Redagging and Augagging).
first_indexed 2024-12-21T21:15:30Z
format Article
id doaj.art-fa744630c3be4698a8fb3a348ef8262d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-21T21:15:30Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-fa744630c3be4698a8fb3a348ef8262d2022-12-21T18:50:01ZengIEEEIEEE Access2169-35362020-01-01819990919991910.1109/ACCESS.2020.30359109247946Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion RecognitionXusheng Ai0https://orcid.org/0000-0001-5629-9134Victor S. Sheng1https://orcid.org/0000-0003-4960-174XWei Fang2https://orcid.org/0000-0002-5561-7249Charles X. Ling3Chunhua Li4https://orcid.org/0000-0002-5703-774XSoftware and Service Outsourcing College, Suzhou Vocational Institute of Industrial Technology, Suzhou, ChinaDepartment of Computer Science, Texas Tech University, Lubbock, TX, USAJiangsu Engineering Center of Network Monitoring, School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, ChinaDepartment of Computer Science, Western University, London, ON, CanadaSoftware and Service Outsourcing College, Suzhou Vocational Institute of Industrial Technology, Suzhou, ChinaThis article addresses observation duplication and lack of whole picture problems for ensemble learning with the attention model integrated convolutional recurrent neural network (ACRNN) in imbalanced speech emotion recognition. Firstly, we introduce Bagging with ACRNN and the observation duplication problem. Then Redagging is devised and proved to address the observation duplication problem by generating bootstrap samples from permutations of observations. Moreover, Augagging is proposed to get oversampling learner to participate in majority voting for addressing the lack of whole picture problem. Finally, Extensive experiments on IEMOCAP and Emo-DB samples demonstrate the superiority of our proposed methods (i.e., Redagging and Augagging).https://ieeexplore.ieee.org/document/9247946/Imbalance learningensemble learningconvolutional neural networkrecurrent neural networkspeech emotion recognition
spellingShingle Xusheng Ai
Victor S. Sheng
Wei Fang
Charles X. Ling
Chunhua Li
Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
IEEE Access
Imbalance learning
ensemble learning
convolutional neural network
recurrent neural network
speech emotion recognition
title Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
title_full Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
title_fullStr Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
title_full_unstemmed Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
title_short Ensemble Learning With Attention-Integrated Convolutional Recurrent Neural Network for Imbalanced Speech Emotion Recognition
title_sort ensemble learning with attention integrated convolutional recurrent neural network for imbalanced speech emotion recognition
topic Imbalance learning
ensemble learning
convolutional neural network
recurrent neural network
speech emotion recognition
url https://ieeexplore.ieee.org/document/9247946/
work_keys_str_mv AT xushengai ensemblelearningwithattentionintegratedconvolutionalrecurrentneuralnetworkforimbalancedspeechemotionrecognition
AT victorssheng ensemblelearningwithattentionintegratedconvolutionalrecurrentneuralnetworkforimbalancedspeechemotionrecognition
AT weifang ensemblelearningwithattentionintegratedconvolutionalrecurrentneuralnetworkforimbalancedspeechemotionrecognition
AT charlesxling ensemblelearningwithattentionintegratedconvolutionalrecurrentneuralnetworkforimbalancedspeechemotionrecognition
AT chunhuali ensemblelearningwithattentionintegratedconvolutionalrecurrentneuralnetworkforimbalancedspeechemotionrecognition