Improving CNN-based solutions for emotion recognition using evolutionary algorithms

AI-based approaches, especially deep learning have made remarkable achievements in Speech Emotion Recognition (SER). Needless to say, Convolutional Neural Networks (CNNs) have been the backbone of many of these solutions. Although the use of CNNs have resulted in high performing models, building the...

Full description

Bibliographic Details
Main Authors: Parsa Mohammadrezaei, Mohammad Aminan, Mohammad Soltanian, Keivan Borna
Format: Article
Language:English
Published: Elsevier 2023-05-01
Series:Results in Applied Mathematics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590037423000067
_version_ 1797815485434167296
author Parsa Mohammadrezaei
Mohammad Aminan
Mohammad Soltanian
Keivan Borna
author_facet Parsa Mohammadrezaei
Mohammad Aminan
Mohammad Soltanian
Keivan Borna
author_sort Parsa Mohammadrezaei
collection DOAJ
description AI-based approaches, especially deep learning have made remarkable achievements in Speech Emotion Recognition (SER). Needless to say, Convolutional Neural Networks (CNNs) have been the backbone of many of these solutions. Although the use of CNNs have resulted in high performing models, building them needs domain knowledge and direct human intervention. The same issue arises while improving a model. To solve this problem, we use techniques that were firstly introduced in Neural Architecture Search (NAS) and use a genetic process to search for models with improved accuracy. More specifically, we insert blocks with dynamic structures in between the layers of an already existing model and then use genetic operations (i.e. selection, mutation, and crossover) to find the best performing structures. To validate our method, we use this algorithm to improve architectures by searching on the Berlin Database of Emotional Speech (EMODB). The experimental results show at least 1.7% performance improvement in terms of Accuracy on EMODB test set.
first_indexed 2024-03-13T08:23:25Z
format Article
id doaj.art-1ba2574996df485fa9e192a8b367a373
institution Directory Open Access Journal
issn 2590-0374
language English
last_indexed 2024-03-13T08:23:25Z
publishDate 2023-05-01
publisher Elsevier
record_format Article
series Results in Applied Mathematics
spelling doaj.art-1ba2574996df485fa9e192a8b367a3732023-05-31T04:47:30ZengElsevierResults in Applied Mathematics2590-03742023-05-0118100360Improving CNN-based solutions for emotion recognition using evolutionary algorithmsParsa Mohammadrezaei0Mohammad Aminan1Mohammad Soltanian2Keivan Borna3Faculty of Mathematical Sciences and Computer, Kharazmi University, Tehran, IranFaculty of Mathematical Sciences and Computer, Kharazmi University, Tehran, IranFaculty of Mathematical Sciences and Computer, Kharazmi University, Tehran, IranCorresponding author.; Faculty of Mathematical Sciences and Computer, Kharazmi University, Tehran, IranAI-based approaches, especially deep learning have made remarkable achievements in Speech Emotion Recognition (SER). Needless to say, Convolutional Neural Networks (CNNs) have been the backbone of many of these solutions. Although the use of CNNs have resulted in high performing models, building them needs domain knowledge and direct human intervention. The same issue arises while improving a model. To solve this problem, we use techniques that were firstly introduced in Neural Architecture Search (NAS) and use a genetic process to search for models with improved accuracy. More specifically, we insert blocks with dynamic structures in between the layers of an already existing model and then use genetic operations (i.e. selection, mutation, and crossover) to find the best performing structures. To validate our method, we use this algorithm to improve architectures by searching on the Berlin Database of Emotional Speech (EMODB). The experimental results show at least 1.7% performance improvement in terms of Accuracy on EMODB test set.http://www.sciencedirect.com/science/article/pii/S2590037423000067Convolutional Neural NetworksDeep learningGenetic algorithmsEvolutionary optimization
spellingShingle Parsa Mohammadrezaei
Mohammad Aminan
Mohammad Soltanian
Keivan Borna
Improving CNN-based solutions for emotion recognition using evolutionary algorithms
Results in Applied Mathematics
Convolutional Neural Networks
Deep learning
Genetic algorithms
Evolutionary optimization
title Improving CNN-based solutions for emotion recognition using evolutionary algorithms
title_full Improving CNN-based solutions for emotion recognition using evolutionary algorithms
title_fullStr Improving CNN-based solutions for emotion recognition using evolutionary algorithms
title_full_unstemmed Improving CNN-based solutions for emotion recognition using evolutionary algorithms
title_short Improving CNN-based solutions for emotion recognition using evolutionary algorithms
title_sort improving cnn based solutions for emotion recognition using evolutionary algorithms
topic Convolutional Neural Networks
Deep learning
Genetic algorithms
Evolutionary optimization
url http://www.sciencedirect.com/science/article/pii/S2590037423000067
work_keys_str_mv AT parsamohammadrezaei improvingcnnbasedsolutionsforemotionrecognitionusingevolutionaryalgorithms
AT mohammadaminan improvingcnnbasedsolutionsforemotionrecognitionusingevolutionaryalgorithms
AT mohammadsoltanian improvingcnnbasedsolutionsforemotionrecognitionusingevolutionaryalgorithms
AT keivanborna improvingcnnbasedsolutionsforemotionrecognitionusingevolutionaryalgorithms