Emotional Speech Recognition Method Based on Word Transcription

The emotional speech recognition method presented in this article was applied to recognize the emotions of students during online exams in distance learning due to COVID-19. The purpose of this method is to recognize emotions in spoken speech through the knowledge base of emotionally charged words,...

Full description

Bibliographic Details
Main Authors: Gulmira Bekmanova, Banu Yergesh, Altynbek Sharipbay, Assel Mukanova
Format: Article
Language:English
Published: MDPI AG 2022-03-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/5/1937
_version_ 1827650619722170368
author Gulmira Bekmanova
Banu Yergesh
Altynbek Sharipbay
Assel Mukanova
author_facet Gulmira Bekmanova
Banu Yergesh
Altynbek Sharipbay
Assel Mukanova
author_sort Gulmira Bekmanova
collection DOAJ
description The emotional speech recognition method presented in this article was applied to recognize the emotions of students during online exams in distance learning due to COVID-19. The purpose of this method is to recognize emotions in spoken speech through the knowledge base of emotionally charged words, which are stored as a code book. The method analyzes human speech for the presence of emotions. To assess the quality of the method, an experiment was conducted for 420 audio recordings. The accuracy of the proposed method is 79.7% for the Kazakh language. The method can be used for different languages and consists of the following tasks: capturing a signal, detecting speech in it, recognizing speech words in a simplified transcription, determining word boundaries, comparing a simplified transcription with a code book, and constructing a hypothesis about the degree of speech emotionality. In case of the presence of emotions, there occurs complete recognition of words and definitions of emotions in speech. The advantage of this method is the possibility of its widespread use since it is not demanding on computational resources. The described method can be applied when there is a need to recognize positive and negative emotions in a crowd, in public transport, schools, universities, etc. The experiment carried out has shown the effectiveness of this method. The results obtained will make it possible in the future to develop devices that begin to record and recognize a speech signal, for example, in the case of detecting negative emotions in sounding speech and, if necessary, transmitting a message about potential threats or riots.
first_indexed 2024-03-09T20:21:37Z
format Article
id doaj.art-b13146f441eb4e08a0b636557f63ee7c
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T20:21:37Z
publishDate 2022-03-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-b13146f441eb4e08a0b636557f63ee7c2023-11-23T23:48:37ZengMDPI AGSensors1424-82202022-03-01225193710.3390/s22051937Emotional Speech Recognition Method Based on Word TranscriptionGulmira Bekmanova0Banu Yergesh1Altynbek Sharipbay2Assel Mukanova3Faculty of Information Technologies, L.N. Gumilyov Eurasian National University, Nur-Sultan 010008, KazakhstanFaculty of Information Technologies, L.N. Gumilyov Eurasian National University, Nur-Sultan 010008, KazakhstanFaculty of Information Technologies, L.N. Gumilyov Eurasian National University, Nur-Sultan 010008, KazakhstanFaculty of Information Technologies, L.N. Gumilyov Eurasian National University, Nur-Sultan 010008, KazakhstanThe emotional speech recognition method presented in this article was applied to recognize the emotions of students during online exams in distance learning due to COVID-19. The purpose of this method is to recognize emotions in spoken speech through the knowledge base of emotionally charged words, which are stored as a code book. The method analyzes human speech for the presence of emotions. To assess the quality of the method, an experiment was conducted for 420 audio recordings. The accuracy of the proposed method is 79.7% for the Kazakh language. The method can be used for different languages and consists of the following tasks: capturing a signal, detecting speech in it, recognizing speech words in a simplified transcription, determining word boundaries, comparing a simplified transcription with a code book, and constructing a hypothesis about the degree of speech emotionality. In case of the presence of emotions, there occurs complete recognition of words and definitions of emotions in speech. The advantage of this method is the possibility of its widespread use since it is not demanding on computational resources. The described method can be applied when there is a need to recognize positive and negative emotions in a crowd, in public transport, schools, universities, etc. The experiment carried out has shown the effectiveness of this method. The results obtained will make it possible in the future to develop devices that begin to record and recognize a speech signal, for example, in the case of detecting negative emotions in sounding speech and, if necessary, transmitting a message about potential threats or riots.https://www.mdpi.com/1424-8220/22/5/1937emotion recognitionspeech recognitioncrowd emotion recognitionaffective computingdistance learninge-learning
spellingShingle Gulmira Bekmanova
Banu Yergesh
Altynbek Sharipbay
Assel Mukanova
Emotional Speech Recognition Method Based on Word Transcription
Sensors
emotion recognition
speech recognition
crowd emotion recognition
affective computing
distance learning
e-learning
title Emotional Speech Recognition Method Based on Word Transcription
title_full Emotional Speech Recognition Method Based on Word Transcription
title_fullStr Emotional Speech Recognition Method Based on Word Transcription
title_full_unstemmed Emotional Speech Recognition Method Based on Word Transcription
title_short Emotional Speech Recognition Method Based on Word Transcription
title_sort emotional speech recognition method based on word transcription
topic emotion recognition
speech recognition
crowd emotion recognition
affective computing
distance learning
e-learning
url https://www.mdpi.com/1424-8220/22/5/1937
work_keys_str_mv AT gulmirabekmanova emotionalspeechrecognitionmethodbasedonwordtranscription
AT banuyergesh emotionalspeechrecognitionmethodbasedonwordtranscription
AT altynbeksharipbay emotionalspeechrecognitionmethodbasedonwordtranscription
AT asselmukanova emotionalspeechrecognitionmethodbasedonwordtranscription