Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory

In recent years, a lot of researchers’ attentions were concentrating on imaginary speech understanding, decoding, and even recognition. Speech is a complex mechanism, which involves multiple brain areas in the process of production, planning, and precise control of a large number of muscles and arti...

Full description

Bibliographic Details
Main Authors: Ana-Luiza Rusnac, Ovidiu Grigore
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/22/11873
_version_ 1827644969718906880
author Ana-Luiza Rusnac
Ovidiu Grigore
author_facet Ana-Luiza Rusnac
Ovidiu Grigore
author_sort Ana-Luiza Rusnac
collection DOAJ
description In recent years, a lot of researchers’ attentions were concentrating on imaginary speech understanding, decoding, and even recognition. Speech is a complex mechanism, which involves multiple brain areas in the process of production, planning, and precise control of a large number of muscles and articulation involved in the actual utterance. This paper proposes an intelligent imaginary speech recognition system of eleven different utterances, seven phonemes, and four words from the Kara One database. We showed, during our research, that the feature space of the cross-covariance in frequency domain offers a better perspective of the imaginary speech by computing LDA for 2D representation of the feature space, in comparison to cross-covariance in the time domain and the raw signals without any processing. In the classification stage, we used a CNNLSTM neural network and obtained a performance of 43% accuracy for all eleven different utterances. The developed system was meant to be a subject’s shared system. We also showed that, using the channels corresponding to the anatomical structures of the brain involved in speech production, i.e., Broca area, primary motor cortex, and secondary motor cortex, 93% of information is preserved, obtaining 40% accuracy by using 29 electrodes out of the initial 62.
first_indexed 2024-03-09T18:28:56Z
format Article
id doaj.art-06818926b68b43acb6c3718904e53575
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T18:28:56Z
publishDate 2022-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-06818926b68b43acb6c3718904e535752023-11-24T07:42:18ZengMDPI AGApplied Sciences2076-34172022-11-0112221187310.3390/app122211873Imaginary Speech Recognition Using a Convolutional Network with Long-Short MemoryAna-Luiza Rusnac0Ovidiu Grigore1Department of Applied Electronics and Information Engineering, Faculty of Electronics, Telecommunications and Information Technology, Polytechnic University of Bucharest, 060042 Bucharest, RomaniaDepartment of Applied Electronics and Information Engineering, Faculty of Electronics, Telecommunications and Information Technology, Polytechnic University of Bucharest, 060042 Bucharest, RomaniaIn recent years, a lot of researchers’ attentions were concentrating on imaginary speech understanding, decoding, and even recognition. Speech is a complex mechanism, which involves multiple brain areas in the process of production, planning, and precise control of a large number of muscles and articulation involved in the actual utterance. This paper proposes an intelligent imaginary speech recognition system of eleven different utterances, seven phonemes, and four words from the Kara One database. We showed, during our research, that the feature space of the cross-covariance in frequency domain offers a better perspective of the imaginary speech by computing LDA for 2D representation of the feature space, in comparison to cross-covariance in the time domain and the raw signals without any processing. In the classification stage, we used a CNNLSTM neural network and obtained a performance of 43% accuracy for all eleven different utterances. The developed system was meant to be a subject’s shared system. We also showed that, using the channels corresponding to the anatomical structures of the brain involved in speech production, i.e., Broca area, primary motor cortex, and secondary motor cortex, 93% of information is preserved, obtaining 40% accuracy by using 29 electrodes out of the initial 62.https://www.mdpi.com/2076-3417/12/22/11873imaginary speech recognitionelectroencephalographylong-short term memory neural networkconvolutional neural networkcross-covariance featuresspeech brain areas
spellingShingle Ana-Luiza Rusnac
Ovidiu Grigore
Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
Applied Sciences
imaginary speech recognition
electroencephalography
long-short term memory neural network
convolutional neural network
cross-covariance features
speech brain areas
title Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
title_full Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
title_fullStr Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
title_full_unstemmed Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
title_short Imaginary Speech Recognition Using a Convolutional Network with Long-Short Memory
title_sort imaginary speech recognition using a convolutional network with long short memory
topic imaginary speech recognition
electroencephalography
long-short term memory neural network
convolutional neural network
cross-covariance features
speech brain areas
url https://www.mdpi.com/2076-3417/12/22/11873
work_keys_str_mv AT analuizarusnac imaginaryspeechrecognitionusingaconvolutionalnetworkwithlongshortmemory
AT ovidiugrigore imaginaryspeechrecognitionusingaconvolutionalnetworkwithlongshortmemory