A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform

The subject of this study is methods for improving the efficiency of semantic coding of speech signals. The purpose of this study is to develop a method for improving the efficiency of semantic coding of speech signals. Coding efficiency refers to the reduction of the information transmission rate w...

Full description

Bibliographic Details
Main Authors:	Oleksandr Lavrynenko, Denys Bakhtiiarov, Vitalii Kurushkin, Serhii Zavhorodnii, Veniamin Antonov, Petro Stanko
Format:	Article
Language:	English
Published:	National Aerospace University «Kharkiv Aviation Institute» 2023-09-01
Series:	Радіоелектронні і комп'ютерні системи
Subjects:	semantic features of speech signals mel-frequency cepstral coefficients adaptive spectral analysis empirical wavelet transform adaptive wavelet-filters meyer functions of internal empirical modes hilbert spectral analysis optimal threshold proces
Online Access:	http://nti.khai.edu/ojs/index.php/reks/article/view/2121

_version_	1797649193428320256
author	Oleksandr Lavrynenko Denys Bakhtiiarov Vitalii Kurushkin Serhii Zavhorodnii Veniamin Antonov Petro Stanko
author_facet	Oleksandr Lavrynenko Denys Bakhtiiarov Vitalii Kurushkin Serhii Zavhorodnii Veniamin Antonov Petro Stanko
author_sort	Oleksandr Lavrynenko
collection	DOAJ
description	The subject of this study is methods for improving the efficiency of semantic coding of speech signals. The purpose of this study is to develop a method for improving the efficiency of semantic coding of speech signals. Coding efficiency refers to the reduction of the information transmission rate with a given probability of error-free recognition of semantic features of speech signals, which will significantly reduce the required source bandwidth, thereby increasing the communication channel bandwidth. To achieve this goal, it is necessary to solve the following scientific tasks: (1) to investigate a known method for improving the efficiency of semantic coding of speech signals based on mel-frequency cepstral coefficients; (2) to substantiate the effectiveness of using the adaptive empirical wavelet transform in the tasks of multiple-scale analysis and semantic coding of speech signals; (3) to develop a method of semantic coding of speech signals based on adaptive empirical wavelet transform with further application of Hilbert spectral analysis and optimal thresholding; and (4) to perform an objective quantitative assessment of the increase in the efficiency of the developed method of semantic coding of speech signals in contrast to the existing method. The following scientific results were obtained during the study: a method of semantic coding of speech signals based on empirical wavelet transform is developed for the first time, which differs from existing methods by constructing a set of adaptive bandpass Meyer wavelet filters with further application of Hilbert spectral analysis to find the instantaneous amplitudes and frequencies of the functions of internal empirical modes, which will allow the identification of semantic features of speech signals and increase the efficiency of their coding; for the first time, it is proposed to use the method of adaptive empirical wavelet transform in the tasks of multiple-scale analysis and semantic coding of speech signals, which will increase the efficiency of spectral analysis by decomposing the high-frequency speech oscillation into its low-frequency components, namely internal empirical modes; the method of semantic coding of speech signals based on mel-frequency cepstral coefficients was further developed, but using the basic principles of adaptive spectral analysis with the help of empirical wavelet transform, which increases the efficiency of this method. Conclusions: We developed a method for semantic coding of speech signals based on empirical wavelet transform, which reduces the encoding rate from 320 to 192 bps and the required bandwidth from 40 to 24 Hz with a probability of error-free recognition of approximately 0.96 (96%) and a signal-to-noise ratio of 48 dB, according to which its efficiency is increased by 1.6 times as compared to the existing method. We developed an algorithm for semantic coding of speech signals based on empirical wavelet transform and its software implementation in the MATLAB R2022b programing language.
first_indexed	2024-03-11T15:42:16Z
format	Article
id	doaj.art-84a53e5396a8483984cb448aeaef98ad
institution	Directory Open Access Journal
issn	1814-4225 2663-2012
language	English
last_indexed	2024-03-11T15:42:16Z
publishDate	2023-09-01
publisher	National Aerospace University «Kharkiv Aviation Institute»
record_format	Article
series	Радіоелектронні і комп'ютерні системи
spelling	doaj.art-84a53e5396a8483984cb448aeaef98ad2023-10-26T10:20:02ZengNational Aerospace University «Kharkiv Aviation Institute»Радіоелектронні і комп'ютерні системи1814-42252663-20122023-09-010310112410.32620/reks.2023.3.091991A method for extracting the semantic features of speech signal recognition based on empirical wavelet transformOleksandr Lavrynenko0Denys Bakhtiiarov1Vitalii Kurushkin2Serhii Zavhorodnii3Veniamin Antonov4Petro Stanko5National Aviation University, KyivNational Aviation University, KyivNational Aviation University, KyivNational Aviation University, KyivNational Aviation University, KyivNational Aviation University, KyivThe subject of this study is methods for improving the efficiency of semantic coding of speech signals. The purpose of this study is to develop a method for improving the efficiency of semantic coding of speech signals. Coding efficiency refers to the reduction of the information transmission rate with a given probability of error-free recognition of semantic features of speech signals, which will significantly reduce the required source bandwidth, thereby increasing the communication channel bandwidth. To achieve this goal, it is necessary to solve the following scientific tasks: (1) to investigate a known method for improving the efficiency of semantic coding of speech signals based on mel-frequency cepstral coefficients; (2) to substantiate the effectiveness of using the adaptive empirical wavelet transform in the tasks of multiple-scale analysis and semantic coding of speech signals; (3) to develop a method of semantic coding of speech signals based on adaptive empirical wavelet transform with further application of Hilbert spectral analysis and optimal thresholding; and (4) to perform an objective quantitative assessment of the increase in the efficiency of the developed method of semantic coding of speech signals in contrast to the existing method. The following scientific results were obtained during the study: a method of semantic coding of speech signals based on empirical wavelet transform is developed for the first time, which differs from existing methods by constructing a set of adaptive bandpass Meyer wavelet filters with further application of Hilbert spectral analysis to find the instantaneous amplitudes and frequencies of the functions of internal empirical modes, which will allow the identification of semantic features of speech signals and increase the efficiency of their coding; for the first time, it is proposed to use the method of adaptive empirical wavelet transform in the tasks of multiple-scale analysis and semantic coding of speech signals, which will increase the efficiency of spectral analysis by decomposing the high-frequency speech oscillation into its low-frequency components, namely internal empirical modes; the method of semantic coding of speech signals based on mel-frequency cepstral coefficients was further developed, but using the basic principles of adaptive spectral analysis with the help of empirical wavelet transform, which increases the efficiency of this method. Conclusions: We developed a method for semantic coding of speech signals based on empirical wavelet transform, which reduces the encoding rate from 320 to 192 bps and the required bandwidth from 40 to 24 Hz with a probability of error-free recognition of approximately 0.96 (96%) and a signal-to-noise ratio of 48 dB, according to which its efficiency is increased by 1.6 times as compared to the existing method. We developed an algorithm for semantic coding of speech signals based on empirical wavelet transform and its software implementation in the MATLAB R2022b programing language.http://nti.khai.edu/ojs/index.php/reks/article/view/2121semantic features of speech signalsmel-frequency cepstral coefficientsadaptive spectral analysisempirical wavelet transformadaptive wavelet-filters meyerfunctions of internal empirical modeshilbert spectral analysisoptimal threshold proces
spellingShingle	Oleksandr Lavrynenko Denys Bakhtiiarov Vitalii Kurushkin Serhii Zavhorodnii Veniamin Antonov Petro Stanko A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform Радіоелектронні і комп'ютерні системи semantic features of speech signals mel-frequency cepstral coefficients adaptive spectral analysis empirical wavelet transform adaptive wavelet-filters meyer functions of internal empirical modes hilbert spectral analysis optimal threshold proces
title	A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
title_full	A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
title_fullStr	A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
title_full_unstemmed	A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
title_short	A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
title_sort	method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
topic	semantic features of speech signals mel-frequency cepstral coefficients adaptive spectral analysis empirical wavelet transform adaptive wavelet-filters meyer functions of internal empirical modes hilbert spectral analysis optimal threshold proces
url	http://nti.khai.edu/ojs/index.php/reks/article/view/2121
work_keys_str_mv	AT oleksandrlavrynenko amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT denysbakhtiiarov amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT vitaliikurushkin amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT serhiizavhorodnii amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT veniaminantonov amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT petrostanko amethodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT oleksandrlavrynenko methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT denysbakhtiiarov methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT vitaliikurushkin methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT serhiizavhorodnii methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT veniaminantonov methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform AT petrostanko methodforextractingthesemanticfeaturesofspeechsignalrecognitionbasedonempiricalwavelettransform

A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform

Similar Items