Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)

The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a r...

Full description

Bibliographic Details
Main Authors:	Blessius Sheldo Putra Laksono, Tio Syaifuddin, Fitri Utaminingrum
Format:	Article
Language:	English
Published:	University of Brawijaya 2024-04-01
Series:	JITeCS (Journal of Information Technology and Computer Science)
Online Access:	https://jitecs.ub.ac.id/index.php/jitecs/article/view/579

_version_	1797226229439397888
author	Blessius Sheldo Putra Laksono Tio Syaifuddin Fitri Utaminingrum
author_facet	Blessius Sheldo Putra Laksono Tio Syaifuddin Fitri Utaminingrum
author_sort	Blessius Sheldo Putra Laksono
collection	DOAJ
description	The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a really big potential to be the medium to spread the virus because the same surface is used by several people. Speech recognition can be used to solve this problem. In this study, Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN) are going to be used as the extraction feature and classification method, respectively. We classify the sound signal into two classes (“buka” and “tutup”). People who want to open or close the door just need to say a specific command. This can be helpful to minimize the risk of COVID transmission. A CNN model is developed and fed with an audio file from a curated dataset for training and testing. With this system, we have successfully trained the model with an accuracy of 89% using an epoch of 50 and batch size of 32 as the parameters with a dataset distribution of 8:2 for training and validation. We believe this study will be influential in developing automated door systems using speech recognition, especially in the Indonesian language.
first_indexed	2024-04-24T14:21:35Z
format	Article
id	doaj.art-849016418a1f4731949232b1dcfd14bb
institution	Directory Open Access Journal
issn	2540-9433 2540-9824
language	English
last_indexed	2024-04-24T14:21:35Z
publishDate	2024-04-01
publisher	University of Brawijaya
record_format	Article
series	JITeCS (Journal of Information Technology and Computer Science)
spelling	doaj.art-849016418a1f4731949232b1dcfd14bb2024-04-03T06:33:28ZengUniversity of BrawijayaJITeCS (Journal of Information Technology and Computer Science)2540-94332540-98242024-04-019110.25126/jitecs.202491579Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)Blessius Sheldo Putra Laksono0Tio Syaifuddin1Fitri Utaminingrum2Brawijaya University, MalangBrawijaya University, MalangBrawijaya University, Malang The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a really big potential to be the medium to spread the virus because the same surface is used by several people. Speech recognition can be used to solve this problem. In this study, Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN) are going to be used as the extraction feature and classification method, respectively. We classify the sound signal into two classes (“buka” and “tutup”). People who want to open or close the door just need to say a specific command. This can be helpful to minimize the risk of COVID transmission. A CNN model is developed and fed with an audio file from a curated dataset for training and testing. With this system, we have successfully trained the model with an accuracy of 89% using an epoch of 50 and batch size of 32 as the parameters with a dataset distribution of 8:2 for training and validation. We believe this study will be influential in developing automated door systems using speech recognition, especially in the Indonesian language. https://jitecs.ub.ac.id/index.php/jitecs/article/view/579
spellingShingle	Blessius Sheldo Putra Laksono Tio Syaifuddin Fitri Utaminingrum Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN) JITeCS (Journal of Information Technology and Computer Science)
title	Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_full	Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_fullStr	Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_full_unstemmed	Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_short	Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_sort	voice recognition to classify buka and tutup sound to open and closes door using mel frequency cepstral coefficients mfcc and convolutional neural network cnn
url	https://jitecs.ub.ac.id/index.php/jitecs/article/view/579
work_keys_str_mv	AT blessiussheldoputralaksono voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn AT tiosyaifuddin voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn AT fitriutaminingrum voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn

Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)

Similar Items