Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)

The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a r...

Full description

Bibliographic Details
Main Authors: Blessius Sheldo Putra Laksono, Tio Syaifuddin, Fitri Utaminingrum
Format: Article
Language:English
Published: University of Brawijaya 2024-04-01
Series:JITeCS (Journal of Information Technology and Computer Science)
Online Access:https://jitecs.ub.ac.id/index.php/jitecs/article/view/579
_version_ 1797226229439397888
author Blessius Sheldo Putra Laksono
Tio Syaifuddin
Fitri Utaminingrum
author_facet Blessius Sheldo Putra Laksono
Tio Syaifuddin
Fitri Utaminingrum
author_sort Blessius Sheldo Putra Laksono
collection DOAJ
description The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a really big potential to be the medium to spread the virus because the same surface is used by several people. Speech recognition can be used to solve this problem. In this study, Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN) are going to be used as the extraction feature and classification method, respectively. We classify the sound signal into two classes (“buka” and “tutup”). People who want to open or close the door just need to say a specific command. This can be helpful to minimize the risk of COVID transmission. A CNN model is developed and fed with an audio file from a curated dataset for training and testing. With this system, we have successfully trained the model with an accuracy of 89% using an epoch of 50 and batch size of 32 as the parameters with a dataset distribution of 8:2 for training and validation. We believe this study will be influential in developing automated door systems using speech recognition, especially in the Indonesian language.
first_indexed 2024-04-24T14:21:35Z
format Article
id doaj.art-849016418a1f4731949232b1dcfd14bb
institution Directory Open Access Journal
issn 2540-9433
2540-9824
language English
last_indexed 2024-04-24T14:21:35Z
publishDate 2024-04-01
publisher University of Brawijaya
record_format Article
series JITeCS (Journal of Information Technology and Computer Science)
spelling doaj.art-849016418a1f4731949232b1dcfd14bb2024-04-03T06:33:28ZengUniversity of BrawijayaJITeCS (Journal of Information Technology and Computer Science)2540-94332540-98242024-04-019110.25126/jitecs.202491579Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)Blessius Sheldo Putra Laksono0Tio Syaifuddin1Fitri Utaminingrum2Brawijaya University, MalangBrawijaya University, MalangBrawijaya University, Malang The consequences of the coronavirus called COVID-19 have been really impactful on society. Many things need to be changed in order to survive this pandemic. People have to avoid physical contact to minimize the probability of getting caught by other people who have been infected. A doorknob has a really big potential to be the medium to spread the virus because the same surface is used by several people. Speech recognition can be used to solve this problem. In this study, Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN) are going to be used as the extraction feature and classification method, respectively. We classify the sound signal into two classes (“buka” and “tutup”). People who want to open or close the door just need to say a specific command. This can be helpful to minimize the risk of COVID transmission. A CNN model is developed and fed with an audio file from a curated dataset for training and testing. With this system, we have successfully trained the model with an accuracy of 89% using an epoch of 50 and batch size of 32 as the parameters with a dataset distribution of 8:2 for training and validation. We believe this study will be influential in developing automated door systems using speech recognition, especially in the Indonesian language. https://jitecs.ub.ac.id/index.php/jitecs/article/view/579
spellingShingle Blessius Sheldo Putra Laksono
Tio Syaifuddin
Fitri Utaminingrum
Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
JITeCS (Journal of Information Technology and Computer Science)
title Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_full Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_fullStr Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_full_unstemmed Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_short Voice Recognition to Classify “Buka” and “Tutup” Sound to Open and Closes Door Using Mel Frequency Cepstral Coefficients (MFCC) and Convolutional Neural Network (CNN)
title_sort voice recognition to classify buka and tutup sound to open and closes door using mel frequency cepstral coefficients mfcc and convolutional neural network cnn
url https://jitecs.ub.ac.id/index.php/jitecs/article/view/579
work_keys_str_mv AT blessiussheldoputralaksono voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn
AT tiosyaifuddin voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn
AT fitriutaminingrum voicerecognitiontoclassifybukaandtutupsoundtoopenandclosesdoorusingmelfrequencycepstralcoefficientsmfccandconvolutionalneuralnetworkcnn