Classification on distressed sounds with CNN/RNN

Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds...

Full description

Bibliographic Details
Main Author:	Guo, Xihuang
Other Authors:	Gan Woon Seng
Format:	Final Year Project (FYP)
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/139181

_version_	1826113389195165696
author	Guo, Xihuang
author2	Gan Woon Seng
author_facet	Gan Woon Seng Guo, Xihuang
author_sort	Guo, Xihuang
collection	NTU
description	Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds using Machine Learning. This project can be integrated with a sound-based security system to help those people who need urgent help or assistance. In this report, it focuses on how to classify a distressed sound using the Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In particular, the dataset was collected for 3 distressed sounds, “Help”, “Crying” and “Screaming”, and then built a model to determine which distressed sound among them. The model to be implemented is a VGG which is widely used in audio classification. The report shows how to convert an audio classification problem to image recognition, where the fully developed techniques of CNN and RNN can be applied better. In the end, the performance of these two networks was evaluated based on several properties. CNN model performs better in overall with 94% training accuracy and 85% testing accuracy.
first_indexed	2024-10-01T03:22:25Z
format	Final Year Project (FYP)
id	ntu-10356/139181
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T03:22:25Z
publishDate	2020
publisher	Nanyang Technological University
record_format	dspace
spelling	ntu-10356/1391812023-07-07T18:52:37Z Classification on distressed sounds with CNN/RNN Guo, Xihuang Gan Woon Seng School of Electrical and Electronic Engineering Smart Nation TRANS Lab Linus Ng Junjia EWSGAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds using Machine Learning. This project can be integrated with a sound-based security system to help those people who need urgent help or assistance. In this report, it focuses on how to classify a distressed sound using the Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In particular, the dataset was collected for 3 distressed sounds, “Help”, “Crying” and “Screaming”, and then built a model to determine which distressed sound among them. The model to be implemented is a VGG which is widely used in audio classification. The report shows how to convert an audio classification problem to image recognition, where the fully developed techniques of CNN and RNN can be applied better. In the end, the performance of these two networks was evaluated based on several properties. CNN model performs better in overall with 94% training accuracy and 85% testing accuracy. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-18T01:43:52Z 2020-05-18T01:43:52Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139181 en application/pdf Nanyang Technological University
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Guo, Xihuang Classification on distressed sounds with CNN/RNN
title	Classification on distressed sounds with CNN/RNN
title_full	Classification on distressed sounds with CNN/RNN
title_fullStr	Classification on distressed sounds with CNN/RNN
title_full_unstemmed	Classification on distressed sounds with CNN/RNN
title_short	Classification on distressed sounds with CNN/RNN
title_sort	classification on distressed sounds with cnn rnn
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
url	https://hdl.handle.net/10356/139181
work_keys_str_mv	AT guoxihuang classificationondistressedsoundswithcnnrnn

Classification on distressed sounds with CNN/RNN

Similar Items