Classification on distressed sounds with CNN/RNN

Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds...

Full description

Bibliographic Details
Main Author: Guo, Xihuang
Other Authors: Gan Woon Seng
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/139181
_version_ 1826113389195165696
author Guo, Xihuang
author2 Gan Woon Seng
author_facet Gan Woon Seng
Guo, Xihuang
author_sort Guo, Xihuang
collection NTU
description Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds using Machine Learning. This project can be integrated with a sound-based security system to help those people who need urgent help or assistance. In this report, it focuses on how to classify a distressed sound using the Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In particular, the dataset was collected for 3 distressed sounds, “Help”, “Crying” and “Screaming”, and then built a model to determine which distressed sound among them. The model to be implemented is a VGG which is widely used in audio classification. The report shows how to convert an audio classification problem to image recognition, where the fully developed techniques of CNN and RNN can be applied better. In the end, the performance of these two networks was evaluated based on several properties. CNN model performs better in overall with 94% training accuracy and 85% testing accuracy.
first_indexed 2024-10-01T03:22:25Z
format Final Year Project (FYP)
id ntu-10356/139181
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:22:25Z
publishDate 2020
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1391812023-07-07T18:52:37Z Classification on distressed sounds with CNN/RNN Guo, Xihuang Gan Woon Seng School of Electrical and Electronic Engineering Smart Nation TRANS Lab Linus Ng Junjia EWSGAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Nowadays, people pay more attention to their personal safety due to the improvements in their quality of life. Imagine if you called for a policeman for help, they would be able to arrive within minutes and that could reduce the chance of crime. This can be done by classifying the distressed sounds using Machine Learning. This project can be integrated with a sound-based security system to help those people who need urgent help or assistance. In this report, it focuses on how to classify a distressed sound using the Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In particular, the dataset was collected for 3 distressed sounds, “Help”, “Crying” and “Screaming”, and then built a model to determine which distressed sound among them. The model to be implemented is a VGG which is widely used in audio classification. The report shows how to convert an audio classification problem to image recognition, where the fully developed techniques of CNN and RNN can be applied better. In the end, the performance of these two networks was evaluated based on several properties. CNN model performs better in overall with 94% training accuracy and 85% testing accuracy. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-18T01:43:52Z 2020-05-18T01:43:52Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139181 en application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Guo, Xihuang
Classification on distressed sounds with CNN/RNN
title Classification on distressed sounds with CNN/RNN
title_full Classification on distressed sounds with CNN/RNN
title_fullStr Classification on distressed sounds with CNN/RNN
title_full_unstemmed Classification on distressed sounds with CNN/RNN
title_short Classification on distressed sounds with CNN/RNN
title_sort classification on distressed sounds with cnn rnn
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
url https://hdl.handle.net/10356/139181
work_keys_str_mv AT guoxihuang classificationondistressedsoundswithcnnrnn