Curating a strongly labelled urban sound dataset for deep neural network training
The success of deep learning relies on massive training data. However, obtaining large-scale labeled data is not easy, which is expensive and time-consuming. Addressing the complexities of urban soundscapes, this research explores the use of real-world audio data from Singapore to enhance urban soun...
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project (FYP) |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/177273 |
_version_ | 1811694692988878848 |
---|---|
author | Wang, Qingqing |
author2 | Gan Woon Seng |
author_facet | Gan Woon Seng Wang, Qingqing |
author_sort | Wang, Qingqing |
collection | NTU |
description | The success of deep learning relies on massive training data. However, obtaining large-scale labeled data is not easy, which is expensive and time-consuming. Addressing the complexities of urban soundscapes, this research explores the use of real-world audio data from Singapore to enhance urban sound classification. The study identifies the shortcomings of existing sound datasets, which often rely on synthetic or controlled environments. But SINGA:PURA Dataset leverages a comprehensive, real-world dataset to more accurately represent the intricate and dynamic nature of urban noise. This study utilizes advanced machine learning techniques, specifically semi-supervised learning methods like pseudo-labeling, to improve the accuracy and reliability of sound classification systems. This approach aims to develop more robust models, making significant contributions to environmental sound analysis in urban areas where managing noise pollution is critically important. |
first_indexed | 2024-10-01T07:11:37Z |
format | Final Year Project (FYP) |
id | ntu-10356/177273 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T07:11:37Z |
publishDate | 2024 |
publisher | Nanyang Technological University |
record_format | dspace |
spelling | ntu-10356/1772732024-05-31T15:44:17Z Curating a strongly labelled urban sound dataset for deep neural network training Wang, Qingqing Gan Woon Seng School of Electrical and Electronic Engineering EWSGAN@ntu.edu.sg Engineering The success of deep learning relies on massive training data. However, obtaining large-scale labeled data is not easy, which is expensive and time-consuming. Addressing the complexities of urban soundscapes, this research explores the use of real-world audio data from Singapore to enhance urban sound classification. The study identifies the shortcomings of existing sound datasets, which often rely on synthetic or controlled environments. But SINGA:PURA Dataset leverages a comprehensive, real-world dataset to more accurately represent the intricate and dynamic nature of urban noise. This study utilizes advanced machine learning techniques, specifically semi-supervised learning methods like pseudo-labeling, to improve the accuracy and reliability of sound classification systems. This approach aims to develop more robust models, making significant contributions to environmental sound analysis in urban areas where managing noise pollution is critically important. Bachelor's degree 2024-05-27T11:49:58Z 2024-05-27T11:49:58Z 2024 Final Year Project (FYP) Wang, Q. (2024). Curating a strongly labelled urban sound dataset for deep neural network training. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177273 https://hdl.handle.net/10356/177273 en application/pdf Nanyang Technological University |
spellingShingle | Engineering Wang, Qingqing Curating a strongly labelled urban sound dataset for deep neural network training |
title | Curating a strongly labelled urban sound dataset for deep neural network training |
title_full | Curating a strongly labelled urban sound dataset for deep neural network training |
title_fullStr | Curating a strongly labelled urban sound dataset for deep neural network training |
title_full_unstemmed | Curating a strongly labelled urban sound dataset for deep neural network training |
title_short | Curating a strongly labelled urban sound dataset for deep neural network training |
title_sort | curating a strongly labelled urban sound dataset for deep neural network training |
topic | Engineering |
url | https://hdl.handle.net/10356/177273 |
work_keys_str_mv | AT wangqingqing curatingastronglylabelledurbansounddatasetfordeepneuralnetworktraining |