Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion

The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification sys...

Full description

Bibliographic Details
Main Authors: Vahid Hajihashemi, Abdorreza Alavi Gharahbagh, Pedro Miguel Cruz, Marta Campos Ferreira, José J. M. Machado, João Manuel R. S. Tavares
Format: Article
Language:English
Published: MDPI AG 2022-02-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/4/1535
_version_ 1797476719164129280
author Vahid Hajihashemi
Abdorreza Alavi Gharahbagh
Pedro Miguel Cruz
Marta Campos Ferreira
José J. M. Machado
João Manuel R. S. Tavares
author_facet Vahid Hajihashemi
Abdorreza Alavi Gharahbagh
Pedro Miguel Cruz
Marta Campos Ferreira
José J. M. Machado
João Manuel R. S. Tavares
author_sort Vahid Hajihashemi
collection DOAJ
description The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods.
first_indexed 2024-03-09T21:06:38Z
format Article
id doaj.art-e6cd1177e9344346bbb73667fb337b26
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T21:06:38Z
publishDate 2022-02-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-e6cd1177e9344346bbb73667fb337b262023-11-23T22:01:06ZengMDPI AGSensors1424-82202022-02-01224153510.3390/s22041535Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear FusionVahid Hajihashemi0Abdorreza Alavi Gharahbagh1Pedro Miguel Cruz2Marta Campos Ferreira3José J. M. Machado4João Manuel R. S. Tavares5Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, PortugalFaculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, PortugalBosch Security Systems S.A., EN109-Zona Industrial de Ovar, 3880-080 Ovar, PortugalFaculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, PortugalDepartamento de Engenharia Mecânica, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, PortugalDepartamento de Engenharia Mecânica, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, PortugalThe analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods.https://www.mdpi.com/1424-8220/22/4/1535urban sounds classificationstereo signalsound base intelligent systemmachine learninggenetic algorithm
spellingShingle Vahid Hajihashemi
Abdorreza Alavi Gharahbagh
Pedro Miguel Cruz
Marta Campos Ferreira
José J. M. Machado
João Manuel R. S. Tavares
Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
Sensors
urban sounds classification
stereo signal
sound base intelligent system
machine learning
genetic algorithm
title Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_full Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_fullStr Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_full_unstemmed Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_short Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_sort binaural acoustic scene classification using wavelet scattering parallel ensemble classifiers and nonlinear fusion
topic urban sounds classification
stereo signal
sound base intelligent system
machine learning
genetic algorithm
url https://www.mdpi.com/1424-8220/22/4/1535
work_keys_str_mv AT vahidhajihashemi binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT abdorrezaalavigharahbagh binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT pedromiguelcruz binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT martacamposferreira binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT josejmmachado binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT joaomanuelrstavares binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion