Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.

Feature extraction for Acoustic Bird Species Classification (ABSC) tasks has traditionally been based on parametric representations that were specifically developed for speech signals, such as Mel Frequency Cepstral Coefficients (MFCC). However, the discrimination capabilities of these features for...

Full description

Bibliographic Details
Main Authors: Jimmy Ludeña-Choez, Raisa Quispe-Soncco, Ascensión Gallardo-Antolín
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5476267?pdf=render
_version_ 1818161323247140864
author Jimmy Ludeña-Choez
Raisa Quispe-Soncco
Ascensión Gallardo-Antolín
author_facet Jimmy Ludeña-Choez
Raisa Quispe-Soncco
Ascensión Gallardo-Antolín
author_sort Jimmy Ludeña-Choez
collection DOAJ
description Feature extraction for Acoustic Bird Species Classification (ABSC) tasks has traditionally been based on parametric representations that were specifically developed for speech signals, such as Mel Frequency Cepstral Coefficients (MFCC). However, the discrimination capabilities of these features for ABSC could be enhanced by accounting for the vocal production mechanisms of birds, and, in particular, the spectro-temporal structure of bird sounds. In this paper, a new front-end for ABSC is proposed that incorporates this specific information through the non-negative decomposition of bird sound spectrograms. It consists of the following two different stages: short-time feature extraction and temporal feature integration. In the first stage, which aims at providing a better spectral representation of bird sounds on a frame-by-frame basis, two methods are evaluated. In the first method, cepstral-like features (NMF_CC) are extracted by using a filter bank that is automatically learned by means of the application of Non-Negative Matrix Factorization (NMF) on bird audio spectrograms. In the second method, the features are directly derived from the activation coefficients of the spectrogram decomposition as performed through NMF (H_CC). The second stage summarizes the most relevant information contained in the short-time features by computing several statistical measures over long segments. The experiments show that the use of NMF_CC and H_CC in conjunction with temporal integration significantly improves the performance of a Support Vector Machine (SVM)-based ABSC system with respect to conventional MFCC.
first_indexed 2024-12-11T16:15:57Z
format Article
id doaj.art-6445c69f8873483697d6217b981dff9d
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-11T16:15:57Z
publishDate 2017-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-6445c69f8873483697d6217b981dff9d2022-12-22T00:58:57ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01126e017940310.1371/journal.pone.0179403Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.Jimmy Ludeña-ChoezRaisa Quispe-SonccoAscensión Gallardo-AntolínFeature extraction for Acoustic Bird Species Classification (ABSC) tasks has traditionally been based on parametric representations that were specifically developed for speech signals, such as Mel Frequency Cepstral Coefficients (MFCC). However, the discrimination capabilities of these features for ABSC could be enhanced by accounting for the vocal production mechanisms of birds, and, in particular, the spectro-temporal structure of bird sounds. In this paper, a new front-end for ABSC is proposed that incorporates this specific information through the non-negative decomposition of bird sound spectrograms. It consists of the following two different stages: short-time feature extraction and temporal feature integration. In the first stage, which aims at providing a better spectral representation of bird sounds on a frame-by-frame basis, two methods are evaluated. In the first method, cepstral-like features (NMF_CC) are extracted by using a filter bank that is automatically learned by means of the application of Non-Negative Matrix Factorization (NMF) on bird audio spectrograms. In the second method, the features are directly derived from the activation coefficients of the spectrogram decomposition as performed through NMF (H_CC). The second stage summarizes the most relevant information contained in the short-time features by computing several statistical measures over long segments. The experiments show that the use of NMF_CC and H_CC in conjunction with temporal integration significantly improves the performance of a Support Vector Machine (SVM)-based ABSC system with respect to conventional MFCC.http://europepmc.org/articles/PMC5476267?pdf=render
spellingShingle Jimmy Ludeña-Choez
Raisa Quispe-Soncco
Ascensión Gallardo-Antolín
Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
PLoS ONE
title Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
title_full Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
title_fullStr Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
title_full_unstemmed Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
title_short Bird sound spectrogram decomposition through Non-Negative Matrix Factorization for the acoustic classification of bird species.
title_sort bird sound spectrogram decomposition through non negative matrix factorization for the acoustic classification of bird species
url http://europepmc.org/articles/PMC5476267?pdf=render
work_keys_str_mv AT jimmyludenachoez birdsoundspectrogramdecompositionthroughnonnegativematrixfactorizationfortheacousticclassificationofbirdspecies
AT raisaquispesoncco birdsoundspectrogramdecompositionthroughnonnegativematrixfactorizationfortheacousticclassificationofbirdspecies
AT ascensiongallardoantolin birdsoundspectrogramdecompositionthroughnonnegativematrixfactorizationfortheacousticclassificationofbirdspecies