Bird and whale species identification using sound images

Image identification of animals is mostly centred on identifying them based on their appearance, but there are other ways images can be used to identify animals, including by representing the sounds they make with images. In this study, the authors present a novel and effective approach for automate...

Full description

Bibliographic Details
Main Authors: Loris Nanni, Rafael L. Aguiar, Yandre M.G. Costa, Sheryl Brahnam, Carlos N. Silla Jr., Ricky L. Brattin, Zhao Zhao
Format: Article
Language:English
Published: Wiley 2018-03-01
Series:IET Computer Vision
Subjects:
Online Access:https://doi.org/10.1049/iet-cvi.2017.0075
_version_ 1827817621969436672
author Loris Nanni
Rafael L. Aguiar
Yandre M.G. Costa
Sheryl Brahnam
Carlos N. Silla Jr.
Ricky L. Brattin
Zhao Zhao
author_facet Loris Nanni
Rafael L. Aguiar
Yandre M.G. Costa
Sheryl Brahnam
Carlos N. Silla Jr.
Ricky L. Brattin
Zhao Zhao
author_sort Loris Nanni
collection DOAJ
description Image identification of animals is mostly centred on identifying them based on their appearance, but there are other ways images can be used to identify animals, including by representing the sounds they make with images. In this study, the authors present a novel and effective approach for automated identification of birds and whales using some of the best texture descriptors in the computer vision literature. The visual features of sounds are built starting from the audio file and are taken from images constructed from different spectrograms and from harmonic and percussion images. These images are divided into sub‐windows from which sets of texture descriptors are extracted. The experiments reported in this study using a dataset of Bird vocalisations targeted for species recognition and a dataset of right whale calls targeted for whale detection (as well as three well‐known benchmarks for music genre classification) demonstrate that the fusion of different texture features enhances performance. The experiments also demonstrate that the fusion of different texture features with audio features is not only comparable with existing audio signal approaches but also statistically improves some of the stand‐alone audio features. The code for the experiments will be publicly available at https://www.dropbox.com/s/bguw035yrqz0pwp/ElencoCode.docx?dl=0.
first_indexed 2024-03-12T00:36:51Z
format Article
id doaj.art-1166c676d93548d4b4ea33a06c9384b8
institution Directory Open Access Journal
issn 1751-9632
1751-9640
language English
last_indexed 2024-03-12T00:36:51Z
publishDate 2018-03-01
publisher Wiley
record_format Article
series IET Computer Vision
spelling doaj.art-1166c676d93548d4b4ea33a06c9384b82023-09-15T09:32:39ZengWileyIET Computer Vision1751-96321751-96402018-03-0112217818410.1049/iet-cvi.2017.0075Bird and whale species identification using sound imagesLoris Nanni0Rafael L. Aguiar1Yandre M.G. Costa2Sheryl Brahnam3Carlos N. Silla Jr.4Ricky L. Brattin5Zhao Zhao6DEIUniversity of PaduaItalyPCC/DINState University of MaringáMaringáBrazilPCC/DINState University of MaringáMaringáBrazilCISMissouri State UniversitySpringfieldUSAPPGIAPontifical Catholic University of ParanáCuritibaBrazilCISMissouri State UniversitySpringfieldUSASchool of Electronic and Optical EngineeringNanjing University of Science and TechnologyNanjing210094People's Republic of ChinaImage identification of animals is mostly centred on identifying them based on their appearance, but there are other ways images can be used to identify animals, including by representing the sounds they make with images. In this study, the authors present a novel and effective approach for automated identification of birds and whales using some of the best texture descriptors in the computer vision literature. The visual features of sounds are built starting from the audio file and are taken from images constructed from different spectrograms and from harmonic and percussion images. These images are divided into sub‐windows from which sets of texture descriptors are extracted. The experiments reported in this study using a dataset of Bird vocalisations targeted for species recognition and a dataset of right whale calls targeted for whale detection (as well as three well‐known benchmarks for music genre classification) demonstrate that the fusion of different texture features enhances performance. The experiments also demonstrate that the fusion of different texture features with audio features is not only comparable with existing audio signal approaches but also statistically improves some of the stand‐alone audio features. The code for the experiments will be publicly available at https://www.dropbox.com/s/bguw035yrqz0pwp/ElencoCode.docx?dl=0.https://doi.org/10.1049/iet-cvi.2017.0075sound imagesvisual featurespercussion imagesharmonic imagesbird vocalisationstexture features
spellingShingle Loris Nanni
Rafael L. Aguiar
Yandre M.G. Costa
Sheryl Brahnam
Carlos N. Silla Jr.
Ricky L. Brattin
Zhao Zhao
Bird and whale species identification using sound images
IET Computer Vision
sound images
visual features
percussion images
harmonic images
bird vocalisations
texture features
title Bird and whale species identification using sound images
title_full Bird and whale species identification using sound images
title_fullStr Bird and whale species identification using sound images
title_full_unstemmed Bird and whale species identification using sound images
title_short Bird and whale species identification using sound images
title_sort bird and whale species identification using sound images
topic sound images
visual features
percussion images
harmonic images
bird vocalisations
texture features
url https://doi.org/10.1049/iet-cvi.2017.0075
work_keys_str_mv AT lorisnanni birdandwhalespeciesidentificationusingsoundimages
AT rafaellaguiar birdandwhalespeciesidentificationusingsoundimages
AT yandremgcosta birdandwhalespeciesidentificationusingsoundimages
AT sherylbrahnam birdandwhalespeciesidentificationusingsoundimages
AT carlosnsillajr birdandwhalespeciesidentificationusingsoundimages
AT rickylbrattin birdandwhalespeciesidentificationusingsoundimages
AT zhaozhao birdandwhalespeciesidentificationusingsoundimages