Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny

Abstract Birdsong is a longstanding model system for studying evolution and biodiversity. Here, we collected and analyzed high quality song recordings from seven species in the family Estrildidae. We measured the acoustic features of syllables and then used dimensionality reduction and machine learn...

Full description

Bibliographic Details
Main Authors: Moises Rivera, Jacob A. Edwards, Mark E. Hauber, Sarah M. N. Woolley
Format: Article
Language:English
Published: Nature Portfolio 2023-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-33825-5
_version_ 1797811563621515264
author Moises Rivera
Jacob A. Edwards
Mark E. Hauber
Sarah M. N. Woolley
author_facet Moises Rivera
Jacob A. Edwards
Mark E. Hauber
Sarah M. N. Woolley
author_sort Moises Rivera
collection DOAJ
description Abstract Birdsong is a longstanding model system for studying evolution and biodiversity. Here, we collected and analyzed high quality song recordings from seven species in the family Estrildidae. We measured the acoustic features of syllables and then used dimensionality reduction and machine learning classifiers to identify features that accurately assigned syllables to species. Species differences were captured by the first 3 principal components, corresponding to basic frequency, power distribution, and spectrotemporal features. We then identified the measured features underlying classification accuracy. We found that fundamental frequency, mean frequency, spectral flatness, and syllable duration were the most informative features for species identification. Next, we tested whether specific acoustic features of species’ songs predicted phylogenetic distance. We found significant phylogenetic signal in syllable frequency features, but not in power distribution or spectrotemporal features. Results suggest that frequency features are more constrained by species’ genetics than are other features, and are the best signal features for identifying species from song recordings. The absence of phylogenetic signal in power distribution and spectrotemporal features suggests that these song features are labile, reflecting learning processes and individual recognition.
first_indexed 2024-03-13T07:24:35Z
format Article
id doaj.art-a9b8cc1e505443e68da6c1114ff4703d
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-13T07:24:35Z
publishDate 2023-05-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-a9b8cc1e505443e68da6c1114ff4703d2023-06-04T11:27:18ZengNature PortfolioScientific Reports2045-23222023-05-0113111810.1038/s41598-023-33825-5Machine learning and statistical classification of birdsong link vocal acoustic features with phylogenyMoises Rivera0Jacob A. Edwards1Mark E. Hauber2Sarah M. N. Woolley3Department of Psychology, Hunter College and the Graduate Center, City University of New YorkMortimer B. Zuckerman Mind, Brain, and Behavior Institute, Columbia UniversityDepartment of Evolution, Ecology, and Behavior, School of Biological Sciences, University of Illinois at Urbana-ChampaignMortimer B. Zuckerman Mind, Brain, and Behavior Institute, Columbia UniversityAbstract Birdsong is a longstanding model system for studying evolution and biodiversity. Here, we collected and analyzed high quality song recordings from seven species in the family Estrildidae. We measured the acoustic features of syllables and then used dimensionality reduction and machine learning classifiers to identify features that accurately assigned syllables to species. Species differences were captured by the first 3 principal components, corresponding to basic frequency, power distribution, and spectrotemporal features. We then identified the measured features underlying classification accuracy. We found that fundamental frequency, mean frequency, spectral flatness, and syllable duration were the most informative features for species identification. Next, we tested whether specific acoustic features of species’ songs predicted phylogenetic distance. We found significant phylogenetic signal in syllable frequency features, but not in power distribution or spectrotemporal features. Results suggest that frequency features are more constrained by species’ genetics than are other features, and are the best signal features for identifying species from song recordings. The absence of phylogenetic signal in power distribution and spectrotemporal features suggests that these song features are labile, reflecting learning processes and individual recognition.https://doi.org/10.1038/s41598-023-33825-5
spellingShingle Moises Rivera
Jacob A. Edwards
Mark E. Hauber
Sarah M. N. Woolley
Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
Scientific Reports
title Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
title_full Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
title_fullStr Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
title_full_unstemmed Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
title_short Machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
title_sort machine learning and statistical classification of birdsong link vocal acoustic features with phylogeny
url https://doi.org/10.1038/s41598-023-33825-5
work_keys_str_mv AT moisesrivera machinelearningandstatisticalclassificationofbirdsonglinkvocalacousticfeatureswithphylogeny
AT jacobaedwards machinelearningandstatisticalclassificationofbirdsonglinkvocalacousticfeatureswithphylogeny
AT markehauber machinelearningandstatisticalclassificationofbirdsonglinkvocalacousticfeatureswithphylogeny
AT sarahmnwoolley machinelearningandstatisticalclassificationofbirdsonglinkvocalacousticfeatureswithphylogeny