A review of supervised learning methods for classifying animal behavioural states from environmental features

Abstract Accurately predicting behavioural modes of animals in response to environmental features is important for ecology and conservation. Supervised learning (SL) methods are increasingly common in animal movement ecology for classifying behavioural modes. However, few examples exist of applying...

Full description

Bibliographic Details
Main Authors: Silas Bergen, Manuela M. Huso, Adam E. Duerr, Melissa A. Braham, Sara Schmuecker, Tricia A. Miller, Todd E. Katzner
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:Methods in Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1111/2041-210X.14019
_version_ 1797766979870785536
author Silas Bergen
Manuela M. Huso
Adam E. Duerr
Melissa A. Braham
Sara Schmuecker
Tricia A. Miller
Todd E. Katzner
author_facet Silas Bergen
Manuela M. Huso
Adam E. Duerr
Melissa A. Braham
Sara Schmuecker
Tricia A. Miller
Todd E. Katzner
author_sort Silas Bergen
collection DOAJ
description Abstract Accurately predicting behavioural modes of animals in response to environmental features is important for ecology and conservation. Supervised learning (SL) methods are increasingly common in animal movement ecology for classifying behavioural modes. However, few examples exist of applying SL to classify polytomous animal behaviour from environmental features especially in the context of millions of animal observations. We review SL methods (weighted k‐nearest neighbours; neural nets; random forests; and boosted classification trees with XGBoost) for classifying polytomous animal behaviour from environmental predictors. We also describe tuning parameter selection and assessment strategies, approaches for visualizing relationships between predictors and class outputs, and computational considerations. We demonstrate these methods by predicting three categories of risk to bald eagles from colliding with wind turbines using, as predictors, 12 environmental state features associated with 1.7 million GPS telemetry data points from 57 eagles. Of the SL methods we considered, XGBoost yielded the most accurate model with 86.2% classification accuracy and pairwise‐averaged area under the ROC curve of 90.6. Computational time of XGBoost scaled better to large data than any other SL method. We also show how SHAP values integrated in the R package (xgboost) facilitate investigation of variable relationships and importance. For big data applications, XGBoost appears to provide superior classification accuracy and computational efficiency. Our results suggest XGBoost should be considered as an early modelling option in situations where the intent is to classify millions of animal behaviour observations from environmental predictors and to understand relationships between those predictors and movement behaviours. We also offer a tutorial to assist researchers in implementing this method.
first_indexed 2024-03-12T20:33:08Z
format Article
id doaj.art-429ee66087e64e46a244999ea18c995d
institution Directory Open Access Journal
issn 2041-210X
language English
last_indexed 2024-03-12T20:33:08Z
publishDate 2023-01-01
publisher Wiley
record_format Article
series Methods in Ecology and Evolution
spelling doaj.art-429ee66087e64e46a244999ea18c995d2023-08-01T18:55:41ZengWileyMethods in Ecology and Evolution2041-210X2023-01-0114118920210.1111/2041-210X.14019A review of supervised learning methods for classifying animal behavioural states from environmental featuresSilas Bergen0Manuela M. Huso1Adam E. Duerr2Melissa A. Braham3Sara Schmuecker4Tricia A. Miller5Todd E. Katzner6Department of Mathematics and Statistics Winona State University Winona Minnesota USAU.S. Geological Survey Forest and Rangeland Ecosystem Science Center Corvallis Oregon USABloom Research Inc Los Angeles California USAConservation Science Global, Inc West Cape May New Jersey USAU.S. Fish and Wildlife Service Illinois‐Iowa Field Office Moline Illinois USAWest Virginia University Morgantown West Virginia USAU.S. Geological Survey Forest and Rangeland Ecosystem Science Center Boise Idaho USAAbstract Accurately predicting behavioural modes of animals in response to environmental features is important for ecology and conservation. Supervised learning (SL) methods are increasingly common in animal movement ecology for classifying behavioural modes. However, few examples exist of applying SL to classify polytomous animal behaviour from environmental features especially in the context of millions of animal observations. We review SL methods (weighted k‐nearest neighbours; neural nets; random forests; and boosted classification trees with XGBoost) for classifying polytomous animal behaviour from environmental predictors. We also describe tuning parameter selection and assessment strategies, approaches for visualizing relationships between predictors and class outputs, and computational considerations. We demonstrate these methods by predicting three categories of risk to bald eagles from colliding with wind turbines using, as predictors, 12 environmental state features associated with 1.7 million GPS telemetry data points from 57 eagles. Of the SL methods we considered, XGBoost yielded the most accurate model with 86.2% classification accuracy and pairwise‐averaged area under the ROC curve of 90.6. Computational time of XGBoost scaled better to large data than any other SL method. We also show how SHAP values integrated in the R package (xgboost) facilitate investigation of variable relationships and importance. For big data applications, XGBoost appears to provide superior classification accuracy and computational efficiency. Our results suggest XGBoost should be considered as an early modelling option in situations where the intent is to classify millions of animal behaviour observations from environmental predictors and to understand relationships between those predictors and movement behaviours. We also offer a tutorial to assist researchers in implementing this method.https://doi.org/10.1111/2041-210X.14019behavioural classificationboosted classification treeneural networksrandom forestsupervised learningweighted k‐nearest neighbour
spellingShingle Silas Bergen
Manuela M. Huso
Adam E. Duerr
Melissa A. Braham
Sara Schmuecker
Tricia A. Miller
Todd E. Katzner
A review of supervised learning methods for classifying animal behavioural states from environmental features
Methods in Ecology and Evolution
behavioural classification
boosted classification tree
neural networks
random forest
supervised learning
weighted k‐nearest neighbour
title A review of supervised learning methods for classifying animal behavioural states from environmental features
title_full A review of supervised learning methods for classifying animal behavioural states from environmental features
title_fullStr A review of supervised learning methods for classifying animal behavioural states from environmental features
title_full_unstemmed A review of supervised learning methods for classifying animal behavioural states from environmental features
title_short A review of supervised learning methods for classifying animal behavioural states from environmental features
title_sort review of supervised learning methods for classifying animal behavioural states from environmental features
topic behavioural classification
boosted classification tree
neural networks
random forest
supervised learning
weighted k‐nearest neighbour
url https://doi.org/10.1111/2041-210X.14019
work_keys_str_mv AT silasbergen areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT manuelamhuso areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT adameduerr areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT melissaabraham areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT saraschmuecker areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT triciaamiller areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT toddekatzner areviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT silasbergen reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT manuelamhuso reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT adameduerr reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT melissaabraham reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT saraschmuecker reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT triciaamiller reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures
AT toddekatzner reviewofsupervisedlearningmethodsforclassifyinganimalbehaviouralstatesfromenvironmentalfeatures