Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset

Abstract To analyze species count data when detection is imperfect, ecologists need models to estimate relative abundance in the presence of unknown sources of heterogeneity. Two candidate models are generalized linear mixed models (GLMMs) and hierarchical N-mixture models. GLMMs are computationally...

Full description

Bibliographic Details
Main Authors: Benjamin R. Goldstein, Perry de Valpine
Format: Article
Language:English
Published: Nature Portfolio 2022-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-022-16368-z
_version_ 1811287147076911104
author Benjamin R. Goldstein
Perry de Valpine
author_facet Benjamin R. Goldstein
Perry de Valpine
author_sort Benjamin R. Goldstein
collection DOAJ
description Abstract To analyze species count data when detection is imperfect, ecologists need models to estimate relative abundance in the presence of unknown sources of heterogeneity. Two candidate models are generalized linear mixed models (GLMMs) and hierarchical N-mixture models. GLMMs are computationally robust but do not explicitly separate detection from abundance patterns. N-mixture models separately estimate detection and abundance via a latent state but are sensitive to violations in assumptions and subject to practical estimation issues. When one can assume that detection is not systematically confounded with ecological patterns of interest, these two models can be viewed as sharing a heuristic framework for relative abundance estimation. Model selection can then determine which predicts observed counts best, for example by AIC. We compared four N-mixture model variants and two GLMM variants for predicting bird counts in local subsets of a citizen science dataset, eBird, based on model selection and goodness-of-fit measures. We found that both GLMMs and N-mixture models—especially N-mixtures with beta-binomial detection submodels—were supported in a moderate number of datasets, suggesting that both tools are useful and that relative fit is context-dependent. We provide faster software implementations of N-mixture likelihood calculations and a reparameterization to interpret unstable estimates for N-mixture models.
first_indexed 2024-04-13T03:13:34Z
format Article
id doaj.art-e4fcfb9f6af344b19158af8e7031e246
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-13T03:13:34Z
publishDate 2022-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-e4fcfb9f6af344b19158af8e7031e2462022-12-22T03:04:59ZengNature PortfolioScientific Reports2045-23222022-07-0112111210.1038/s41598-022-16368-zComparing N-mixture models and GLMMs for relative abundance estimation in a citizen science datasetBenjamin R. Goldstein0Perry de Valpine1Department of Environmental Science, Policy, and Management, University of California, BerkeleyDepartment of Environmental Science, Policy, and Management, University of California, BerkeleyAbstract To analyze species count data when detection is imperfect, ecologists need models to estimate relative abundance in the presence of unknown sources of heterogeneity. Two candidate models are generalized linear mixed models (GLMMs) and hierarchical N-mixture models. GLMMs are computationally robust but do not explicitly separate detection from abundance patterns. N-mixture models separately estimate detection and abundance via a latent state but are sensitive to violations in assumptions and subject to practical estimation issues. When one can assume that detection is not systematically confounded with ecological patterns of interest, these two models can be viewed as sharing a heuristic framework for relative abundance estimation. Model selection can then determine which predicts observed counts best, for example by AIC. We compared four N-mixture model variants and two GLMM variants for predicting bird counts in local subsets of a citizen science dataset, eBird, based on model selection and goodness-of-fit measures. We found that both GLMMs and N-mixture models—especially N-mixtures with beta-binomial detection submodels—were supported in a moderate number of datasets, suggesting that both tools are useful and that relative fit is context-dependent. We provide faster software implementations of N-mixture likelihood calculations and a reparameterization to interpret unstable estimates for N-mixture models.https://doi.org/10.1038/s41598-022-16368-z
spellingShingle Benjamin R. Goldstein
Perry de Valpine
Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
Scientific Reports
title Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
title_full Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
title_fullStr Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
title_full_unstemmed Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
title_short Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset
title_sort comparing n mixture models and glmms for relative abundance estimation in a citizen science dataset
url https://doi.org/10.1038/s41598-022-16368-z
work_keys_str_mv AT benjaminrgoldstein comparingnmixturemodelsandglmmsforrelativeabundanceestimationinacitizensciencedataset
AT perrydevalpine comparingnmixturemodelsandglmmsforrelativeabundanceestimationinacitizensciencedataset