A species richness estimator for sample‐based incidence data sampled without replacement

Abstract The accurate estimation of species richness in a target region is still a statistical challenge, especially in a highly heterogeneous community. Most richness estimators have been developed based on the assumption that data are randomly sampled with replacement or that data are sampled from...

Full description

Bibliographic Details
Main Author: Chun‐Huo Chiu
Format: Article
Language:English
Published: Wiley 2023-09-01
Series:Methods in Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1111/2041-210X.14146
_version_ 1797692092156215296
author Chun‐Huo Chiu
author_facet Chun‐Huo Chiu
author_sort Chun‐Huo Chiu
collection DOAJ
description Abstract The accurate estimation of species richness in a target region is still a statistical challenge, especially in a highly heterogeneous community. Most richness estimators have been developed based on the assumption that data are randomly sampled with replacement or that data are sampled from an infinite population. However, in reality, most sampling schemes in the field are implemented as sampling without replacement (SWOR). As such, estimators derived based on sampling with replacement may cause overestimation as the sampling fraction increases and not converge to the true richness as the sampling fraction approaches one. Sample‐based incidence data, in which the sampling unit is a plot, and only the presence or absence of a species in each chosen plot is recorded, is one of the most commonly used data types for assessing species diversity in ecological studies. In this manuscript, according to sample‐based incidence data collected through SWOR, a new richness estimator is proposed using a truncated beta‐binomial mixture model. The new estimator was obtained through the moment approach, which avoids using iterative numerical algorithms for parameter estimation and presents a closed‐form estimator as an alternative to the maximum likelihood method. Although the newly proposed method is a parametric‐based richness estimator, similar to nonparametric estimators, only the rare species in the sample (i.e. the frequencies of uniques and duplicates) are required to estimate undetected richness. Based on the hypothetical models, the statistical performances of the proposed estimator are evaluated under varying degrees of heterogeneity and different mean species detection rates. Compared to other widely used nonparametric and parametric estimators, the simulation results indicate that the proposed estimator has a smaller bias and lower root mean square error when the sampling fraction is greater than 10%, particularly in highly heterogeneous communities. In addition, one ForestGEO permanent forest plot dataset is used to evaluate and compare the proposed approach with other estimators discussed in the study. The results demonstrate that the proposed estimator, in comparison to other widely used estimators, produces less biased estimate of true richness, along with more accurate 95% confidence interval.
first_indexed 2024-03-12T02:22:42Z
format Article
id doaj.art-fd488256d86f4bf6bc8d561ed720b413
institution Directory Open Access Journal
issn 2041-210X
language English
last_indexed 2024-03-12T02:22:42Z
publishDate 2023-09-01
publisher Wiley
record_format Article
series Methods in Ecology and Evolution
spelling doaj.art-fd488256d86f4bf6bc8d561ed720b4132023-09-06T04:43:40ZengWileyMethods in Ecology and Evolution2041-210X2023-09-011492482249310.1111/2041-210X.14146A species richness estimator for sample‐based incidence data sampled without replacementChun‐Huo Chiu0Department of Agronomy National Taiwan University Taipei TaiwanAbstract The accurate estimation of species richness in a target region is still a statistical challenge, especially in a highly heterogeneous community. Most richness estimators have been developed based on the assumption that data are randomly sampled with replacement or that data are sampled from an infinite population. However, in reality, most sampling schemes in the field are implemented as sampling without replacement (SWOR). As such, estimators derived based on sampling with replacement may cause overestimation as the sampling fraction increases and not converge to the true richness as the sampling fraction approaches one. Sample‐based incidence data, in which the sampling unit is a plot, and only the presence or absence of a species in each chosen plot is recorded, is one of the most commonly used data types for assessing species diversity in ecological studies. In this manuscript, according to sample‐based incidence data collected through SWOR, a new richness estimator is proposed using a truncated beta‐binomial mixture model. The new estimator was obtained through the moment approach, which avoids using iterative numerical algorithms for parameter estimation and presents a closed‐form estimator as an alternative to the maximum likelihood method. Although the newly proposed method is a parametric‐based richness estimator, similar to nonparametric estimators, only the rare species in the sample (i.e. the frequencies of uniques and duplicates) are required to estimate undetected richness. Based on the hypothetical models, the statistical performances of the proposed estimator are evaluated under varying degrees of heterogeneity and different mean species detection rates. Compared to other widely used nonparametric and parametric estimators, the simulation results indicate that the proposed estimator has a smaller bias and lower root mean square error when the sampling fraction is greater than 10%, particularly in highly heterogeneous communities. In addition, one ForestGEO permanent forest plot dataset is used to evaluate and compare the proposed approach with other estimators discussed in the study. The results demonstrate that the proposed estimator, in comparison to other widely used estimators, produces less biased estimate of true richness, along with more accurate 95% confidence interval.https://doi.org/10.1111/2041-210X.14146ForestGEO permanent forest plotrichness estimatorsample‐based incidence datasampling without replacementtruncated beta‐binomial model
spellingShingle Chun‐Huo Chiu
A species richness estimator for sample‐based incidence data sampled without replacement
Methods in Ecology and Evolution
ForestGEO permanent forest plot
richness estimator
sample‐based incidence data
sampling without replacement
truncated beta‐binomial model
title A species richness estimator for sample‐based incidence data sampled without replacement
title_full A species richness estimator for sample‐based incidence data sampled without replacement
title_fullStr A species richness estimator for sample‐based incidence data sampled without replacement
title_full_unstemmed A species richness estimator for sample‐based incidence data sampled without replacement
title_short A species richness estimator for sample‐based incidence data sampled without replacement
title_sort species richness estimator for sample based incidence data sampled without replacement
topic ForestGEO permanent forest plot
richness estimator
sample‐based incidence data
sampling without replacement
truncated beta‐binomial model
url https://doi.org/10.1111/2041-210X.14146
work_keys_str_mv AT chunhuochiu aspeciesrichnessestimatorforsamplebasedincidencedatasampledwithoutreplacement
AT chunhuochiu speciesrichnessestimatorforsamplebasedincidencedatasampledwithoutreplacement