Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study

Species distribution models (SDM) are widely used to describe and explain how species relate to their environment and predict their spatial distributions. As such, they are the cornerstone of most of spatial planning efforts worldwide. SDM can be implemented with a wide array of data types (presence...

Full description

Bibliographic Details
Main Authors: Lambert, Charlotte, Virgili, Auriane
Format: Article
Language:English
Published: Peer Community In 2023-04-01
Series:Peer Community Journal
Online Access:https://peercommunityjournal.org/articles/10.24072/pcjournal.263/
_version_ 1797518052698357760
author Lambert, Charlotte
Virgili, Auriane
author_facet Lambert, Charlotte
Virgili, Auriane
author_sort Lambert, Charlotte
collection DOAJ
description Species distribution models (SDM) are widely used to describe and explain how species relate to their environment and predict their spatial distributions. As such, they are the cornerstone of most of spatial planning efforts worldwide. SDM can be implemented with a wide array of data types (presence-only, presence-absence, count...), which can either be point- or areal-based, and use a wide array of environmental conditions as predictor variables. The choice of the sampling type as well as the resolution of environmental conditions to be used are recognized as of crucial importance, yet we lack any quantification of the effects these decisions may have on SDM reliability. In the present work, we fill this gap with an unprecedented simulation procedure. We simulated 100 possible distributions of two different virtual species in two different regions. Species distribution were modelled using either segment- or areal-based sampling and five different spatial resolutions of environmental conditions. The SDM performances were inspected by statistical metrics, model composition, shapes of relationships and prediction quality. We provided clear evidence of stochasticity in the modelling process (particularly in the shapes of relationships): two dataset from the same survey, species and region could yield different results. Sampling type had stronger effects than spatial resolution on the final model relevance. The effect of coarsening the resolution was directly related to the resistance of the spatial features to changes of scale: SDM failed to adequately identify spatial distributions when the spatial features targeted by the species were diluted by resolution coarsening. These results have important implications for the SDM community, backing up some commonly accepted choices, but also by highlighting some up-to-now unexpected features of SDM (stochasticity). As a whole, this work calls for carefully weighted decisions in implementing models, and for caution in interpreting results.
first_indexed 2024-03-10T07:23:47Z
format Article
id doaj.art-0e7c58b1ae7544c8aff32d1c4b7a2edb
institution Directory Open Access Journal
issn 2804-3871
language English
last_indexed 2024-03-10T07:23:47Z
publishDate 2023-04-01
publisher Peer Community In
record_format Article
series Peer Community Journal
spelling doaj.art-0e7c58b1ae7544c8aff32d1c4b7a2edb2023-11-22T14:22:10ZengPeer Community InPeer Community Journal2804-38712023-04-01310.24072/pcjournal.26310.24072/pcjournal.263Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study Lambert, Charlotte0https://orcid.org/0000-0002-1128-5262Virgili, Auriane1https://orcid.org/0000-0002-7581-7985Littoral ENvironnement et Sociétés UMR 7266 CNRS-LRUniv, 2 Rue Olympe de Gouges, 17000 La Rochelle, FranceObservatoire Pelagis UAR 3462 CNRS-LRUniv, 5 allée de l’Océan, 17000 La Rochelle, FranceSpecies distribution models (SDM) are widely used to describe and explain how species relate to their environment and predict their spatial distributions. As such, they are the cornerstone of most of spatial planning efforts worldwide. SDM can be implemented with a wide array of data types (presence-only, presence-absence, count...), which can either be point- or areal-based, and use a wide array of environmental conditions as predictor variables. The choice of the sampling type as well as the resolution of environmental conditions to be used are recognized as of crucial importance, yet we lack any quantification of the effects these decisions may have on SDM reliability. In the present work, we fill this gap with an unprecedented simulation procedure. We simulated 100 possible distributions of two different virtual species in two different regions. Species distribution were modelled using either segment- or areal-based sampling and five different spatial resolutions of environmental conditions. The SDM performances were inspected by statistical metrics, model composition, shapes of relationships and prediction quality. We provided clear evidence of stochasticity in the modelling process (particularly in the shapes of relationships): two dataset from the same survey, species and region could yield different results. Sampling type had stronger effects than spatial resolution on the final model relevance. The effect of coarsening the resolution was directly related to the resistance of the spatial features to changes of scale: SDM failed to adequately identify spatial distributions when the spatial features targeted by the species were diluted by resolution coarsening. These results have important implications for the SDM community, backing up some commonly accepted choices, but also by highlighting some up-to-now unexpected features of SDM (stochasticity). As a whole, this work calls for carefully weighted decisions in implementing models, and for caution in interpreting results. https://peercommunityjournal.org/articles/10.24072/pcjournal.263/
spellingShingle Lambert, Charlotte
Virgili, Auriane
Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
Peer Community Journal
title Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
title_full Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
title_fullStr Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
title_full_unstemmed Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
title_short Data stochasticity and model parametrisation impact the performance of species distribution models: insights from a simulation study
title_sort data stochasticity and model parametrisation impact the performance of species distribution models insights from a simulation study
url https://peercommunityjournal.org/articles/10.24072/pcjournal.263/
work_keys_str_mv AT lambertcharlotte datastochasticityandmodelparametrisationimpacttheperformanceofspeciesdistributionmodelsinsightsfromasimulationstudy
AT virgiliauriane datastochasticityandmodelparametrisationimpacttheperformanceofspeciesdistributionmodelsinsightsfromasimulationstudy