The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models

Abstract The use of generalized linear models in Bayesian phylogeography has enabled researchers to simultaneously reconstruct the spatiotemporal history of a virus and quantify the contribution of predictor variables to that process. However, little is known about the sensitivity of this method to...

Full description

Bibliographic Details
Main Authors: Daniel Magee, Jesse E. Taylor, Matthew Scotch
Format: Article
Language:English
Published: Nature Portfolio 2018-04-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-018-24264-8
_version_ 1819202652819423232
author Daniel Magee
Jesse E. Taylor
Matthew Scotch
author_facet Daniel Magee
Jesse E. Taylor
Matthew Scotch
author_sort Daniel Magee
collection DOAJ
description Abstract The use of generalized linear models in Bayesian phylogeography has enabled researchers to simultaneously reconstruct the spatiotemporal history of a virus and quantify the contribution of predictor variables to that process. However, little is known about the sensitivity of this method to the choice of the discrete state partition. Here we investigate this question by analyzing a data set containing 299 sequences of the West Nile virus envelope gene sampled in the United States and fifteen predictors aggregated at four spatial levels. We demonstrate that although the topology of the viral phylogenies was consistent across analyses, support for the predictors depended on the level of aggregation. In particular, we found that the variance of the predictor support metrics was minimized at the most precise level for several predictors and maximized at more sparse levels of aggregation. These results suggest that caution should be taken when partitioning a region into discrete locations to ensure that interpretable, reproducible posterior estimates are obtained. These results also demonstrate why researchers should use the most precise discrete states possible to minimize the posterior variance in such estimates and reveal what truly drives the diffusion of viruses.
first_indexed 2024-12-23T04:07:26Z
format Article
id doaj.art-79ea25c0e10c4e058f1208e66440e22e
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-12-23T04:07:26Z
publishDate 2018-04-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-79ea25c0e10c4e058f1208e66440e22e2022-12-21T18:00:36ZengNature PortfolioScientific Reports2045-23222018-04-018111210.1038/s41598-018-24264-8The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear ModelsDaniel Magee0Jesse E. Taylor1Matthew Scotch2Department of Biomedical Informatics, Arizona State UniversitySchool of Mathematical and Statistical Sciences, Arizona State UniversityDepartment of Biomedical Informatics, Arizona State UniversityAbstract The use of generalized linear models in Bayesian phylogeography has enabled researchers to simultaneously reconstruct the spatiotemporal history of a virus and quantify the contribution of predictor variables to that process. However, little is known about the sensitivity of this method to the choice of the discrete state partition. Here we investigate this question by analyzing a data set containing 299 sequences of the West Nile virus envelope gene sampled in the United States and fifteen predictors aggregated at four spatial levels. We demonstrate that although the topology of the viral phylogenies was consistent across analyses, support for the predictors depended on the level of aggregation. In particular, we found that the variance of the predictor support metrics was minimized at the most precise level for several predictors and maximized at more sparse levels of aggregation. These results suggest that caution should be taken when partitioning a region into discrete locations to ensure that interpretable, reproducible posterior estimates are obtained. These results also demonstrate why researchers should use the most precise discrete states possible to minimize the posterior variance in such estimates and reveal what truly drives the diffusion of viruses.https://doi.org/10.1038/s41598-018-24264-8
spellingShingle Daniel Magee
Jesse E. Taylor
Matthew Scotch
The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
Scientific Reports
title The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
title_full The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
title_fullStr The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
title_full_unstemmed The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
title_short The Effects of Sampling Location and Predictor Point Estimate Certainty on Posterior Support in Bayesian Phylogeographic Generalized Linear Models
title_sort effects of sampling location and predictor point estimate certainty on posterior support in bayesian phylogeographic generalized linear models
url https://doi.org/10.1038/s41598-018-24264-8
work_keys_str_mv AT danielmagee theeffectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels
AT jesseetaylor theeffectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels
AT matthewscotch theeffectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels
AT danielmagee effectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels
AT jesseetaylor effectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels
AT matthewscotch effectsofsamplinglocationandpredictorpointestimatecertaintyonposteriorsupportinbayesianphylogeographicgeneralizedlinearmodels