Estimating effective population size changes from preferentially sampled genetic sequences.

Coalescent theory combined with statistical modeling allows us to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. When sequences are sampled serially through time and the distribution of the sampling times depends on the...

Full description

Bibliographic Details
Main Authors: Michael D Karcher, Luiz Max Carvalho, Marc A Suchard, Gytis Dudas, Vladimir N Minin
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-10-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007774
_version_ 1818348866602270720
author Michael D Karcher
Luiz Max Carvalho
Marc A Suchard
Gytis Dudas
Vladimir N Minin
author_facet Michael D Karcher
Luiz Max Carvalho
Marc A Suchard
Gytis Dudas
Vladimir N Minin
author_sort Michael D Karcher
collection DOAJ
description Coalescent theory combined with statistical modeling allows us to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. When sequences are sampled serially through time and the distribution of the sampling times depends on the effective population size, explicit statistical modeling of sampling times improves population size estimation. Previous work assumed that the genealogy relating sampled sequences is known and modeled sampling times as an inhomogeneous Poisson process with log-intensity equal to a linear function of the log-transformed effective population size. We improve this approach in two ways. First, we extend the method to allow for joint Bayesian estimation of the genealogy, effective population size trajectory, and other model parameters. Next, we improve the sampling time model by incorporating additional sources of information in the form of time-varying covariates. We validate our new modeling framework using a simulation study and apply our new methodology to analyses of population dynamics of seasonal influenza and to the recent Ebola virus outbreak in West Africa.
first_indexed 2024-12-13T17:56:52Z
format Article
id doaj.art-a12e8166102741dfb75db7e805a95615
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-13T17:56:52Z
publishDate 2020-10-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-a12e8166102741dfb75db7e805a956152022-12-21T23:36:22ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-10-011610e100777410.1371/journal.pcbi.1007774Estimating effective population size changes from preferentially sampled genetic sequences.Michael D KarcherLuiz Max CarvalhoMarc A SuchardGytis DudasVladimir N MininCoalescent theory combined with statistical modeling allows us to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. When sequences are sampled serially through time and the distribution of the sampling times depends on the effective population size, explicit statistical modeling of sampling times improves population size estimation. Previous work assumed that the genealogy relating sampled sequences is known and modeled sampling times as an inhomogeneous Poisson process with log-intensity equal to a linear function of the log-transformed effective population size. We improve this approach in two ways. First, we extend the method to allow for joint Bayesian estimation of the genealogy, effective population size trajectory, and other model parameters. Next, we improve the sampling time model by incorporating additional sources of information in the form of time-varying covariates. We validate our new modeling framework using a simulation study and apply our new methodology to analyses of population dynamics of seasonal influenza and to the recent Ebola virus outbreak in West Africa.https://doi.org/10.1371/journal.pcbi.1007774
spellingShingle Michael D Karcher
Luiz Max Carvalho
Marc A Suchard
Gytis Dudas
Vladimir N Minin
Estimating effective population size changes from preferentially sampled genetic sequences.
PLoS Computational Biology
title Estimating effective population size changes from preferentially sampled genetic sequences.
title_full Estimating effective population size changes from preferentially sampled genetic sequences.
title_fullStr Estimating effective population size changes from preferentially sampled genetic sequences.
title_full_unstemmed Estimating effective population size changes from preferentially sampled genetic sequences.
title_short Estimating effective population size changes from preferentially sampled genetic sequences.
title_sort estimating effective population size changes from preferentially sampled genetic sequences
url https://doi.org/10.1371/journal.pcbi.1007774
work_keys_str_mv AT michaeldkarcher estimatingeffectivepopulationsizechangesfrompreferentiallysampledgeneticsequences
AT luizmaxcarvalho estimatingeffectivepopulationsizechangesfrompreferentiallysampledgeneticsequences
AT marcasuchard estimatingeffectivepopulationsizechangesfrompreferentiallysampledgeneticsequences
AT gytisdudas estimatingeffectivepopulationsizechangesfrompreferentiallysampledgeneticsequences
AT vladimirnminin estimatingeffectivepopulationsizechangesfrompreferentiallysampledgeneticsequences