The likelihood of gene trees under selective models

<p>The extent to which natural selection shapes diversity within populations is a key question for population genetics. Thus, there is considerable interest in quantifying the strength of selection. In this thesis a full likelihood approach for inference about selection at a single site withi...

Full description

Bibliographic Details
Main Author: Coop, GM
Other Authors: Griffiths, R
Format: Thesis
Language:English
Published: 2004
Subjects:
_version_ 1797109068881461248
author Coop, GM
author2 Griffiths, R
author_facet Griffiths, R
Coop, GM
author_sort Coop, GM
collection OXFORD
description <p>The extent to which natural selection shapes diversity within populations is a key question for population genetics. Thus, there is considerable interest in quantifying the strength of selection. In this thesis a full likelihood approach for inference about selection at a single site within an otherwise neutral fully-linked sequence of sites is developed. Integral to many of the ideas introduced in this thesis is the reversibility of the diffusion process, and some past approaches to this concept are reviewed. A coalescent model of evolution is used to model the ancestry of a sample of DNA sequences which have the selected site segregating. A novel method for simulating the coalescent with selection, acting at a single biallelic site, is described. Selection is incorporated through modelling the frequency of the selected and neutral allelic classes stochastically back in time. The ancestry is then simulated using a subdivided population model considering the population frequencies through time as variable population sizes. The approach is general and can be used for any selection scheme at a biallelic locus. The mutation model, for the selected and neutral sites, is the infinitely-many-sites model where there is no back or parallel mutation at sites. This allows a unique perfect phylogeny, a gene tree, to be constructed from the configuration of mutations on the sample sequences. An importance sampling algorithm is described to explore over coalescent tree space consistent with this gene tree. The method is used to assess the evidence for selection in a number of data sets. These are as follows: a partial selective sweep in the G6PD gene (Verrelli et al., 2002); a recent full sweep in the Factor IX gene (Harris and Hey, 2001); and balancing selection in the DCP1 gene (Rieder et al., 1999). Little evidence of the action of selection is found in the data set of Verrelli et al. (2002) and the data set of Rieder et al. (1999) seems inconsistent with the model of balancing selection. The patterns of diversity in the data set of Harris and Hey (2001) offer support of the hypothesis of a full sweep.</p>
first_indexed 2024-03-07T07:36:53Z
format Thesis
id oxford-uuid:ba97d36c-61c1-40c8-a1f4-e7ddc8918d5b
institution University of Oxford
language English
last_indexed 2024-03-07T07:36:53Z
publishDate 2004
record_format dspace
spelling oxford-uuid:ba97d36c-61c1-40c8-a1f4-e7ddc8918d5b2023-03-14T12:00:54ZThe likelihood of gene trees under selective modelsThesishttp://purl.org/coar/resource_type/c_db06uuid:ba97d36c-61c1-40c8-a1f4-e7ddc8918d5bEvolution (Biology)Population geneticsStatistical methodsNatural selectionEnglishPolonsky Theses Digitisation Project2004Coop, GMGriffiths, R<p>The extent to which natural selection shapes diversity within populations is a key question for population genetics. Thus, there is considerable interest in quantifying the strength of selection. In this thesis a full likelihood approach for inference about selection at a single site within an otherwise neutral fully-linked sequence of sites is developed. Integral to many of the ideas introduced in this thesis is the reversibility of the diffusion process, and some past approaches to this concept are reviewed. A coalescent model of evolution is used to model the ancestry of a sample of DNA sequences which have the selected site segregating. A novel method for simulating the coalescent with selection, acting at a single biallelic site, is described. Selection is incorporated through modelling the frequency of the selected and neutral allelic classes stochastically back in time. The ancestry is then simulated using a subdivided population model considering the population frequencies through time as variable population sizes. The approach is general and can be used for any selection scheme at a biallelic locus. The mutation model, for the selected and neutral sites, is the infinitely-many-sites model where there is no back or parallel mutation at sites. This allows a unique perfect phylogeny, a gene tree, to be constructed from the configuration of mutations on the sample sequences. An importance sampling algorithm is described to explore over coalescent tree space consistent with this gene tree. The method is used to assess the evidence for selection in a number of data sets. These are as follows: a partial selective sweep in the G6PD gene (Verrelli et al., 2002); a recent full sweep in the Factor IX gene (Harris and Hey, 2001); and balancing selection in the DCP1 gene (Rieder et al., 1999). Little evidence of the action of selection is found in the data set of Verrelli et al. (2002) and the data set of Rieder et al. (1999) seems inconsistent with the model of balancing selection. The patterns of diversity in the data set of Harris and Hey (2001) offer support of the hypothesis of a full sweep.</p>
spellingShingle Evolution (Biology)
Population genetics
Statistical methods
Natural selection
Coop, GM
The likelihood of gene trees under selective models
title The likelihood of gene trees under selective models
title_full The likelihood of gene trees under selective models
title_fullStr The likelihood of gene trees under selective models
title_full_unstemmed The likelihood of gene trees under selective models
title_short The likelihood of gene trees under selective models
title_sort likelihood of gene trees under selective models
topic Evolution (Biology)
Population genetics
Statistical methods
Natural selection
work_keys_str_mv AT coopgm thelikelihoodofgenetreesunderselectivemodels
AT coopgm likelihoodofgenetreesunderselectivemodels