A method for genome-wide genealogy estimation for thousands of samples
Knowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, m...
Main Authors: | , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
Springer Nature
2019
|
Subjects: |
_version_ | 1797101867343282176 |
---|---|
author | Speidel, L Forest, M Shi, S Myers, S |
author_facet | Speidel, L Forest, M Shi, S Myers, S |
author_sort | Speidel, L |
collection | OXFORD |
description | Knowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, mutational ages, and variable historical population sizes, as well as allowing for data errors. Application to 1000 Genomes Project haplotypes produces joint genealogical histories for 26 human populations. Highly diverged lineages are present in all groups, but most frequent in Africa. Outside Africa, these mainly reflect ancient introgression from groups related to Neanderthals and Denisovans, while African signals instead reflect unknown events, unique to that continent. Our approach allows more powerful inferences of natural selection than previously possible. We identify multiple novel regions under strong positive selection, and multi-allelic traits including hair color, body mass index (BMI), and blood pressure, showing strong evidence of directional selection, varying among human groups. |
first_indexed | 2024-03-07T05:58:01Z |
format | Journal article |
id | oxford-uuid:eb303c0c-f5e5-45b6-80b4-686bca64af7e |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T05:58:01Z |
publishDate | 2019 |
publisher | Springer Nature |
record_format | dspace |
spelling | oxford-uuid:eb303c0c-f5e5-45b6-80b4-686bca64af7e2022-03-27T11:07:52ZA method for genome-wide genealogy estimation for thousands of samplesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:eb303c0c-f5e5-45b6-80b4-686bca64af7epopulation geneticscomputational biology and bioinformaticsgenomicsEnglishSymplectic Elements at OxfordSpringer Nature2019Speidel, LForest, MShi, SMyers, SKnowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, mutational ages, and variable historical population sizes, as well as allowing for data errors. Application to 1000 Genomes Project haplotypes produces joint genealogical histories for 26 human populations. Highly diverged lineages are present in all groups, but most frequent in Africa. Outside Africa, these mainly reflect ancient introgression from groups related to Neanderthals and Denisovans, while African signals instead reflect unknown events, unique to that continent. Our approach allows more powerful inferences of natural selection than previously possible. We identify multiple novel regions under strong positive selection, and multi-allelic traits including hair color, body mass index (BMI), and blood pressure, showing strong evidence of directional selection, varying among human groups. |
spellingShingle | population genetics computational biology and bioinformatics genomics Speidel, L Forest, M Shi, S Myers, S A method for genome-wide genealogy estimation for thousands of samples |
title | A method for genome-wide genealogy estimation for thousands of samples |
title_full | A method for genome-wide genealogy estimation for thousands of samples |
title_fullStr | A method for genome-wide genealogy estimation for thousands of samples |
title_full_unstemmed | A method for genome-wide genealogy estimation for thousands of samples |
title_short | A method for genome-wide genealogy estimation for thousands of samples |
title_sort | method for genome wide genealogy estimation for thousands of samples |
topic | population genetics computational biology and bioinformatics genomics |
work_keys_str_mv | AT speidell amethodforgenomewidegenealogyestimationforthousandsofsamples AT forestm amethodforgenomewidegenealogyestimationforthousandsofsamples AT shis amethodforgenomewidegenealogyestimationforthousandsofsamples AT myerss amethodforgenomewidegenealogyestimationforthousandsofsamples AT speidell methodforgenomewidegenealogyestimationforthousandsofsamples AT forestm methodforgenomewidegenealogyestimationforthousandsofsamples AT shis methodforgenomewidegenealogyestimationforthousandsofsamples AT myerss methodforgenomewidegenealogyestimationforthousandsofsamples |