A method for genome-wide genealogy estimation for thousands of samples

Knowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, m...

Full description

Bibliographic Details
Main Authors: Speidel, L, Forest, M, Shi, S, Myers, S
Format: Journal article
Language:English
Published: Springer Nature 2019
Subjects:
_version_ 1797101867343282176
author Speidel, L
Forest, M
Shi, S
Myers, S
author_facet Speidel, L
Forest, M
Shi, S
Myers, S
author_sort Speidel, L
collection OXFORD
description Knowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, mutational ages, and variable historical population sizes, as well as allowing for data errors. Application to 1000 Genomes Project haplotypes produces joint genealogical histories for 26 human populations. Highly diverged lineages are present in all groups, but most frequent in Africa. Outside Africa, these mainly reflect ancient introgression from groups related to Neanderthals and Denisovans, while African signals instead reflect unknown events, unique to that continent. Our approach allows more powerful inferences of natural selection than previously possible. We identify multiple novel regions under strong positive selection, and multi-allelic traits including hair color, body mass index (BMI), and blood pressure, showing strong evidence of directional selection, varying among human groups.
first_indexed 2024-03-07T05:58:01Z
format Journal article
id oxford-uuid:eb303c0c-f5e5-45b6-80b4-686bca64af7e
institution University of Oxford
language English
last_indexed 2024-03-07T05:58:01Z
publishDate 2019
publisher Springer Nature
record_format dspace
spelling oxford-uuid:eb303c0c-f5e5-45b6-80b4-686bca64af7e2022-03-27T11:07:52ZA method for genome-wide genealogy estimation for thousands of samplesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:eb303c0c-f5e5-45b6-80b4-686bca64af7epopulation geneticscomputational biology and bioinformaticsgenomicsEnglishSymplectic Elements at OxfordSpringer Nature2019Speidel, LForest, MShi, SMyers, SKnowledge of genome-wide genealogies for thousands of individuals would simplify most evolutionary analyses for humans and other species, but has remained computationally infeasible. We developed a method, Relate, scaling to > 10,000 sequences while simultaneously estimating branch lengths, mutational ages, and variable historical population sizes, as well as allowing for data errors. Application to 1000 Genomes Project haplotypes produces joint genealogical histories for 26 human populations. Highly diverged lineages are present in all groups, but most frequent in Africa. Outside Africa, these mainly reflect ancient introgression from groups related to Neanderthals and Denisovans, while African signals instead reflect unknown events, unique to that continent. Our approach allows more powerful inferences of natural selection than previously possible. We identify multiple novel regions under strong positive selection, and multi-allelic traits including hair color, body mass index (BMI), and blood pressure, showing strong evidence of directional selection, varying among human groups.
spellingShingle population genetics
computational biology and bioinformatics
genomics
Speidel, L
Forest, M
Shi, S
Myers, S
A method for genome-wide genealogy estimation for thousands of samples
title A method for genome-wide genealogy estimation for thousands of samples
title_full A method for genome-wide genealogy estimation for thousands of samples
title_fullStr A method for genome-wide genealogy estimation for thousands of samples
title_full_unstemmed A method for genome-wide genealogy estimation for thousands of samples
title_short A method for genome-wide genealogy estimation for thousands of samples
title_sort method for genome wide genealogy estimation for thousands of samples
topic population genetics
computational biology and bioinformatics
genomics
work_keys_str_mv AT speidell amethodforgenomewidegenealogyestimationforthousandsofsamples
AT forestm amethodforgenomewidegenealogyestimationforthousandsofsamples
AT shis amethodforgenomewidegenealogyestimationforthousandsofsamples
AT myerss amethodforgenomewidegenealogyestimationforthousandsofsamples
AT speidell methodforgenomewidegenealogyestimationforthousandsofsamples
AT forestm methodforgenomewidegenealogyestimationforthousandsofsamples
AT shis methodforgenomewidegenealogyestimationforthousandsofsamples
AT myerss methodforgenomewidegenealogyestimationforthousandsofsamples