Haplotype estimation for biobank-scale data sets

The UK Biobank (UKB) has recently released genotypes on 152,328 individuals together with extensive phenotypic and lifestyle information. We present a new phasing method, SHAPEIT3, that can handle such biobank-scale data sets and results in switch error rates as low as ∼0.3%. The method exhibits O(N...

Full description

Bibliographic Details
Main Authors: O'Connell, J, Sharp, K, Shrine, N, Wain, L, Hall, I, Tobin, M, Zagury, J, Delaneau, O, Marchini, J
Format: Journal article
Language:English
Published: Nature Publishing Group 2016
_version_ 1797050930165710848
author O'Connell, J
Sharp, K
Shrine, N
Wain, L
Hall, I
Tobin, M
Zagury, J
Delaneau, O
Marchini, J
author_facet O'Connell, J
Sharp, K
Shrine, N
Wain, L
Hall, I
Tobin, M
Zagury, J
Delaneau, O
Marchini, J
author_sort O'Connell, J
collection OXFORD
description The UK Biobank (UKB) has recently released genotypes on 152,328 individuals together with extensive phenotypic and lifestyle information. We present a new phasing method, SHAPEIT3, that can handle such biobank-scale data sets and results in switch error rates as low as ∼0.3%. The method exhibits O(NlogN) scaling with sample size N, enabling fast and accurate phasing of even larger cohorts.
first_indexed 2024-03-06T18:12:30Z
format Journal article
id oxford-uuid:0380d180-f5a4-4967-9aec-c9c1698bddd3
institution University of Oxford
language English
last_indexed 2024-03-06T18:12:30Z
publishDate 2016
publisher Nature Publishing Group
record_format dspace
spelling oxford-uuid:0380d180-f5a4-4967-9aec-c9c1698bddd32022-03-26T08:46:37ZHaplotype estimation for biobank-scale data setsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:0380d180-f5a4-4967-9aec-c9c1698bddd3EnglishSymplectic Elements at OxfordNature Publishing Group2016O'Connell, JSharp, KShrine, NWain, LHall, ITobin, MZagury, JDelaneau, OMarchini, JThe UK Biobank (UKB) has recently released genotypes on 152,328 individuals together with extensive phenotypic and lifestyle information. We present a new phasing method, SHAPEIT3, that can handle such biobank-scale data sets and results in switch error rates as low as ∼0.3%. The method exhibits O(NlogN) scaling with sample size N, enabling fast and accurate phasing of even larger cohorts.
spellingShingle O'Connell, J
Sharp, K
Shrine, N
Wain, L
Hall, I
Tobin, M
Zagury, J
Delaneau, O
Marchini, J
Haplotype estimation for biobank-scale data sets
title Haplotype estimation for biobank-scale data sets
title_full Haplotype estimation for biobank-scale data sets
title_fullStr Haplotype estimation for biobank-scale data sets
title_full_unstemmed Haplotype estimation for biobank-scale data sets
title_short Haplotype estimation for biobank-scale data sets
title_sort haplotype estimation for biobank scale data sets
work_keys_str_mv AT oconnellj haplotypeestimationforbiobankscaledatasets
AT sharpk haplotypeestimationforbiobankscaledatasets
AT shrinen haplotypeestimationforbiobankscaledatasets
AT wainl haplotypeestimationforbiobankscaledatasets
AT halli haplotypeestimationforbiobankscaledatasets
AT tobinm haplotypeestimationforbiobankscaledatasets
AT zaguryj haplotypeestimationforbiobankscaledatasets
AT delaneauo haplotypeestimationforbiobankscaledatasets
AT marchinij haplotypeestimationforbiobankscaledatasets