The UK Biobank resource with deep phenotyping and genomic data

The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-...

Full description

Bibliographic Details
Main Authors: Bycroft, C, Freeman, C, Petkova, D, Band, G, Elliott, L, Sharp, K, Motyer, A, Vukcevic, D, Delaneau, O, O'Connell, J, Cortes, A, Welsh, S, Young, A, Effingham, M, McVean, G, Leslie, S, Allen, N, Donnelly, P, Marchini, J
Format: Journal article
Language:English
Published: Nature Publishing Group 2018
_version_ 1797054968972181504
author Bycroft, C
Freeman, C
Petkova, D
Band, G
Elliott, L
Sharp, K
Motyer, A
Vukcevic, D
Delaneau, O
O'Connell, J
Cortes, A
Welsh, S
Young, A
Effingham, M
McVean, G
Leslie, S
Allen, N
Donnelly, P
Marchini, J
author_facet Bycroft, C
Freeman, C
Petkova, D
Band, G
Elliott, L
Sharp, K
Motyer, A
Vukcevic, D
Delaneau, O
O'Connell, J
Cortes, A
Welsh, S
Young, A
Effingham, M
McVean, G
Leslie, S
Allen, N
Donnelly, P
Marchini, J
author_sort Bycroft, C
collection OXFORD
description The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
first_indexed 2024-03-06T19:04:49Z
format Journal article
id oxford-uuid:14c6537f-ca7c-484f-bfec-25af424c0eee
institution University of Oxford
language English
last_indexed 2024-03-06T19:04:49Z
publishDate 2018
publisher Nature Publishing Group
record_format dspace
spelling oxford-uuid:14c6537f-ca7c-484f-bfec-25af424c0eee2022-03-26T10:21:45ZThe UK Biobank resource with deep phenotyping and genomic dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:14c6537f-ca7c-484f-bfec-25af424c0eeeEnglishSymplectic Elements at OxfordNature Publishing Group2018Bycroft, CFreeman, CPetkova, DBand, GElliott, LSharp, KMotyer, AVukcevic, DDelaneau, OO'Connell, JCortes, AWelsh, SYoung, AEffingham, MMcVean, GLeslie, SAllen, NDonnelly, PMarchini, JThe UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
spellingShingle Bycroft, C
Freeman, C
Petkova, D
Band, G
Elliott, L
Sharp, K
Motyer, A
Vukcevic, D
Delaneau, O
O'Connell, J
Cortes, A
Welsh, S
Young, A
Effingham, M
McVean, G
Leslie, S
Allen, N
Donnelly, P
Marchini, J
The UK Biobank resource with deep phenotyping and genomic data
title The UK Biobank resource with deep phenotyping and genomic data
title_full The UK Biobank resource with deep phenotyping and genomic data
title_fullStr The UK Biobank resource with deep phenotyping and genomic data
title_full_unstemmed The UK Biobank resource with deep phenotyping and genomic data
title_short The UK Biobank resource with deep phenotyping and genomic data
title_sort uk biobank resource with deep phenotyping and genomic data
work_keys_str_mv AT bycroftc theukbiobankresourcewithdeepphenotypingandgenomicdata
AT freemanc theukbiobankresourcewithdeepphenotypingandgenomicdata
AT petkovad theukbiobankresourcewithdeepphenotypingandgenomicdata
AT bandg theukbiobankresourcewithdeepphenotypingandgenomicdata
AT elliottl theukbiobankresourcewithdeepphenotypingandgenomicdata
AT sharpk theukbiobankresourcewithdeepphenotypingandgenomicdata
AT motyera theukbiobankresourcewithdeepphenotypingandgenomicdata
AT vukcevicd theukbiobankresourcewithdeepphenotypingandgenomicdata
AT delaneauo theukbiobankresourcewithdeepphenotypingandgenomicdata
AT oconnellj theukbiobankresourcewithdeepphenotypingandgenomicdata
AT cortesa theukbiobankresourcewithdeepphenotypingandgenomicdata
AT welshs theukbiobankresourcewithdeepphenotypingandgenomicdata
AT younga theukbiobankresourcewithdeepphenotypingandgenomicdata
AT effinghamm theukbiobankresourcewithdeepphenotypingandgenomicdata
AT mcveang theukbiobankresourcewithdeepphenotypingandgenomicdata
AT leslies theukbiobankresourcewithdeepphenotypingandgenomicdata
AT allenn theukbiobankresourcewithdeepphenotypingandgenomicdata
AT donnellyp theukbiobankresourcewithdeepphenotypingandgenomicdata
AT marchinij theukbiobankresourcewithdeepphenotypingandgenomicdata
AT bycroftc ukbiobankresourcewithdeepphenotypingandgenomicdata
AT freemanc ukbiobankresourcewithdeepphenotypingandgenomicdata
AT petkovad ukbiobankresourcewithdeepphenotypingandgenomicdata
AT bandg ukbiobankresourcewithdeepphenotypingandgenomicdata
AT elliottl ukbiobankresourcewithdeepphenotypingandgenomicdata
AT sharpk ukbiobankresourcewithdeepphenotypingandgenomicdata
AT motyera ukbiobankresourcewithdeepphenotypingandgenomicdata
AT vukcevicd ukbiobankresourcewithdeepphenotypingandgenomicdata
AT delaneauo ukbiobankresourcewithdeepphenotypingandgenomicdata
AT oconnellj ukbiobankresourcewithdeepphenotypingandgenomicdata
AT cortesa ukbiobankresourcewithdeepphenotypingandgenomicdata
AT welshs ukbiobankresourcewithdeepphenotypingandgenomicdata
AT younga ukbiobankresourcewithdeepphenotypingandgenomicdata
AT effinghamm ukbiobankresourcewithdeepphenotypingandgenomicdata
AT mcveang ukbiobankresourcewithdeepphenotypingandgenomicdata
AT leslies ukbiobankresourcewithdeepphenotypingandgenomicdata
AT allenn ukbiobankresourcewithdeepphenotypingandgenomicdata
AT donnellyp ukbiobankresourcewithdeepphenotypingandgenomicdata
AT marchinij ukbiobankresourcewithdeepphenotypingandgenomicdata