A flexible and accurate genotype imputation method for the next generation of genome-wide association studies.

Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000...


Bibliographic Details
Main Authors: Howie, B, Donnelly, P, Marchini, J
Format: Journal article
Language: English
Published: 2009
Description: Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%-20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions.
ID: oxford-uuid:5d0578c5-a55f-4982-9c88-a5e1df19ae95
Institution: University of Oxford