A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.

The impact of erroneous genotypes having passed standard quality control (QC) can be severe in genome-wide association studies, genotype imputation, and estimation of heritability and prediction of genetic risk based on single nucleotide polymorphisms (SNP). To detect such genotyping errors, a simpl...

Olles dieđut

Bibliográfalaš dieđut
Váldodahkkit: Lee, S, Nyholt, DR, Macgregor, S, Henders, A, Zondervan, K, Montgomery, G, Visscher, P
Materiálatiipa: Journal article
Giella:English
Almmustuhtton: 2010
_version_ 1826293802756734976
author Lee, S
Nyholt, DR
Macgregor, S
Henders, A
Zondervan, K
Montgomery, G
Visscher, P
author_facet Lee, S
Nyholt, DR
Macgregor, S
Henders, A
Zondervan, K
Montgomery, G
Visscher, P
author_sort Lee, S
collection OXFORD
description The impact of erroneous genotypes having passed standard quality control (QC) can be severe in genome-wide association studies, genotype imputation, and estimation of heritability and prediction of genetic risk based on single nucleotide polymorphisms (SNP). To detect such genotyping errors, a simple two-locus QC method, based on the difference in test statistic of association between single SNPs and pairs of SNPs, was developed and applied. The proposed approach could detect many problematic SNPs with statistical significance even when standard single SNP QC analyses fail to detect them in real data. Depending on the data set used, the number of erroneous SNPs that were not filtered out by standard single SNP QC but detected by the proposed approach varied from a few hundred to thousands. Using simulated data, it was shown that the proposed method was powerful and performed better than other tested existing methods. The power of the proposed approach to detect erroneous genotypes was ∼80% for a 3% error rate per SNP. This novel QC approach is easy to implement and computationally efficient, and can lead to a better quality of genotypes for subsequent genotype-phenotype investigations.
first_indexed 2024-03-07T03:35:47Z
format Journal article
id oxford-uuid:bc3ba18e-b742-41dd-8a2e-d6c1d46186b7
institution University of Oxford
language English
last_indexed 2024-03-07T03:35:47Z
publishDate 2010
record_format dspace
spelling oxford-uuid:bc3ba18e-b742-41dd-8a2e-d6c1d46186b72022-03-27T05:22:53ZA simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:bc3ba18e-b742-41dd-8a2e-d6c1d46186b7EnglishSymplectic Elements at Oxford2010Lee, SNyholt, DRMacgregor, SHenders, AZondervan, KMontgomery, GVisscher, PThe impact of erroneous genotypes having passed standard quality control (QC) can be severe in genome-wide association studies, genotype imputation, and estimation of heritability and prediction of genetic risk based on single nucleotide polymorphisms (SNP). To detect such genotyping errors, a simple two-locus QC method, based on the difference in test statistic of association between single SNPs and pairs of SNPs, was developed and applied. The proposed approach could detect many problematic SNPs with statistical significance even when standard single SNP QC analyses fail to detect them in real data. Depending on the data set used, the number of erroneous SNPs that were not filtered out by standard single SNP QC but detected by the proposed approach varied from a few hundred to thousands. Using simulated data, it was shown that the proposed method was powerful and performed better than other tested existing methods. The power of the proposed approach to detect erroneous genotypes was ∼80% for a 3% error rate per SNP. This novel QC approach is easy to implement and computationally efficient, and can lead to a better quality of genotypes for subsequent genotype-phenotype investigations.
spellingShingle Lee, S
Nyholt, DR
Macgregor, S
Henders, A
Zondervan, K
Montgomery, G
Visscher, P
A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title_full A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title_fullStr A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title_full_unstemmed A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title_short A simple and fast two-locus quality control test to detect false positives due to batch effects in genome-wide association studies.
title_sort simple and fast two locus quality control test to detect false positives due to batch effects in genome wide association studies
work_keys_str_mv AT lees asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT nyholtdr asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT macgregors asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT hendersa asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT zondervank asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT montgomeryg asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT visscherp asimpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT lees simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT nyholtdr simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT macgregors simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT hendersa simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT zondervank simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT montgomeryg simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies
AT visscherp simpleandfasttwolocusqualitycontroltesttodetectfalsepositivesduetobatcheffectsingenomewideassociationstudies