On testing for genetic association in case-control studies when population allele frequencies are known.

With the emergence of Biobanks alongside large-scale genome-wide association studies (GWAS) we will soon be in the enviable situation of obtaining precise estimates of population allele frequencies for SNPs which make up the panels in standard genotyping arrays, such as those produced from Illumina...

Full description

Bibliographic Details
Main Authors: Antonyuk, A, Holmes, C
Format: Journal article
Language:English
Published: 2009
_version_ 1797053050566737920
author Antonyuk, A
Holmes, C
author_facet Antonyuk, A
Holmes, C
author_sort Antonyuk, A
collection OXFORD
description With the emergence of Biobanks alongside large-scale genome-wide association studies (GWAS) we will soon be in the enviable situation of obtaining precise estimates of population allele frequencies for SNPs which make up the panels in standard genotyping arrays, such as those produced from Illumina and Affymetrix. For disease association studies it is well known that for rare diseases with known population minor allele frequencies (pMAFs) a case-only design is most powerful. That is, for a fixed budget the optimal procedure is to genotype only cases (affecteds). In such tests experimenters look for a divergence from allele distribution in cases from that of the known population pMAF; in order to test the null hypothesis of no association between the disease status and the allele frequency. However, what has not been previously characterized is the utility of controls (known unaffecteds) when available. In this study we consider frequentist and Bayesian statistical methods for testing for SNP genotype association when population MAFs are known and when both cases and controls are available. We demonstrate that for rare diseases the most powerful frequentist design is, somewhat counterintuitively, to actively discard the controls even though they contain information on the association. In contrast we develop a Bayesian test which uses all available information (cases and controls) and appears to exhibit uniformaly greater power than all frequentist methods we considered.
first_indexed 2024-03-06T18:38:41Z
format Journal article
id oxford-uuid:0c28e820-b0ef-4d22-a46b-ffdda4285178
institution University of Oxford
language English
last_indexed 2024-03-06T18:38:41Z
publishDate 2009
record_format dspace
spelling oxford-uuid:0c28e820-b0ef-4d22-a46b-ffdda42851782022-03-26T09:33:22ZOn testing for genetic association in case-control studies when population allele frequencies are known.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:0c28e820-b0ef-4d22-a46b-ffdda4285178EnglishSymplectic Elements at Oxford2009Antonyuk, AHolmes, CWith the emergence of Biobanks alongside large-scale genome-wide association studies (GWAS) we will soon be in the enviable situation of obtaining precise estimates of population allele frequencies for SNPs which make up the panels in standard genotyping arrays, such as those produced from Illumina and Affymetrix. For disease association studies it is well known that for rare diseases with known population minor allele frequencies (pMAFs) a case-only design is most powerful. That is, for a fixed budget the optimal procedure is to genotype only cases (affecteds). In such tests experimenters look for a divergence from allele distribution in cases from that of the known population pMAF; in order to test the null hypothesis of no association between the disease status and the allele frequency. However, what has not been previously characterized is the utility of controls (known unaffecteds) when available. In this study we consider frequentist and Bayesian statistical methods for testing for SNP genotype association when population MAFs are known and when both cases and controls are available. We demonstrate that for rare diseases the most powerful frequentist design is, somewhat counterintuitively, to actively discard the controls even though they contain information on the association. In contrast we develop a Bayesian test which uses all available information (cases and controls) and appears to exhibit uniformaly greater power than all frequentist methods we considered.
spellingShingle Antonyuk, A
Holmes, C
On testing for genetic association in case-control studies when population allele frequencies are known.
title On testing for genetic association in case-control studies when population allele frequencies are known.
title_full On testing for genetic association in case-control studies when population allele frequencies are known.
title_fullStr On testing for genetic association in case-control studies when population allele frequencies are known.
title_full_unstemmed On testing for genetic association in case-control studies when population allele frequencies are known.
title_short On testing for genetic association in case-control studies when population allele frequencies are known.
title_sort on testing for genetic association in case control studies when population allele frequencies are known
work_keys_str_mv AT antonyuka ontestingforgeneticassociationincasecontrolstudieswhenpopulationallelefrequenciesareknown
AT holmesc ontestingforgeneticassociationincasecontrolstudieswhenpopulationallelefrequenciesareknown