An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.

A substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used...

Full description

Bibliographic Details
Main Authors: Zeggini, E, Rayner, N, Morris, A, Hattersley, A, Walker, M, Hitman, G, Deloukas, P, Cardon, L, McCarthy, M
Format: Journal article
Language:English
Published: 2005
_version_ 1797098411029168128
author Zeggini, E
Rayner, N
Morris, A
Hattersley, A
Walker, M
Hitman, G
Deloukas, P
Cardon, L
McCarthy, M
author_facet Zeggini, E
Rayner, N
Morris, A
Hattersley, A
Walker, M
Hitman, G
Deloukas, P
Cardon, L
McCarthy, M
author_sort Zeggini, E
collection OXFORD
description A substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used in the HapMap project are sufficient to capture common variation, but that performance declines substantially for variants with minor allele frequencies of <5%.
first_indexed 2024-03-07T05:09:09Z
format Journal article
id oxford-uuid:daf7ae35-35a6-46fb-bbf1-50933f9b851d
institution University of Oxford
language English
last_indexed 2024-03-07T05:09:09Z
publishDate 2005
record_format dspace
spelling oxford-uuid:daf7ae35-35a6-46fb-bbf1-50933f9b851d2022-03-27T09:06:59ZAn evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:daf7ae35-35a6-46fb-bbf1-50933f9b851dEnglishSymplectic Elements at Oxford2005Zeggini, ERayner, NMorris, AHattersley, AWalker, MHitman, GDeloukas, PCardon, LMcCarthy, MA substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used in the HapMap project are sufficient to capture common variation, but that performance declines substantially for variants with minor allele frequencies of <5%.
spellingShingle Zeggini, E
Rayner, N
Morris, A
Hattersley, A
Walker, M
Hitman, G
Deloukas, P
Cardon, L
McCarthy, M
An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title_full An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title_fullStr An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title_full_unstemmed An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title_short An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
title_sort evaluation of hapmap sample size and tagging snp performance in large scale empirical and simulated data sets
work_keys_str_mv AT zegginie anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT raynern anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT morrisa anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT hattersleya anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT walkerm anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT hitmang anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT deloukasp anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT cardonl anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT mccarthym anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT zegginie evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT raynern evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT morrisa evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT hattersleya evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT walkerm evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT hitmang evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT deloukasp evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT cardonl evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets
AT mccarthym evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets