An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.
A substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used...
Main Authors: | , , , , , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
2005
|
_version_ | 1797098411029168128 |
---|---|
author | Zeggini, E Rayner, N Morris, A Hattersley, A Walker, M Hitman, G Deloukas, P Cardon, L McCarthy, M |
author_facet | Zeggini, E Rayner, N Morris, A Hattersley, A Walker, M Hitman, G Deloukas, P Cardon, L McCarthy, M |
author_sort | Zeggini, E |
collection | OXFORD |
description | A substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used in the HapMap project are sufficient to capture common variation, but that performance declines substantially for variants with minor allele frequencies of <5%. |
first_indexed | 2024-03-07T05:09:09Z |
format | Journal article |
id | oxford-uuid:daf7ae35-35a6-46fb-bbf1-50933f9b851d |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T05:09:09Z |
publishDate | 2005 |
record_format | dspace |
spelling | oxford-uuid:daf7ae35-35a6-46fb-bbf1-50933f9b851d2022-03-27T09:06:59ZAn evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:daf7ae35-35a6-46fb-bbf1-50933f9b851dEnglishSymplectic Elements at Oxford2005Zeggini, ERayner, NMorris, AHattersley, AWalker, MHitman, GDeloukas, PCardon, LMcCarthy, MA substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used in the HapMap project are sufficient to capture common variation, but that performance declines substantially for variants with minor allele frequencies of <5%. |
spellingShingle | Zeggini, E Rayner, N Morris, A Hattersley, A Walker, M Hitman, G Deloukas, P Cardon, L McCarthy, M An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title | An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title_full | An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title_fullStr | An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title_full_unstemmed | An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title_short | An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. |
title_sort | evaluation of hapmap sample size and tagging snp performance in large scale empirical and simulated data sets |
work_keys_str_mv | AT zegginie anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT raynern anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT morrisa anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT hattersleya anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT walkerm anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT hitmang anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT deloukasp anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT cardonl anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT mccarthym anevaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT zegginie evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT raynern evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT morrisa evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT hattersleya evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT walkerm evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT hitmang evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT deloukasp evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT cardonl evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets AT mccarthym evaluationofhapmapsamplesizeandtaggingsnpperformanceinlargescaleempiricalandsimulateddatasets |