An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population.

The Haplotype Map (HapMap) project recently generated genotype data for more than 1 million single-nucleotide polymorphisms (SNPs) in four population samples. The main application of the data is in the selection of tag single-nucleotide polymorphisms (tSNPs) to use in association studies. The useful...

Bibliographic Details
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2006-03-01
Series: PLoS Genetics
Online Access: http://dx.doi.org/10.1371/journal.pgen.0020027
collection DOAJ
description The Haplotype Map (HapMap) project recently generated genotype data for more than 1 million single-nucleotide polymorphisms (SNPs) in four population samples. The main application of the data is in the selection of tag single-nucleotide polymorphisms (tSNPs) to use in association studies. The usefulness of this selection process needs to be verified in populations outside those used for the HapMap project. In addition, it is not known how well the data represent the general population, because only 90-120 chromosomes were used for each population and the genotyped SNPs were selected to have high frequencies. In this study, we analyzed more than 1,000 individuals from Estonia. The population of this northern European country has been influenced by many waves of migration from Europe and Russia. We genotyped 1,536 randomly selected SNPs from two 500-kbp ENCODE regions on Chromosome 2. We observed that the tSNPs selected from the CEPH (Centre d'Etude du Polymorphisme Humain) from Utah (CEU) HapMap samples (derived from US residents with northern and western European ancestry) captured most of the variation in the Estonia sample. (Between 90% and 95% of the SNPs with a minor allele frequency of more than 5% have an r² of at least 0.8 with one of the CEU tSNPs.) Using the reverse approach, tags selected from the Estonia sample described the CEU sample almost equally well. Finally, we observed that the sample size, the allelic frequency, and the SNP density in the dataset used to select the tags each have important effects on the tagging performance. Overall, our study supports the use of HapMap data in other Caucasian populations, but the SNP density and the bias towards high-frequency SNPs have to be taken into account when designing association studies.
first_indexed 2024-12-13T06:11:47Z
format Article
id doaj.art-1eb84bb6c6da455ba9562fd0a063bfc8
institution Directory Open Access Journal
issn 1553-7390, 1553-7404
language English
last_indexed 2024-12-13T06:11:47Z
publishDate 2006-03-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-1eb84bb6c6da455ba9562fd0a063bfc8 2022-12-21T23:57:04Z eng Public Library of Science (PLoS) PLoS Genetics 1553-7390 1553-7404 2006-03-01 2 3 e27 An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population. http://dx.doi.org/10.1371/journal.pgen.0020027
title An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population.
url http://dx.doi.org/10.1371/journal.pgen.0020027
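
The coverage figure quoted in the description (the share of SNPs with a minor allele frequency above 5% that reach an r² of at least 0.8 with some tag SNP) can be illustrated with a small sketch. This is not the authors' pipeline: it assumes unphased genotypes coded as minor-allele counts (0, 1, 2) and uses the squared Pearson correlation of those counts as a stand-in for r², whereas the study may have used haplotype-based r². The function names, SNP identifiers, and toy genotypes are hypothetical.

```python
from statistics import mean


def minor_allele_freq(dosages):
    """Minor allele frequency from per-individual allele counts (0, 1, or 2)."""
    p = sum(dosages) / (2 * len(dosages))
    return min(p, 1 - p)


def r_squared(a, b):
    """Squared Pearson correlation between two dosage vectors.

    Assumption: genotype correlation is used as an approximation of the
    haplotype-based r^2 usually reported in tagging studies.
    """
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # monomorphic SNP: correlation undefined, treat as untagged
    return cov * cov / (var_a * var_b)


def tagging_coverage(genotypes, tag_ids, maf_min=0.05, r2_min=0.8):
    """Fraction of SNPs with MAF > maf_min that have r^2 >= r2_min with any tag.

    A tag SNP counts as capturing itself (r^2 = 1).
    """
    common = [s for s, g in genotypes.items() if minor_allele_freq(g) > maf_min]
    if not common:
        return 0.0
    captured = sum(
        1
        for s in common
        if any(r_squared(genotypes[s], genotypes[t]) >= r2_min for t in tag_ids)
    )
    return captured / len(common)


# Hypothetical toy data: 8 individuals, 4 SNPs.
genotypes = {
    "snp_a": [0, 1, 2, 0, 1, 2, 0, 1],
    "snp_b": [0, 1, 2, 0, 1, 2, 0, 1],  # identical to snp_a (perfect LD)
    "snp_c": [2, 0, 1, 1, 0, 2, 1, 0],  # nearly uncorrelated with snp_a
    "snp_d": [0, 0, 0, 0, 0, 0, 0, 0],  # monomorphic, excluded by the MAF filter
}

coverage = tagging_coverage(genotypes, tag_ids=["snp_a"])
print(f"coverage with tag snp_a: {coverage:.3f}")
```

In the cross-population setting described above, the tags would be chosen in one sample (for example CEU) and the coverage computed on the genotypes of the other (for example Estonia).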