Magnitude of stratification in human populations and impacts on genome wide association studies.

Genome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and H...

Full description

Bibliographic Details
Main Authors: Ke Hao, Eugene Chudin, Danielle Greenawalt, Eric E Schadt
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2805717?pdf=render
_version_ 1819236937606627328
author Ke Hao
Eugene Chudin
Danielle Greenawalt
Eric E Schadt
author_facet Ke Hao
Eugene Chudin
Danielle Greenawalt
Eric E Schadt
author_sort Ke Hao
collection DOAJ
description Genome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and Hispanic Americans (HA), respectively, and genotyped using Illumina650Y (Ilmn650Y) arrays. RNA was also isolated and hybridized to Agilent whole-genome gene expression arrays. We propose a new method (i.e., hgdp-eigen) for detecting PS by projecting genotype vectors for each sample to the eigenvector space defined by the Human Genetic Diversity Panel (HGDP). Further, we conducted GWAS to map expression quantitative trait loci (eQTL) for the approximately 40,000 liver gene expression traits monitored by the Agilent arrays. HGDP-eigen performed similarly to the conventional self-eigen methods in capturing PS. However, leveraging the HGDP offered a significant advantage in revealing the origins, directions and magnitude of PS. Adjusting for eigenvectors had minor impacts on eQTL detection rates in CA. In contrast, for AA and HA, adjustment dramatically reduced association findings. At an FDR = 10%, we identified 65 eQTLs in AA with the unadjusted analysis, but only 18 eQTLs after the eigenvector adjustment. Strikingly, 55 out of the 65 unadjusted AA eQTLs were validated in CA, indicating that the adjustment procedure significantly reduced GWAS power. A number of the 55 AA eQTLs validated in CA overlapped with published disease associated SNPs. For example, rs646776 and rs10903129 have previously been associated with lipid levels and coronary heart disease risk, however, the rs10903129 eQTL was missed in the eigenvector adjusted analysis.
first_indexed 2024-12-23T13:12:22Z
format Article
id doaj.art-eefe934b6d4b48c785aea7fddf8fe131
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-23T13:12:22Z
publishDate 2010-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-eefe934b6d4b48c785aea7fddf8fe1312022-12-21T17:45:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-01-0151e869510.1371/journal.pone.0008695Magnitude of stratification in human populations and impacts on genome wide association studies.Ke HaoEugene ChudinDanielle GreenawaltEric E SchadtGenome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and Hispanic Americans (HA), respectively, and genotyped using Illumina650Y (Ilmn650Y) arrays. RNA was also isolated and hybridized to Agilent whole-genome gene expression arrays. We propose a new method (i.e., hgdp-eigen) for detecting PS by projecting genotype vectors for each sample to the eigenvector space defined by the Human Genetic Diversity Panel (HGDP). Further, we conducted GWAS to map expression quantitative trait loci (eQTL) for the approximately 40,000 liver gene expression traits monitored by the Agilent arrays. HGDP-eigen performed similarly to the conventional self-eigen methods in capturing PS. However, leveraging the HGDP offered a significant advantage in revealing the origins, directions and magnitude of PS. Adjusting for eigenvectors had minor impacts on eQTL detection rates in CA. In contrast, for AA and HA, adjustment dramatically reduced association findings. At an FDR = 10%, we identified 65 eQTLs in AA with the unadjusted analysis, but only 18 eQTLs after the eigenvector adjustment. Strikingly, 55 out of the 65 unadjusted AA eQTLs were validated in CA, indicating that the adjustment procedure significantly reduced GWAS power. A number of the 55 AA eQTLs validated in CA overlapped with published disease associated SNPs. For example, rs646776 and rs10903129 have previously been associated with lipid levels and coronary heart disease risk, however, the rs10903129 eQTL was missed in the eigenvector adjusted analysis.http://europepmc.org/articles/PMC2805717?pdf=render
spellingShingle Ke Hao
Eugene Chudin
Danielle Greenawalt
Eric E Schadt
Magnitude of stratification in human populations and impacts on genome wide association studies.
PLoS ONE
title Magnitude of stratification in human populations and impacts on genome wide association studies.
title_full Magnitude of stratification in human populations and impacts on genome wide association studies.
title_fullStr Magnitude of stratification in human populations and impacts on genome wide association studies.
title_full_unstemmed Magnitude of stratification in human populations and impacts on genome wide association studies.
title_short Magnitude of stratification in human populations and impacts on genome wide association studies.
title_sort magnitude of stratification in human populations and impacts on genome wide association studies
url http://europepmc.org/articles/PMC2805717?pdf=render
work_keys_str_mv AT kehao magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT eugenechudin magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT daniellegreenawalt magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT ericeschadt magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies