Genetic architecture of gene expression traits across diverse populations.

For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-Eu...

Full description

Bibliographic Details
Main Authors: Lauren S Mogil, Angela Andaleon, Alexa Badalamenti, Scott P Dickinson, Xiuqing Guo, Jerome I Rotter, W Craig Johnson, Hae Kyung Im, Yongmei Liu, Heather E Wheeler
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-08-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC6105030?pdf=render
_version_ 1819154308991549440
author Lauren S Mogil
Angela Andaleon
Alexa Badalamenti
Scott P Dickinson
Xiuqing Guo
Jerome I Rotter
W Craig Johnson
Hae Kyung Im
Yongmei Liu
Heather E Wheeler
author_facet Lauren S Mogil
Angela Andaleon
Alexa Badalamenti
Scott P Dickinson
Xiuqing Guo
Jerome I Rotter
W Craig Johnson
Hae Kyung Im
Yongmei Liu
Heather E Wheeler
author_sort Lauren S Mogil
collection DOAJ
description For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-European ancestry. We used data from the MESA (Multi-Ethnic Study of Atherosclerosis) cohort to characterize the genetic architecture of gene expression within and between diverse populations. Genotype and monocyte gene expression were available in individuals with African American (AFA, n = 233), Hispanic (HIS, n = 352), and European (CAU, n = 578) ancestry. We performed expression quantitative trait loci (eQTL) mapping in each population and show genetic correlation of gene expression depends on shared ancestry proportions. Using elastic net modeling with cross validation to optimize genotypic predictors of gene expression in each population, we show the genetic architecture of gene expression for most predictable genes is sparse. We found the best predicted gene in each population, TACSTD2 in AFA and CHURC1 in CAU and HIS, had similar prediction performance across populations with R2 > 0.8 in each population. However, we identified a subset of genes that are well-predicted in one population, but poorly predicted in another. We show these differences in predictive performance are due to allele frequency differences between populations. Using genotype weights trained in MESA to predict gene expression in independent populations showed that a training set with ancestry similar to the test set is better at predicting gene expression in test populations, demonstrating an urgent need for diverse population sampling in genomics. Our predictive models and performance statistics in diverse cohorts are made publicly available for use in transcriptome mapping methods at https://github.com/WheelerLab/DivPop.
first_indexed 2024-12-22T15:19:02Z
format Article
id doaj.art-2eb6769fcca84fc1bca33cec87f48f46
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-12-22T15:19:02Z
publishDate 2018-08-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-2eb6769fcca84fc1bca33cec87f48f462022-12-21T18:21:41ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042018-08-01148e100758610.1371/journal.pgen.1007586Genetic architecture of gene expression traits across diverse populations.Lauren S MogilAngela AndaleonAlexa BadalamentiScott P DickinsonXiuqing GuoJerome I RotterW Craig JohnsonHae Kyung ImYongmei LiuHeather E WheelerFor many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-European ancestry. We used data from the MESA (Multi-Ethnic Study of Atherosclerosis) cohort to characterize the genetic architecture of gene expression within and between diverse populations. Genotype and monocyte gene expression were available in individuals with African American (AFA, n = 233), Hispanic (HIS, n = 352), and European (CAU, n = 578) ancestry. We performed expression quantitative trait loci (eQTL) mapping in each population and show genetic correlation of gene expression depends on shared ancestry proportions. Using elastic net modeling with cross validation to optimize genotypic predictors of gene expression in each population, we show the genetic architecture of gene expression for most predictable genes is sparse. We found the best predicted gene in each population, TACSTD2 in AFA and CHURC1 in CAU and HIS, had similar prediction performance across populations with R2 > 0.8 in each population. However, we identified a subset of genes that are well-predicted in one population, but poorly predicted in another. We show these differences in predictive performance are due to allele frequency differences between populations. Using genotype weights trained in MESA to predict gene expression in independent populations showed that a training set with ancestry similar to the test set is better at predicting gene expression in test populations, demonstrating an urgent need for diverse population sampling in genomics. Our predictive models and performance statistics in diverse cohorts are made publicly available for use in transcriptome mapping methods at https://github.com/WheelerLab/DivPop.http://europepmc.org/articles/PMC6105030?pdf=render
spellingShingle Lauren S Mogil
Angela Andaleon
Alexa Badalamenti
Scott P Dickinson
Xiuqing Guo
Jerome I Rotter
W Craig Johnson
Hae Kyung Im
Yongmei Liu
Heather E Wheeler
Genetic architecture of gene expression traits across diverse populations.
PLoS Genetics
title Genetic architecture of gene expression traits across diverse populations.
title_full Genetic architecture of gene expression traits across diverse populations.
title_fullStr Genetic architecture of gene expression traits across diverse populations.
title_full_unstemmed Genetic architecture of gene expression traits across diverse populations.
title_short Genetic architecture of gene expression traits across diverse populations.
title_sort genetic architecture of gene expression traits across diverse populations
url http://europepmc.org/articles/PMC6105030?pdf=render
work_keys_str_mv AT laurensmogil geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT angelaandaleon geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT alexabadalamenti geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT scottpdickinson geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT xiuqingguo geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT jeromeirotter geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT wcraigjohnson geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT haekyungim geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT yongmeiliu geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT heatherewheeler geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations