Mapping eQTL by leveraging multiple tissues and DNA methylation

Abstract Background DNA methylation is an important tissue-specific epigenetic event that influences transcriptional regulation of gene expression. Differentially methylated CpG sites may act as mediators between genetic variation and gene expression, and this relationship can be exploited while map...

Bibliographic Details
Main Authors: Chaitanya R. Acharya, Kouros Owzar, Andrew S. Allen
Format: Article
Language: English
Published: BMC 2017-10-01
Series: BMC Bioinformatics
Online Access: http://link.springer.com/article/10.1186/s12859-017-1856-9
collection DOAJ
description Abstract
Background: DNA methylation is an important tissue-specific epigenetic event that influences transcriptional regulation of gene expression. Differentially methylated CpG sites may act as mediators between genetic variation and gene expression, and this relationship can be exploited when mapping multi-tissue expression quantitative trait loci (eQTL). Current multi-tissue eQTL mapping techniques exploit only gene expression patterns across multiple tissues, in either a joint-tissue or a tissue-by-tissue framework. We present a new statistical approach that models the effect of germ-line variation on tissue-specific gene expression in the presence of effects due to DNA methylation.
Results: Our method efficiently models genetic and epigenetic variation to identify genomic regions of interest containing combinations of mRNA transcripts, CpG sites, and SNPs by jointly testing for a genotypic effect and for higher-order interaction effects between genotype, methylation, and tissue. Using Monte Carlo simulations, we demonstrate that, in the presence of both genetic and DNA methylation effects, our approach has greater statistical power to detect eQTLs than current eQTL mapping approaches. When applied to an array-based dataset from 150 neuropathologically normal adult human brains, our method identifies eQTLs that were undetected by standard tissue-by-tissue or joint-tissue eQTL mapping techniques. As an example, our method identifies eQTLs by leveraging methylated CpG sites in a LIM homeobox member gene (LHX9), which may have a role in neural development.
Conclusions: Our score test-based approach does not need parameter estimation under the alternative hypothesis. As a result, our model parameters are estimated only once for each mRNA-CpG pair. Our model specifically studies the effects of non-coding regions of DNA (in this case, CpG sites) on mapping eQTLs. However, we can easily model micro-RNAs instead of CpG sites to study the effects of post-transcriptional events in mapping eQTLs. Our model's flexible framework also allows us to investigate other genomic events, such as alternative gene splicing, by extending the model to include gene isoform-specific data.
first_indexed 2024-12-17T08:37:50Z
id doaj.art-6374b84fecdf4986a1318eae9e8ac3e9
institution Directory Open Access Journal
issn 1471-2105
last_indexed 2024-12-17T08:37:50Z
spelling doaj.art-6374b84fecdf4986a1318eae9e8ac3e9
2022-12-21T21:56:26Z
Chaitanya R. Acharya (Program in Computational Biology and Bioinformatics, Duke University)
Kouros Owzar (Department of Biostatistics and Bioinformatics, Duke University)
Andrew S. Allen (Program in Computational Biology and Bioinformatics, Duke University)
doi 10.1186/s12859-017-1856-9
topic eQTL
Multiple tissues
Tissue-specificity
DNA methylation
CpG islands
Gene expression