Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.

DNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. Still today, the whole process of how DNA methyltransferases determine which region...

Full description

Bibliographic Details
Main Authors: Clemens Wrzodek, Finja Büchel, Georg Hinselmann, Johannes Eichner, Florian Mittag, Andreas Zell
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3340366?pdf=render
_version_ 1819230005711863808
author Clemens Wrzodek
Clemens Wrzodek
Finja Büchel
Georg Hinselmann
Johannes Eichner
Florian Mittag
Andreas Zell
author_facet Clemens Wrzodek
Clemens Wrzodek
Finja Büchel
Georg Hinselmann
Johannes Eichner
Florian Mittag
Andreas Zell
author_sort Clemens Wrzodek
collection DOAJ
description DNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. Still today, the whole process of how DNA methyltransferases determine which region should be methylated is not completely revealed. There are many hypotheses of which genomic features are correlated to the epigenome that have not yet been evaluated. Furthermore, many explorative approaches of measuring DNA methylation are limited to a subset of the genome and thus, cannot be employed, e.g., for genome-wide biomarker prediction methods. In this study, we evaluated the correlation of genetic, epigenetic and hypothesis-driven features to DNA methylation of CpG islands. To this end, various binary classifiers were trained and evaluated by cross-validation on a dataset comprising DNA methylation data for 190 CpG islands in HEPG2, HEK293, fibroblasts and leukocytes. We achieved an accuracy of up to 91% with an MCC of 0.8 using ten-fold cross-validation and ten repetitions. With these models, we extended the existing dataset to the whole genome and thus, predicted the methylation landscape for the given cell types. The method used for these predictions is also validated on another external whole-genome dataset. Our results reveal features correlated to DNA methylation and confirm or disprove various hypotheses of DNA methylation related features. This study confirms correlations between DNA methylation and histone modifications, DNA structure, DNA sequence, genomic attributes and CpG island properties. Furthermore, the method has been validated on a genome-wide dataset from the ENCODE consortium. The developed software, as well as the predicted datasets and a web-service to compare methylation states of CpG islands are available at http://www.cogsys.cs.uni-tuebingen.de/software/dna-methylation/.
first_indexed 2024-12-23T11:22:12Z
format Article
id doaj.art-f55ee9ca255f4dc98aa5a94acd70636a
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-23T11:22:12Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-f55ee9ca255f4dc98aa5a94acd70636a2022-12-21T17:49:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0174e3532710.1371/journal.pone.0035327Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.Clemens WrzodekClemens WrzodekFinja BüchelGeorg HinselmannJohannes EichnerFlorian MittagAndreas ZellDNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. Still today, the whole process of how DNA methyltransferases determine which region should be methylated is not completely revealed. There are many hypotheses of which genomic features are correlated to the epigenome that have not yet been evaluated. Furthermore, many explorative approaches of measuring DNA methylation are limited to a subset of the genome and thus, cannot be employed, e.g., for genome-wide biomarker prediction methods. In this study, we evaluated the correlation of genetic, epigenetic and hypothesis-driven features to DNA methylation of CpG islands. To this end, various binary classifiers were trained and evaluated by cross-validation on a dataset comprising DNA methylation data for 190 CpG islands in HEPG2, HEK293, fibroblasts and leukocytes. We achieved an accuracy of up to 91% with an MCC of 0.8 using ten-fold cross-validation and ten repetitions. With these models, we extended the existing dataset to the whole genome and thus, predicted the methylation landscape for the given cell types. The method used for these predictions is also validated on another external whole-genome dataset. Our results reveal features correlated to DNA methylation and confirm or disprove various hypotheses of DNA methylation related features. This study confirms correlations between DNA methylation and histone modifications, DNA structure, DNA sequence, genomic attributes and CpG island properties. Furthermore, the method has been validated on a genome-wide dataset from the ENCODE consortium. The developed software, as well as the predicted datasets and a web-service to compare methylation states of CpG islands are available at http://www.cogsys.cs.uni-tuebingen.de/software/dna-methylation/.http://europepmc.org/articles/PMC3340366?pdf=render
spellingShingle Clemens Wrzodek
Clemens Wrzodek
Finja Büchel
Georg Hinselmann
Johannes Eichner
Florian Mittag
Andreas Zell
Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
PLoS ONE
title Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
title_full Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
title_fullStr Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
title_full_unstemmed Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
title_short Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.
title_sort linking the epigenome to the genome correlation of different features to dna methylation of cpg islands
url http://europepmc.org/articles/PMC3340366?pdf=render
work_keys_str_mv AT clemenswrzodek linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT clemenswrzodek linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT finjabuchel linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT georghinselmann linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT johanneseichner linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT florianmittag linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands
AT andreaszell linkingtheepigenometothegenomecorrelationofdifferentfeaturestodnamethylationofcpgislands