Along the Bos taurus genome, uncover candidate imprinting control regions

Abstract Background In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs occurring as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). Thes...

Full description

Bibliographic Details
Main Authors: Phillip Wyss, Carol Song, Minou Bina
Format: Article
Language:English
Published: BMC 2022-06-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-022-08694-3
_version_ 1828469102585315328
author Phillip Wyss
Carol Song
Minou Bina
author_facet Phillip Wyss
Carol Song
Minou Bina
author_sort Phillip Wyss
collection DOAJ
description Abstract Background In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs occurring as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). These motifs consist of the ZFP57 binding site (ZFBS) overlapping a subset of MLL binding units known as MLL morphemes. MLL or MLL1 (Mixed Lineage Leukemia 1) is a relatively large multidomain protein that plays a central role in the regulation of transcription. The structures of both MLL1 and MLL2 include a domain (MT) that binds CpG-rich DNA and a conserved domain (SET) that methylates lysine 4 in histone H3 producing H3K4me3 marks in chromatin. Results Since genomic imprinting impacts many developmental and key physiological processes, we followed a previous bioinformatics strategy to pinpoint ICR positions in the Bos taurus genome. Initial genome-wide analyses involved finding the positions of ZFP57 binding sites, and the CpG-rich motifs (ZFBS-morph overlaps) along cattle chromosomal DNA. By creating plots displaying the density of ZFBS-morph overlaps, we removed background noise and thus improved signal detection. With the density-plots, we could view the positions of peaks locating known and candidate ICRs in cattle DNA. Our evaluations revealed the correspondence of peaks in plots to reported known and inferred ICRs/DMRs in cattle. Beside peaks pinpointing such ICRs, the density-plots also revealed additional peaks. Since evaluations validated the robustness of our approach, we inferred that the additional peaks may correspond to candidate ICRs for imprinted gene expression. Conclusion Our bioinformatics strategy offers the first genome-wide approach for systematically localizing candidate ICRs. Furthermore, we have tailored our datasets for upload onto the UCSC genome browser so that researchers could find known and candidate ICRs with respect to a wide variety of annotations at all scales: from the positions of Single Nucleotide Polymorphisms (SNPs), to positions of genes, transcripts, and repeated DNA elements. Furthermore, the UCSC genome browser offers tools to produce enlarged views: to uncover the genes in the vicinity of candidate ICRs and thus discover potential imprinted genes for experimental validations.
first_indexed 2024-12-11T04:34:08Z
format Article
id doaj.art-2a2f01e66bb44325a76333db2ff78c46
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-11T04:34:08Z
publishDate 2022-06-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-2a2f01e66bb44325a76333db2ff78c462022-12-22T01:20:47ZengBMCBMC Genomics1471-21642022-06-0123112110.1186/s12864-022-08694-3Along the Bos taurus genome, uncover candidate imprinting control regionsPhillip Wyss0Carol Song1Minou Bina2Department of Chemistry, Purdue UniversityInformation Technology, Purdue UniversityDepartment of Chemistry, Purdue UniversityAbstract Background In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs occurring as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). These motifs consist of the ZFP57 binding site (ZFBS) overlapping a subset of MLL binding units known as MLL morphemes. MLL or MLL1 (Mixed Lineage Leukemia 1) is a relatively large multidomain protein that plays a central role in the regulation of transcription. The structures of both MLL1 and MLL2 include a domain (MT) that binds CpG-rich DNA and a conserved domain (SET) that methylates lysine 4 in histone H3 producing H3K4me3 marks in chromatin. Results Since genomic imprinting impacts many developmental and key physiological processes, we followed a previous bioinformatics strategy to pinpoint ICR positions in the Bos taurus genome. Initial genome-wide analyses involved finding the positions of ZFP57 binding sites, and the CpG-rich motifs (ZFBS-morph overlaps) along cattle chromosomal DNA. By creating plots displaying the density of ZFBS-morph overlaps, we removed background noise and thus improved signal detection. With the density-plots, we could view the positions of peaks locating known and candidate ICRs in cattle DNA. Our evaluations revealed the correspondence of peaks in plots to reported known and inferred ICRs/DMRs in cattle. Beside peaks pinpointing such ICRs, the density-plots also revealed additional peaks. Since evaluations validated the robustness of our approach, we inferred that the additional peaks may correspond to candidate ICRs for imprinted gene expression. Conclusion Our bioinformatics strategy offers the first genome-wide approach for systematically localizing candidate ICRs. Furthermore, we have tailored our datasets for upload onto the UCSC genome browser so that researchers could find known and candidate ICRs with respect to a wide variety of annotations at all scales: from the positions of Single Nucleotide Polymorphisms (SNPs), to positions of genes, transcripts, and repeated DNA elements. Furthermore, the UCSC genome browser offers tools to produce enlarged views: to uncover the genes in the vicinity of candidate ICRs and thus discover potential imprinted genes for experimental validations.https://doi.org/10.1186/s12864-022-08694-3Cattle genomicsCNR1DGAT1HMGA2KAP1KMT2A
spellingShingle Phillip Wyss
Carol Song
Minou Bina
Along the Bos taurus genome, uncover candidate imprinting control regions
BMC Genomics
Cattle genomics
CNR1
DGAT1
HMGA2
KAP1
KMT2A
title Along the Bos taurus genome, uncover candidate imprinting control regions
title_full Along the Bos taurus genome, uncover candidate imprinting control regions
title_fullStr Along the Bos taurus genome, uncover candidate imprinting control regions
title_full_unstemmed Along the Bos taurus genome, uncover candidate imprinting control regions
title_short Along the Bos taurus genome, uncover candidate imprinting control regions
title_sort along the bos taurus genome uncover candidate imprinting control regions
topic Cattle genomics
CNR1
DGAT1
HMGA2
KAP1
KMT2A
url https://doi.org/10.1186/s12864-022-08694-3
work_keys_str_mv AT phillipwyss alongthebostaurusgenomeuncovercandidateimprintingcontrolregions
AT carolsong alongthebostaurusgenomeuncovercandidateimprintingcontrolregions
AT minoubina alongthebostaurusgenomeuncovercandidateimprintingcontrolregions