GPMiner: an integrated system for mining combinatorial <it>cis</it>-regulatory elements in mammalian gene group

<p>Abstract</p> <p>Background</p> <p>Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF...


Bibliographic Details
Main Authors: Lee Tzong-Yi, Chang Wen-Chi, Hsu Justin, Chang Tzu-Hao, Shien Dray-Ming
Format: Article
Language: English
Published: BMC 2012-01-01
Series: BMC Genomics
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they do not annotate or consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates groups of genes that share the same expression pattern. To investigate gene transcriptional regulation, an integrated system is needed that annotates regulatory features in a promoter sequence and detects co-regulation by TFs in a group of genes.</p> <p>Results</p> <p>This work presents a novel system that identifies TSSs and regulatory features in a promoter sequence and recognizes the co-occurrence of <it>cis</it>-regulatory elements in co-expressed genes. Three well-known TSS prediction tools are combined with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The system allows users to input a group of gene names/symbols so that the co-occurrence of TFBSs can be determined. Additionally, an input sequence can be analyzed for homology with experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After promoter regions are identified, regulatory features are visualized graphically to facilitate inspection of gene promoters.</p> <p>Conclusions</p> <p>GPMiner, which has a user-friendly input/output interface, offers numerous benefits for analyzing human and mouse promoters. The system is freely available at <url>http://GPMiner.mbc.nctu.edu.tw/</url>.</p>
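Several of the sequence features named in the abstract (GC content, CpG islands, over-represented hexamers) reduce to simple counts over a promoter sequence. The sketch below is only an illustration of how such features might be computed before being passed to a classifier; it is not GPMiner's implementation. The function names are hypothetical, and the CpG measure used here is the commonly cited observed/expected ratio, (number of CpG × sequence length) / (number of C × number of G), which the record itself does not specify.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CpG * N) / (#C * #G).

    Assumption: this is the commonly used CpG-island statistic, not
    necessarily the one GPMiner applies. Returns 0.0 when the
    sequence has no C or no G.
    """
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)


def hexamer_counts(seq: str) -> Counter:
    """Count overlapping 6-mers, skipping windows with non-ACGT symbols."""
    seq = seq.upper()
    return Counter(
        seq[i:i + 6]
        for i in range(len(seq) - 5)
        if set(seq[i:i + 6]) <= set("ACGT")
    )


if __name__ == "__main__":
    promoter = "TATAAAGCGCGCCGATCGGCGC"  # made-up example sequence
    print(gc_content(promoter))
    print(cpg_obs_exp(promoter))
    print(hexamer_counts(promoter).most_common(3))
```

Deciding whether a hexamer is over-represented would additionally require comparing these counts against a background set of sequences, which is left out of this sketch.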
first_indexed 2024-04-12T20:07:27Z
id doaj.art-f0c132eaf6284062867d2b64e16e7ee8
institution Directory Open Access Journal
issn 1471-2164
last_indexed 2024-04-12T20:07:27Z
spelling doaj.art-f0c132eaf6284062867d2b64e16e7ee8 | 2022-12-22T03:18:21Z | eng | BMC | BMC Genomics | 1471-2164 | 2012-01-01 | 13 | Suppl 1 | S3 | 10.1186/1471-2164-13-S1-S3