<it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves

<p>Abstract</p> <p>Background</p> <p>The discovery of cis-regulatory motifs still remains a challenging task even though the number of sequenced genomes is constantly growing. Computational analyses using pattern search algorithms have been valuable in phylogenetic foot...

Full description

Bibliographic Details
Main Authors: Harter Klaus, Stüber Kurt, Berendzen Kenneth W, Wanke Dierk
Format: Article
Language:English
Published: BMC 2006-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/522
_version_ 1818187630086455296
author Harter Klaus
Stüber Kurt
Berendzen Kenneth W
Wanke Dierk
author_facet Harter Klaus
Stüber Kurt
Berendzen Kenneth W
Wanke Dierk
author_sort Harter Klaus
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The discovery of cis-regulatory motifs still remains a challenging task even though the number of sequenced genomes is constantly growing. Computational analyses using pattern search algorithms have been valuable in phylogenetic footprinting approaches as have expression profile experiments to predict co-occurring motifs. Surprisingly little is known about the nature of <it>cis</it>-regulatory element (CRE) distribution in promoters.</p> <p>Results</p> <p>In this paper we used the Motif Mapper open-source collection of visual basic scripts for the analysis of motifs in any aligned set of DNA sequences. We focused on promoter motif distribution curves to identify positional over-representation of DNA motifs. Using differentially aligned datasets from the model species <it>Arabidopsis thaliana</it>, <it>Caenorhabditis </it>elegans, <it>Drosophila melanogaster </it>and <it>Saccharomyces cerevisiae</it>, we convincingly demonstrated the importance of the position and orientation for motif discovery. Analysis with known CREs and all possible hexanucleotides showed that some functional elements gather close to the transcription and translation initiation sites and that elements other than the TATA-box motif are conserved between eukaryote promoters. While a high background frequency usually decreases the effectiveness of such an enumerative investigation, we improved our analysis by conducting motif distribution maps using large datasets.</p> <p>Conclusion</p> <p>This is the first study to reveal positional over-representation of CREs and promoter motifs in a cross-species approach. CREs and motifs shared between eukaryotic promoters support the observation that an eukaryotic promoter structure has been conserved throughout evolutionary time. Furthermore, with the information on positional enrichment of a motif or a known functional CRE, it is possible to get a more detailed insight into where an element appears to function. This in turn might accelerate the in depth examination of known and yet unknown <it>cis</it>-regulatory sequences in the laboratory.</p>
first_indexed 2024-12-11T23:14:05Z
format Article
id doaj.art-888440267daf48b0921efa4f354d6447
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-11T23:14:05Z
publishDate 2006-11-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-888440267daf48b0921efa4f354d64472022-12-22T00:46:37ZengBMCBMC Bioinformatics1471-21052006-11-017152210.1186/1471-2105-7-522<it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curvesHarter KlausStüber KurtBerendzen Kenneth WWanke Dierk<p>Abstract</p> <p>Background</p> <p>The discovery of cis-regulatory motifs still remains a challenging task even though the number of sequenced genomes is constantly growing. Computational analyses using pattern search algorithms have been valuable in phylogenetic footprinting approaches as have expression profile experiments to predict co-occurring motifs. Surprisingly little is known about the nature of <it>cis</it>-regulatory element (CRE) distribution in promoters.</p> <p>Results</p> <p>In this paper we used the Motif Mapper open-source collection of visual basic scripts for the analysis of motifs in any aligned set of DNA sequences. We focused on promoter motif distribution curves to identify positional over-representation of DNA motifs. Using differentially aligned datasets from the model species <it>Arabidopsis thaliana</it>, <it>Caenorhabditis </it>elegans, <it>Drosophila melanogaster </it>and <it>Saccharomyces cerevisiae</it>, we convincingly demonstrated the importance of the position and orientation for motif discovery. Analysis with known CREs and all possible hexanucleotides showed that some functional elements gather close to the transcription and translation initiation sites and that elements other than the TATA-box motif are conserved between eukaryote promoters. While a high background frequency usually decreases the effectiveness of such an enumerative investigation, we improved our analysis by conducting motif distribution maps using large datasets.</p> <p>Conclusion</p> <p>This is the first study to reveal positional over-representation of CREs and promoter motifs in a cross-species approach. CREs and motifs shared between eukaryotic promoters support the observation that an eukaryotic promoter structure has been conserved throughout evolutionary time. Furthermore, with the information on positional enrichment of a motif or a known functional CRE, it is possible to get a more detailed insight into where an element appears to function. This in turn might accelerate the in depth examination of known and yet unknown <it>cis</it>-regulatory sequences in the laboratory.</p>http://www.biomedcentral.com/1471-2105/7/522
spellingShingle Harter Klaus
Stüber Kurt
Berendzen Kenneth W
Wanke Dierk
<it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
BMC Bioinformatics
title <it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
title_full <it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
title_fullStr <it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
title_full_unstemmed <it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
title_short <it>Cis</it>-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
title_sort it cis it motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves
url http://www.biomedcentral.com/1471-2105/7/522
work_keys_str_mv AT harterklaus itcisitmotifsupstreamofthetranscriptionandtranslationinitiationsitesareeffectivelyrevealedbytheirpositionaldisequilibriumineukaryotegenomesusingfrequencydistributioncurves
AT stuberkurt itcisitmotifsupstreamofthetranscriptionandtranslationinitiationsitesareeffectivelyrevealedbytheirpositionaldisequilibriumineukaryotegenomesusingfrequencydistributioncurves
AT berendzenkennethw itcisitmotifsupstreamofthetranscriptionandtranslationinitiationsitesareeffectivelyrevealedbytheirpositionaldisequilibriumineukaryotegenomesusingfrequencydistributioncurves
AT wankedierk itcisitmotifsupstreamofthetranscriptionandtranslationinitiationsitesareeffectivelyrevealedbytheirpositionaldisequilibriumineukaryotegenomesusingfrequencydistributioncurves