WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

<p>Abstract</p> <p>Background</p> <p>This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely us...

Full description

Bibliographic Details
Main Authors: Pesole Graziano, Zambelli Federico, Pavesi Giulio
Format: Article
Language:English
Published: BMC 2007-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/46
_version_ 1818563738435846144
author Pesole Graziano
Zambelli Federico
Pavesi Giulio
author_facet Pesole Graziano
Zambelli Federico
Pavesi Giulio
author_sort Pesole Graziano
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available.</p> <p>Results</p> <p>We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers.</p> <p>Conclusion</p> <p>Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.</p>
first_indexed 2024-12-14T01:20:18Z
format Article
id doaj.art-016c64d1c8a54cea8ed1baa9d80bad67
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-14T01:20:18Z
publishDate 2007-02-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-016c64d1c8a54cea8ed1baa9d80bad672022-12-21T23:22:25ZengBMCBMC Bioinformatics1471-21052007-02-01814610.1186/1471-2105-8-46WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequencesPesole GrazianoZambelli FedericoPavesi Giulio<p>Abstract</p> <p>Background</p> <p>This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available.</p> <p>Results</p> <p>We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers.</p> <p>Conclusion</p> <p>Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.</p>http://www.biomedcentral.com/1471-2105/8/46
spellingShingle Pesole Graziano
Zambelli Federico
Pavesi Giulio
WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
BMC Bioinformatics
title WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
title_full WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
title_fullStr WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
title_full_unstemmed WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
title_short WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences
title_sort weederh an algorithm for finding conserved regulatory motifs and regions in homologous sequences
url http://www.biomedcentral.com/1471-2105/8/46
work_keys_str_mv AT pesolegraziano weederhanalgorithmforfindingconservedregulatorymotifsandregionsinhomologoussequences
AT zambellifederico weederhanalgorithmforfindingconservedregulatorymotifsandregionsinhomologoussequences
AT pavesigiulio weederhanalgorithmforfindingconservedregulatorymotifsandregionsinhomologoussequences