Rules extraction from neural networks applied to the prediction and recognition of prokaryotic promoters

Promoters are DNA sequences located upstream of the gene region and play a central role in gene expression. Computational techniques show good accuracy in gene prediction but are less successful in predicting promoters, primarily because of the high number of false positives that reflect characteris...

Full description

Bibliographic Details
Main Authors: Scheila de Avila e Silva, Günther J.L. Gerhardt, Sergio Echeverrigaray
Format: Article
Language:English
Published: Sociedade Brasileira de Genética 2011-01-01
Series:Genetics and Molecular Biology
Subjects:
Online Access:http://www.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572011000200031
Description
Summary:Promoters are DNA sequences located upstream of the gene region and play a central role in gene expression. Computational techniques show good accuracy in gene prediction but are less successful in predicting promoters, primarily because of the high number of false positives that reflect characteristics of the promoter sequences. Many machine learning methods have been used to address this issue. Neural Networks (NN) have been successfully used in this field because of their ability to recognize imprecise and incomplete patterns characteristic of promoter sequences. In this paper, NN was used to predict and recognize promoter sequences in two data sets: (i) one based on nucleotide sequence information and (ii) another based on stability sequence information. The accuracy was approximately 80% for simulation (i) and 68% for simulation (ii). In the rules extracted, biological consensus motifs were important parts of the NN learning process in both simulations.
ISSN:1415-4757
1678-4685