MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.

<h4>Background</h4>MicroRNAs (miRNAs) are small, single stranded RNAs with a key role in post-transcriptional regulation of thousands of genes across numerous species. While several computational methods are currently available for identifying miRNA genes, accurate prediction of the matu...

Full description

Bibliographic Details
Main Authors: Katerina Gkirtzou, Ioannis Tsamardinos, Panagiotis Tsakalides, Panayiota Poirazi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-08-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20700506/?tool=EBI
_version_ 1797937959766327296
author Katerina Gkirtzou
Ioannis Tsamardinos
Panagiotis Tsakalides
Panayiota Poirazi
author_facet Katerina Gkirtzou
Ioannis Tsamardinos
Panagiotis Tsakalides
Panayiota Poirazi
author_sort Katerina Gkirtzou
collection DOAJ
description <h4>Background</h4>MicroRNAs (miRNAs) are small, single stranded RNAs with a key role in post-transcriptional regulation of thousands of genes across numerous species. While several computational methods are currently available for identifying miRNA genes, accurate prediction of the mature miRNA remains a challenge. Existing approaches fall short in predicting the location of mature miRNAs but also in finding the functional strand(s) of miRNA precursors.<h4>Methodology/principal findings</h4>Here, we present a computational tool that incorporates a Naive Bayes classifier to identify mature miRNA candidates based on sequence and secondary structure information of their miRNA precursors. We take into account both positive (true mature miRNAs) and negative (same-size non-mature miRNA sequences) examples to optimize sensitivity as well as specificity. Our method can accurately predict the start position of experimentally verified mature miRNAs for both human and mouse, achieving a significantly larger (often double) performance accuracy compared with two existing methods. Moreover, the method exhibits a very high generalization performance on miRNAs from two other organisms. More importantly, our method provides direct evidence about the features of miRNA precursors which may determine the location of the mature miRNA. We find that the triplet of positions 7, 8 and 9 from the mature miRNA end towards the closest hairpin have the largest discriminatory power, are relatively conserved in terms of sequence composition (mostly contain a Uracil) and are located within or in very close proximity to the hairpin loop, suggesting the existence of a possible recognition site for Dicer and associated proteins.<h4>Conclusions</h4>This work describes a novel algorithm for identifying the start position of mature miRNA(s) produced by miRNA precursors. Our tool has significantly better (often double) performance than two existing approaches and provides new insights about the potential use of specific sequence/structural information as recognition signals for Dicer processing. Web Tool available at: http://mirna.imbb.forth.gr/MatureBayes.html.
first_indexed 2024-04-10T18:52:20Z
format Article
id doaj.art-4d0e1bb217104fb2904abab4d2671d54
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-10T18:52:20Z
publishDate 2010-08-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-4d0e1bb217104fb2904abab4d2671d542023-02-01T05:31:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-08-0158e1184310.1371/journal.pone.0011843MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.Katerina GkirtzouIoannis TsamardinosPanagiotis TsakalidesPanayiota Poirazi<h4>Background</h4>MicroRNAs (miRNAs) are small, single stranded RNAs with a key role in post-transcriptional regulation of thousands of genes across numerous species. While several computational methods are currently available for identifying miRNA genes, accurate prediction of the mature miRNA remains a challenge. Existing approaches fall short in predicting the location of mature miRNAs but also in finding the functional strand(s) of miRNA precursors.<h4>Methodology/principal findings</h4>Here, we present a computational tool that incorporates a Naive Bayes classifier to identify mature miRNA candidates based on sequence and secondary structure information of their miRNA precursors. We take into account both positive (true mature miRNAs) and negative (same-size non-mature miRNA sequences) examples to optimize sensitivity as well as specificity. Our method can accurately predict the start position of experimentally verified mature miRNAs for both human and mouse, achieving a significantly larger (often double) performance accuracy compared with two existing methods. Moreover, the method exhibits a very high generalization performance on miRNAs from two other organisms. More importantly, our method provides direct evidence about the features of miRNA precursors which may determine the location of the mature miRNA. We find that the triplet of positions 7, 8 and 9 from the mature miRNA end towards the closest hairpin have the largest discriminatory power, are relatively conserved in terms of sequence composition (mostly contain a Uracil) and are located within or in very close proximity to the hairpin loop, suggesting the existence of a possible recognition site for Dicer and associated proteins.<h4>Conclusions</h4>This work describes a novel algorithm for identifying the start position of mature miRNA(s) produced by miRNA precursors. Our tool has significantly better (often double) performance than two existing approaches and provides new insights about the potential use of specific sequence/structural information as recognition signals for Dicer processing. Web Tool available at: http://mirna.imbb.forth.gr/MatureBayes.html.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20700506/?tool=EBI
spellingShingle Katerina Gkirtzou
Ioannis Tsamardinos
Panagiotis Tsakalides
Panayiota Poirazi
MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
PLoS ONE
title MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
title_full MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
title_fullStr MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
title_full_unstemmed MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
title_short MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.
title_sort maturebayes a probabilistic algorithm for identifying the mature mirna within novel precursors
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20700506/?tool=EBI
work_keys_str_mv AT katerinagkirtzou maturebayesaprobabilisticalgorithmforidentifyingthematuremirnawithinnovelprecursors
AT ioannistsamardinos maturebayesaprobabilisticalgorithmforidentifyingthematuremirnawithinnovelprecursors
AT panagiotistsakalides maturebayesaprobabilisticalgorithmforidentifyingthematuremirnawithinnovelprecursors
AT panayiotapoirazi maturebayesaprobabilisticalgorithmforidentifyingthematuremirnawithinnovelprecursors