Filtering of false positive microRNA candidates by a clustering-based approach

<p>Abstract</p> <p>Background</p> <p>MicroRNAs are small non-coding RNA gene products that play diversified roles from species to species. The explosive growth of microRNA researches in recent years proves the importance of microRNAs in the biological system and it is b...

Full description

Bibliographic Details
Main Authors: Cheung David W, Lin Marie CM, Leung Wing-Sze, Yiu SM
Format: Article
Language:English
Published: BMC 2008-12-01
Series:BMC Bioinformatics
_version_ 1828758455896244224
author Cheung David W
Lin Marie CM
Leung Wing-Sze
Yiu SM
author_facet Cheung David W
Lin Marie CM
Leung Wing-Sze
Yiu SM
author_sort Cheung David W
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>MicroRNAs are small non-coding RNA gene products that play diversified roles from species to species. The explosive growth of microRNA researches in recent years proves the importance of microRNAs in the biological system and it is believed that microRNAs have valuable therapeutic potentials in human diseases. Continual efforts are therefore required to locate and verify the unknown microRNAs in various genomes. As many miRNAs are found to be arranged in clusters, meaning that they are in close proximity with their neighboring miRNAs, we are interested in utilizing the concept of microRNA clustering and applying it in microRNA computational prediction.</p> <p>Results</p> <p>We first validate the microRNA clustering phenomenon in the human, mouse and rat genomes. There are 45.45%, 51.86% and 48.67% of the total miRNAs that are clustered in the three genomes, respectively. We then conduct sequence and secondary structure similarity analyses among clustered miRNAs, non-clustered miRNAs, neighboring sequences of clustered miRNAs and random sequences, and find that clustered miRNAs are structurally more similar to one another, and the <it>RNAdistance </it>score can be used to assess the structural similarity between two sequences. We therefore design a clustering-based approach which utilizes this observation to filter false positives from a list of candidates generated by a selected microRNA prediction program, and successfully raise the positive predictive value by a considerable amount ranging from 15.23% to 23.19% in the human, mouse and rat genomes, while keeping a reasonably high sensitivity.</p> <p>Conclusion</p> <p>Our clustering-based approach is able to increase the effectiveness of currently available microRNA prediction program by raising the positive predictive value while maintaining a high sensitivity, and hence can serve as a filtering step. We believe that it is worthwhile to carry out further experiments and tests with our approach using data from other genomes and other prediction software tools. Better results may be achieved with fine-tuning of parameters.</p>
first_indexed 2024-12-11T00:42:06Z
format Article
id doaj.art-93b608a8e17d4d9ca08023a3158f455d
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-11T00:42:06Z
publishDate 2008-12-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-93b608a8e17d4d9ca08023a3158f455d2022-12-22T01:26:52ZengBMCBMC Bioinformatics1471-21052008-12-019Suppl 12S310.1186/1471-2105-9-S12-S3Filtering of false positive microRNA candidates by a clustering-based approachCheung David WLin Marie CMLeung Wing-SzeYiu SM<p>Abstract</p> <p>Background</p> <p>MicroRNAs are small non-coding RNA gene products that play diversified roles from species to species. The explosive growth of microRNA researches in recent years proves the importance of microRNAs in the biological system and it is believed that microRNAs have valuable therapeutic potentials in human diseases. Continual efforts are therefore required to locate and verify the unknown microRNAs in various genomes. As many miRNAs are found to be arranged in clusters, meaning that they are in close proximity with their neighboring miRNAs, we are interested in utilizing the concept of microRNA clustering and applying it in microRNA computational prediction.</p> <p>Results</p> <p>We first validate the microRNA clustering phenomenon in the human, mouse and rat genomes. There are 45.45%, 51.86% and 48.67% of the total miRNAs that are clustered in the three genomes, respectively. We then conduct sequence and secondary structure similarity analyses among clustered miRNAs, non-clustered miRNAs, neighboring sequences of clustered miRNAs and random sequences, and find that clustered miRNAs are structurally more similar to one another, and the <it>RNAdistance </it>score can be used to assess the structural similarity between two sequences. We therefore design a clustering-based approach which utilizes this observation to filter false positives from a list of candidates generated by a selected microRNA prediction program, and successfully raise the positive predictive value by a considerable amount ranging from 15.23% to 23.19% in the human, mouse and rat genomes, while keeping a reasonably high sensitivity.</p> <p>Conclusion</p> <p>Our clustering-based approach is able to increase the effectiveness of currently available microRNA prediction program by raising the positive predictive value while maintaining a high sensitivity, and hence can serve as a filtering step. We believe that it is worthwhile to carry out further experiments and tests with our approach using data from other genomes and other prediction software tools. Better results may be achieved with fine-tuning of parameters.</p>
spellingShingle Cheung David W
Lin Marie CM
Leung Wing-Sze
Yiu SM
Filtering of false positive microRNA candidates by a clustering-based approach
BMC Bioinformatics
title Filtering of false positive microRNA candidates by a clustering-based approach
title_full Filtering of false positive microRNA candidates by a clustering-based approach
title_fullStr Filtering of false positive microRNA candidates by a clustering-based approach
title_full_unstemmed Filtering of false positive microRNA candidates by a clustering-based approach
title_short Filtering of false positive microRNA candidates by a clustering-based approach
title_sort filtering of false positive microrna candidates by a clustering based approach
work_keys_str_mv AT cheungdavidw filteringoffalsepositivemicrornacandidatesbyaclusteringbasedapproach
AT linmariecm filteringoffalsepositivemicrornacandidatesbyaclusteringbasedapproach
AT leungwingsze filteringoffalsepositivemicrornacandidatesbyaclusteringbasedapproach
AT yiusm filteringoffalsepositivemicrornacandidatesbyaclusteringbasedapproach