Finding Exact and Solo LTR-Retrotransposons in Biological Sequences Using SVM

Finding repetitive subsequences in genome is a challengeable problem in bioinformatics research area. A lot of approaches have been proposed to solve the problem, which could be divided to library base and de novo methods. The library base methods use predetermined repetitive genome’s subsequences,...

Full description

Bibliographic Details
Main Authors: Hesam Torabi Dashti, Ali Masoudi-Nejad, Fatemeh Zare
Format: Article
Language:English
Published: Iranian Institute of Research and Development in Chemical Industries (IRDCI)-ACECR 2012-06-01
Series:Iranian Journal of Chemistry & Chemical Engineering
Subjects:
Online Access:http://www.ijcce.ac.ir/article_5998_d65726fdbddfb154f4039913065df070.pdf
Description
Summary:Finding repetitive subsequences in genome is a challengeable problem in bioinformatics research area. A lot of approaches have been proposed to solve the problem, which could be divided to library base and de novo methods. The library base methods use predetermined repetitive genome’s subsequences, where library-less methods attempt to discover repetitive subsequences by analytical approaches. In this article we propose novel de novo methodology which stands on theory of pattern recognition’s science. Our methodology by using Support Vector Machine (SVM) classification and clustering methods could extract exact and Solo LTR-retrotransposons. This methodology issued to show complexity efficiency and applicability of the pattern recognition theories in bioinformatics and biomathematics research areas.We demonstrate applicability of our methodology by comparing its results with other well-known de novo method. Both applications return classes of discovered repetitive subsequences, were their results when had applied on show more that 90 percents similarities.
ISSN:1021-9986
1021-9986