Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions

Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic...

Full description

Bibliographic Details
Main Authors: Fotoohifiroozabadi, S., Mohamad, M. S., Deris, S.
Format: Article
Published: World Scientific Publishing Co. Pte Ltd 2017
Subjects:
_version_ 1796863362043215872
author Fotoohifiroozabadi, S.
Mohamad, M. S.
Deris, S.
author_facet Fotoohifiroozabadi, S.
Mohamad, M. S.
Deris, S.
author_sort Fotoohifiroozabadi, S.
collection ePrints
description Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic demonstration of secondary-structure elements (SSE) of proteins and compared the assigned letters to each SSE using the n-gram method. Although the results were comparable to those obtained via geometrical methods, the SSE length and accuracy of adjacency between SSEs were not considered in the comparison process. Therefore, to obtain further information on accuracy of adjacency between SSE vectors, the new approach of assigning text to vectors was adopted according to the spherical coordinate system in the present study. Moreover, dynamic programming was applied in order to account for the length of SSE vectors. Five common datasets were selected for method evaluation. The first three datasets were small, but difficult to align, and the remaining two datasets were used to compare the capability of the proposed method with that of other methods on a large protein dataset. The results showed that the proposed method, as a text-based alignment approach, obtained results comparable to both 1D and 3D methods. It outperformed 1D methods in terms of accuracy and 3D methods in terms of runtime.
first_indexed 2024-03-05T20:25:37Z
format Article
id utm.eprints-81309
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T20:25:37Z
publishDate 2017
publisher World Scientific Publishing Co. Pte Ltd
record_format dspace
spelling utm.eprints-813092019-08-04T04:47:55Z http://eprints.utm.my/81309/ Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions Fotoohifiroozabadi, S. Mohamad, M. S. Deris, S. QA75 Electronic computers. Computer science Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic demonstration of secondary-structure elements (SSE) of proteins and compared the assigned letters to each SSE using the n-gram method. Although the results were comparable to those obtained via geometrical methods, the SSE length and accuracy of adjacency between SSEs were not considered in the comparison process. Therefore, to obtain further information on accuracy of adjacency between SSE vectors, the new approach of assigning text to vectors was adopted according to the spherical coordinate system in the present study. Moreover, dynamic programming was applied in order to account for the length of SSE vectors. Five common datasets were selected for method evaluation. The first three datasets were small, but difficult to align, and the remaining two datasets were used to compare the capability of the proposed method with that of other methods on a large protein dataset. The results showed that the proposed method, as a text-based alignment approach, obtained results comparable to both 1D and 3D methods. It outperformed 1D methods in terms of accuracy and 3D methods in terms of runtime. World Scientific Publishing Co. Pte Ltd 2017 Article PeerReviewed Fotoohifiroozabadi, S. and Mohamad, M. S. and Deris, S. (2017) Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions. Journal of Bioinformatics and Computational Biology, 15 (2). ISSN 0219-7200 http://dx.doi.org/10.1142/S0219720017500044 DOI:10.1142/S0219720017500044
spellingShingle QA75 Electronic computers. Computer science
Fotoohifiroozabadi, S.
Mohamad, M. S.
Deris, S.
Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title_full Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title_fullStr Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title_full_unstemmed Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title_short Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
title_sort samira vp a simple protein alignment method with rechecking the alphabet vector positions
topic QA75 Electronic computers. Computer science
work_keys_str_mv AT fotoohifiroozabadis samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions
AT mohamadms samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions
AT deriss samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions