Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions
Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic...
Main Authors: | , , |
---|---|
Format: | Article |
Published: |
World Scientific Publishing Co. Pte Ltd
2017
|
Subjects: |
_version_ | 1796863362043215872 |
---|---|
author | Fotoohifiroozabadi, S. Mohamad, M. S. Deris, S. |
author_facet | Fotoohifiroozabadi, S. Mohamad, M. S. Deris, S. |
author_sort | Fotoohifiroozabadi, S. |
collection | ePrints |
description | Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic demonstration of secondary-structure elements (SSE) of proteins and compared the assigned letters to each SSE using the n-gram method. Although the results were comparable to those obtained via geometrical methods, the SSE length and accuracy of adjacency between SSEs were not considered in the comparison process. Therefore, to obtain further information on accuracy of adjacency between SSE vectors, the new approach of assigning text to vectors was adopted according to the spherical coordinate system in the present study. Moreover, dynamic programming was applied in order to account for the length of SSE vectors. Five common datasets were selected for method evaluation. The first three datasets were small, but difficult to align, and the remaining two datasets were used to compare the capability of the proposed method with that of other methods on a large protein dataset. The results showed that the proposed method, as a text-based alignment approach, obtained results comparable to both 1D and 3D methods. It outperformed 1D methods in terms of accuracy and 3D methods in terms of runtime. |
first_indexed | 2024-03-05T20:25:37Z |
format | Article |
id | utm.eprints-81309 |
institution | Universiti Teknologi Malaysia - ePrints |
last_indexed | 2024-03-05T20:25:37Z |
publishDate | 2017 |
publisher | World Scientific Publishing Co. Pte Ltd |
record_format | dspace |
spelling | utm.eprints-813092019-08-04T04:47:55Z http://eprints.utm.my/81309/ Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions Fotoohifiroozabadi, S. Mohamad, M. S. Deris, S. QA75 Electronic computers. Computer science Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic demonstration of secondary-structure elements (SSE) of proteins and compared the assigned letters to each SSE using the n-gram method. Although the results were comparable to those obtained via geometrical methods, the SSE length and accuracy of adjacency between SSEs were not considered in the comparison process. Therefore, to obtain further information on accuracy of adjacency between SSE vectors, the new approach of assigning text to vectors was adopted according to the spherical coordinate system in the present study. Moreover, dynamic programming was applied in order to account for the length of SSE vectors. Five common datasets were selected for method evaluation. The first three datasets were small, but difficult to align, and the remaining two datasets were used to compare the capability of the proposed method with that of other methods on a large protein dataset. The results showed that the proposed method, as a text-based alignment approach, obtained results comparable to both 1D and 3D methods. It outperformed 1D methods in terms of accuracy and 3D methods in terms of runtime. World Scientific Publishing Co. Pte Ltd 2017 Article PeerReviewed Fotoohifiroozabadi, S. and Mohamad, M. S. and Deris, S. (2017) Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions. Journal of Bioinformatics and Computational Biology, 15 (2). ISSN 0219-7200 http://dx.doi.org/10.1142/S0219720017500044 DOI:10.1142/S0219720017500044 |
spellingShingle | QA75 Electronic computers. Computer science Fotoohifiroozabadi, S. Mohamad, M. S. Deris, S. Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title | Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title_full | Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title_fullStr | Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title_full_unstemmed | Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title_short | Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions |
title_sort | samira vp a simple protein alignment method with rechecking the alphabet vector positions |
topic | QA75 Electronic computers. Computer science |
work_keys_str_mv | AT fotoohifiroozabadis samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions AT mohamadms samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions AT deriss samiravpasimpleproteinalignmentmethodwithrecheckingthealphabetvectorpositions |