ASH structure alignment package: Sensitivity and selectivity in domain classification

<p>Abstract</p> <p>Background</p> <p>Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities...

Full description

Bibliographic Details
Main Authors: Toh Hiroyuki, Standley Daron M, Nakamura Haruki
Format: Article
Language:English
Published: BMC 2007-04-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/116
_version_ 1818832061790683136
author Toh Hiroyuki
Standley Daron M
Nakamura Haruki
author_facet Toh Hiroyuki
Standley Daron M
Nakamura Haruki
author_sort Toh Hiroyuki
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website <url>http://www.pdbj.org/ASH/</url>.</p> <p>Results</p> <p>The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of .96 under the receiver operating characteristic (ROC) curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibit high similarity, even though their CATH and SCOP classification differs. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure.</p> <p>Conclusion</p> <p>ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies.</p>
first_indexed 2024-12-19T01:57:03Z
format Article
id doaj.art-0bd7d6eb3d2d40adad009a5945a7e35d
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-19T01:57:03Z
publishDate 2007-04-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-0bd7d6eb3d2d40adad009a5945a7e35d2022-12-21T20:41:11ZengBMCBMC Bioinformatics1471-21052007-04-018111610.1186/1471-2105-8-116ASH structure alignment package: Sensitivity and selectivity in domain classificationToh HiroyukiStandley Daron MNakamura Haruki<p>Abstract</p> <p>Background</p> <p>Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website <url>http://www.pdbj.org/ASH/</url>.</p> <p>Results</p> <p>The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of .96 under the receiver operating characteristic (ROC) curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibit high similarity, even though their CATH and SCOP classification differs. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure.</p> <p>Conclusion</p> <p>ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies.</p>http://www.biomedcentral.com/1471-2105/8/116
spellingShingle Toh Hiroyuki
Standley Daron M
Nakamura Haruki
ASH structure alignment package: Sensitivity and selectivity in domain classification
BMC Bioinformatics
title ASH structure alignment package: Sensitivity and selectivity in domain classification
title_full ASH structure alignment package: Sensitivity and selectivity in domain classification
title_fullStr ASH structure alignment package: Sensitivity and selectivity in domain classification
title_full_unstemmed ASH structure alignment package: Sensitivity and selectivity in domain classification
title_short ASH structure alignment package: Sensitivity and selectivity in domain classification
title_sort ash structure alignment package sensitivity and selectivity in domain classification
url http://www.biomedcentral.com/1471-2105/8/116
work_keys_str_mv AT tohhiroyuki ashstructurealignmentpackagesensitivityandselectivityindomainclassification
AT standleydaronm ashstructurealignmentpackagesensitivityandselectivityindomainclassification
AT nakamuraharuki ashstructurealignmentpackagesensitivityandselectivityindomainclassification