Quantifying the similarities within fold space.

We have used GRATH, a graph-based structure comparison algorithm, to map the similarities between the different folds observed in the CATH domain structure database. Statistical analysis of the distributions of the fold similarities has allowed us to assess the significance for any similarity. There...

Full description

Bibliographic Details
Main Authors: Harrison, A, Pearl, F, Mott, R, Thornton, J, Orengo, C
Format: Journal article
Language:English
Published: 2002
_version_ 1797071046843564032
author Harrison, A
Pearl, F
Mott, R
Thornton, J
Orengo, C
author_facet Harrison, A
Pearl, F
Mott, R
Thornton, J
Orengo, C
author_sort Harrison, A
collection OXFORD
description We have used GRATH, a graph-based structure comparison algorithm, to map the similarities between the different folds observed in the CATH domain structure database. Statistical analysis of the distributions of the fold similarities has allowed us to assess the significance for any similarity. Therefore we have examined whether it is best to represent folds as discrete entities or whether, in fact, a more accurate model would be a continuum wherein folds overlap via common motifs. To do this we have introduced a new statistical measure of fold similarity, termed gregariousness. For a particular fold, gregariousness measures how many other folds have a significant structural overlap with that fold, typically comprising 40% or more of the larger structure. Gregarious folds often contain commonly occurring super-secondary structural motifs, such as beta-meanders, greek keys, alpha-beta plait motifs or alpha-hairpins, which are matching similar motifs in other folds. Apart from one example, all the most gregarious folds matching 20% or more of the other folds in the database, are alpha-beta proteins. They also occur in highly populated architectural regions of fold space, adopting sandwich-like arrangements containing two or more layers of alpha-helices and beta-strands.Domains that exhibit a low gregariousness, are those that have very distinctive folds, with few common motifs or motifs that are packed in unusual arrangements. Most of the superhelices exhibit low gregariousness despite containing some commonly occurring super-secondary structural motifs. In these folds, these common motifs are combined in an unusual way and represent a small proportion of the fold (<10%). Our results suggest that fold space may be considered as continuous for some architectural arrangements (e.g. alpha-beta sandwiches), in that super-secondary motifs can be used to link neighbouring fold groups. However, in other regions of fold space much more discrete topologies are observed with little similarity between folds.
first_indexed 2024-03-06T22:47:34Z
format Journal article
id oxford-uuid:5db10b5c-fc82-4db6-9087-704b43f4b7ba
institution University of Oxford
language English
last_indexed 2024-03-06T22:47:34Z
publishDate 2002
record_format dspace
spelling oxford-uuid:5db10b5c-fc82-4db6-9087-704b43f4b7ba2022-03-26T17:35:59ZQuantifying the similarities within fold space.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:5db10b5c-fc82-4db6-9087-704b43f4b7baEnglishSymplectic Elements at Oxford2002Harrison, APearl, FMott, RThornton, JOrengo, CWe have used GRATH, a graph-based structure comparison algorithm, to map the similarities between the different folds observed in the CATH domain structure database. Statistical analysis of the distributions of the fold similarities has allowed us to assess the significance for any similarity. Therefore we have examined whether it is best to represent folds as discrete entities or whether, in fact, a more accurate model would be a continuum wherein folds overlap via common motifs. To do this we have introduced a new statistical measure of fold similarity, termed gregariousness. For a particular fold, gregariousness measures how many other folds have a significant structural overlap with that fold, typically comprising 40% or more of the larger structure. Gregarious folds often contain commonly occurring super-secondary structural motifs, such as beta-meanders, greek keys, alpha-beta plait motifs or alpha-hairpins, which are matching similar motifs in other folds. Apart from one example, all the most gregarious folds matching 20% or more of the other folds in the database, are alpha-beta proteins. They also occur in highly populated architectural regions of fold space, adopting sandwich-like arrangements containing two or more layers of alpha-helices and beta-strands.Domains that exhibit a low gregariousness, are those that have very distinctive folds, with few common motifs or motifs that are packed in unusual arrangements. Most of the superhelices exhibit low gregariousness despite containing some commonly occurring super-secondary structural motifs. In these folds, these common motifs are combined in an unusual way and represent a small proportion of the fold (<10%). Our results suggest that fold space may be considered as continuous for some architectural arrangements (e.g. alpha-beta sandwiches), in that super-secondary motifs can be used to link neighbouring fold groups. However, in other regions of fold space much more discrete topologies are observed with little similarity between folds.
spellingShingle Harrison, A
Pearl, F
Mott, R
Thornton, J
Orengo, C
Quantifying the similarities within fold space.
title Quantifying the similarities within fold space.
title_full Quantifying the similarities within fold space.
title_fullStr Quantifying the similarities within fold space.
title_full_unstemmed Quantifying the similarities within fold space.
title_short Quantifying the similarities within fold space.
title_sort quantifying the similarities within fold space
work_keys_str_mv AT harrisona quantifyingthesimilaritieswithinfoldspace
AT pearlf quantifyingthesimilaritieswithinfoldspace
AT mottr quantifyingthesimilaritieswithinfoldspace
AT thorntonj quantifyingthesimilaritieswithinfoldspace
AT orengoc quantifyingthesimilaritieswithinfoldspace