De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.

Function prediction frequently relies on comparing genes or gene products to search for relevant similarities. Because the number of protein structures with unknown function is mushrooming, however, we asked here whether such comparisons could be improved by focusing narrowly on the key functional f...

Full description

Bibliographic Details
Main Authors: R Matthew Ward, Serkan Erdin, Tuan A Tran, David M Kristensen, Andreas Martin Lisewski, Olivier Lichtarge
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-05-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2362850?pdf=render
_version_ 1817990185327001600
author R Matthew Ward
Serkan Erdin
Tuan A Tran
David M Kristensen
Andreas Martin Lisewski
Olivier Lichtarge
author_facet R Matthew Ward
Serkan Erdin
Tuan A Tran
David M Kristensen
Andreas Martin Lisewski
Olivier Lichtarge
author_sort R Matthew Ward
collection DOAJ
description Function prediction frequently relies on comparing genes or gene products to search for relevant similarities. Because the number of protein structures with unknown function is mushrooming, however, we asked here whether such comparisons could be improved by focusing narrowly on the key functional features of protein structures, as defined by the Evolutionary Trace (ET). Therefore a series of algorithms was built to (a) extract local motifs (3D templates) from protein structures based on ET ranking of residue importance; (b) to assess their geometric and evolutionary similarity to other structures; and (c) to transfer enzyme annotation whenever a plurality was reached across matches. Whereas a prototype had only been 80% accurate and was not scalable, here a speedy new matching algorithm enabled large-scale searches for reciprocal matches and thus raised annotation specificity to 100% in both positive and negative controls of 49 enzymes and 50 non-enzymes, respectively-in one case even identifying an annotation error-while maintaining sensitivity ( approximately 60%). Critically, this Evolutionary Trace Annotation (ETA) pipeline requires no prior knowledge of functional mechanisms. It could thus be applied in a large-scale retrospective study of 1218 structural genomics enzymes and reached 92% accuracy. Likewise, it was applied to all 2935 unannotated structural genomics proteins and predicted enzymatic functions in 320 cases: 258 on first pass and 62 more on second pass. Controls and initial analyses suggest that these predictions are reliable. Thus the large-scale evolutionary integration of sequence-structure-function data, here through reciprocal identification of local, functionally important structural features, may contribute significantly to de-orphaning the structural proteome.
first_indexed 2024-04-14T00:56:41Z
format Article
id doaj.art-d39e1689c3b440b2a32d399f57e0c181
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-14T00:56:41Z
publishDate 2008-05-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-d39e1689c3b440b2a32d399f57e0c1812022-12-22T02:21:36ZengPublic Library of Science (PLoS)PLoS ONE1932-62032008-05-0135e213610.1371/journal.pone.0002136De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.R Matthew WardSerkan ErdinTuan A TranDavid M KristensenAndreas Martin LisewskiOlivier LichtargeFunction prediction frequently relies on comparing genes or gene products to search for relevant similarities. Because the number of protein structures with unknown function is mushrooming, however, we asked here whether such comparisons could be improved by focusing narrowly on the key functional features of protein structures, as defined by the Evolutionary Trace (ET). Therefore a series of algorithms was built to (a) extract local motifs (3D templates) from protein structures based on ET ranking of residue importance; (b) to assess their geometric and evolutionary similarity to other structures; and (c) to transfer enzyme annotation whenever a plurality was reached across matches. Whereas a prototype had only been 80% accurate and was not scalable, here a speedy new matching algorithm enabled large-scale searches for reciprocal matches and thus raised annotation specificity to 100% in both positive and negative controls of 49 enzymes and 50 non-enzymes, respectively-in one case even identifying an annotation error-while maintaining sensitivity ( approximately 60%). Critically, this Evolutionary Trace Annotation (ETA) pipeline requires no prior knowledge of functional mechanisms. It could thus be applied in a large-scale retrospective study of 1218 structural genomics enzymes and reached 92% accuracy. Likewise, it was applied to all 2935 unannotated structural genomics proteins and predicted enzymatic functions in 320 cases: 258 on first pass and 62 more on second pass. Controls and initial analyses suggest that these predictions are reliable. Thus the large-scale evolutionary integration of sequence-structure-function data, here through reciprocal identification of local, functionally important structural features, may contribute significantly to de-orphaning the structural proteome.http://europepmc.org/articles/PMC2362850?pdf=render
spellingShingle R Matthew Ward
Serkan Erdin
Tuan A Tran
David M Kristensen
Andreas Martin Lisewski
Olivier Lichtarge
De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
PLoS ONE
title De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
title_full De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
title_fullStr De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
title_full_unstemmed De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
title_short De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.
title_sort de orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features
url http://europepmc.org/articles/PMC2362850?pdf=render
work_keys_str_mv AT rmatthewward deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures
AT serkanerdin deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures
AT tuanatran deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures
AT davidmkristensen deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures
AT andreasmartinlisewski deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures
AT olivierlichtarge deorphaningthestructuralproteomethroughreciprocalcomparisonofevolutionarilyimportantstructuralfeatures