Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.

High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies whic...

Full description

Bibliographic Details
Main Authors: Eric Venner, Andreas Martin Lisewski, Serkan Erdin, R Matthew Ward, Shivas R Amin, Olivier Lichtarge
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-12-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3001439?pdf=render
_version_ 1818153022345183232
author Eric Venner
Andreas Martin Lisewski
Serkan Erdin
R Matthew Ward
Shivas R Amin
Olivier Lichtarge
author_facet Eric Venner
Andreas Martin Lisewski
Serkan Erdin
R Matthew Ward
Shivas R Amin
Olivier Lichtarge
author_sort Eric Venner
collection DOAJ
description High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.
first_indexed 2024-12-11T14:04:00Z
format Article
id doaj.art-5c6f76c62a524c0cb33972e40d7150e1
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-11T14:04:00Z
publishDate 2010-12-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-5c6f76c62a524c0cb33972e40d7150e12022-12-22T01:03:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-12-01512e1428610.1371/journal.pone.0014286Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.Eric VennerAndreas Martin LisewskiSerkan ErdinR Matthew WardShivas R AminOlivier LichtargeHigh-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.http://europepmc.org/articles/PMC3001439?pdf=render
spellingShingle Eric Venner
Andreas Martin Lisewski
Serkan Erdin
R Matthew Ward
Shivas R Amin
Olivier Lichtarge
Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
PLoS ONE
title Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
title_full Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
title_fullStr Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
title_full_unstemmed Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
title_short Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.
title_sort accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities
url http://europepmc.org/articles/PMC3001439?pdf=render
work_keys_str_mv AT ericvenner accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT andreasmartinlisewski accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT serkanerdin accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT rmatthewward accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT shivasramin accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT olivierlichtarge accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities