ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.

The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because man...

Full description

Bibliographic Details
Main Authors: Sabrina de Azevedo Silveira, Raquel Cardoso de Melo-Minardi, Carlos Henrique da Silveira, Marcelo Matos Santoro, Wagner Meira
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3929618?pdf=render
_version_ 1818275795929399296
author Sabrina de Azevedo Silveira
Sabrina de Azevedo Silveira
Raquel Cardoso de Melo-Minardi
Carlos Henrique da Silveira
Marcelo Matos Santoro
Wagner Meira
author_facet Sabrina de Azevedo Silveira
Sabrina de Azevedo Silveira
Raquel Cardoso de Melo-Minardi
Carlos Henrique da Silveira
Marcelo Matos Santoro
Wagner Meira
author_sort Sabrina de Azevedo Silveira
collection DOAJ
description The volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset.
first_indexed 2024-12-12T22:35:26Z
format Article
id doaj.art-8fc3ebfa2a6a4d528655e22d7889e680
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-12T22:35:26Z
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-8fc3ebfa2a6a4d528655e22d7889e6802022-12-22T00:09:29ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0192e8916210.1371/journal.pone.0089162ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.Sabrina de Azevedo SilveiraSabrina de Azevedo SilveiraRaquel Cardoso de Melo-MinardiCarlos Henrique da SilveiraMarcelo Matos SantoroWagner MeiraThe volume and diversity of biological data are increasing at very high rates. Vast amounts of protein sequences and structures, protein and genetic interactions and phenotype studies have been produced. The majority of data generated by high-throughput devices is automatically annotated because manually annotating them is not possible. Thus, efficient and precise automatic annotation methods are required to ensure the quality and reliability of both the biological data and associated annotations. We proposed ENZYMatic Annotation Predictor (ENZYMAP), a technique to characterize and predict EC number changes based on annotations from UniProt/Swiss-Prot using a supervised learning approach. We evaluated ENZYMAP experimentally, using test data sets from both UniProt/Swiss-Prot and UniProt/TrEMBL, and showed that predicting EC changes using selected types of annotation is possible. Finally, we compared ENZYMAP and DETECT with respect to their predictions and checked both against the UniProt/Swiss-Prot annotations. ENZYMAP was shown to be more accurate than DETECT, coming closer to the actual changes in UniProt/Swiss-Prot. Our proposal is intended to be an automatic complementary method (that can be used together with other techniques like the ones based on protein sequence and structure) that helps to improve the quality and reliability of enzyme annotations over time, suggesting possible corrections, anticipating annotation changes and propagating the implicit knowledge for the whole dataset.http://europepmc.org/articles/PMC3929618?pdf=render
spellingShingle Sabrina de Azevedo Silveira
Sabrina de Azevedo Silveira
Raquel Cardoso de Melo-Minardi
Carlos Henrique da Silveira
Marcelo Matos Santoro
Wagner Meira
ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
PLoS ONE
title ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
title_full ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
title_fullStr ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
title_full_unstemmed ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
title_short ENZYMAP: exploiting protein annotation for modeling and predicting EC number changes in UniProt/Swiss-Prot.
title_sort enzymap exploiting protein annotation for modeling and predicting ec number changes in uniprot swiss prot
url http://europepmc.org/articles/PMC3929618?pdf=render
work_keys_str_mv AT sabrinadeazevedosilveira enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT sabrinadeazevedosilveira enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT raquelcardosodemelominardi enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT carloshenriquedasilveira enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT marcelomatossantoro enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot
AT wagnermeira enzymapexploitingproteinannotationformodelingandpredictingecnumberchangesinuniprotswissprot