Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradict...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3353917?pdf=render |
_version_ | 1818180559624470528 |
---|---|
author | Davide De Lucrezia Debora Slanzi Irene Poli Fabio Polticelli Giovanni Minervini |
author_facet | Davide De Lucrezia Debora Slanzi Irene Poli Fabio Polticelli Giovanni Minervini |
author_sort | Davide De Lucrezia |
collection | DOAJ |
description | Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradicting conclusions. Previous works in literature focused on the analysis of primary structure in an attempt to identify possible signature of evolutionary editing. Conversely, in this work we compare a set of 762 natural proteins with an average length of 70 amino acids and an equal number of completely random ones of comparable length on the basis of their structural features. We use an ad hoc Evolutionary Neural Network Algorithm (ENNA) in order to assess whether and to what extent natural proteins are edited from random polypeptides employing 11 different structure-related variables (i.e. net charge, volume, surface area, coil, alpha helix, beta sheet, percentage of coil, percentage of alpha helix, percentage of beta sheet, percentage of secondary structure and surface hydrophobicity). The ENNA algorithm is capable to correctly distinguish natural proteins from random ones with an accuracy of 94.36%. Furthermore, we study the structural features of 32 random polypeptides misclassified as natural ones to unveil any structural similarity to natural proteins. Results show that random proteins misclassified by the ENNA algorithm exhibit a significant fold similarity to portions or subdomains of extant proteins at atomic resolution. Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features. Furthermore, we also show that the ENNA, employing simple structural descriptors, can predict whether a protein chain is natural or random. |
first_indexed | 2024-12-11T21:21:42Z |
format | Article |
id | doaj.art-9e5c6f169fa84952b1bfae925f119404 |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-11T21:21:42Z |
publishDate | 2012-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-9e5c6f169fa84952b1bfae925f1194042022-12-22T00:50:27ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0175e3663410.1371/journal.pone.0036634Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.Davide De LucreziaDebora SlanziIrene PoliFabio PolticelliGiovanni MinerviniAre extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradicting conclusions. Previous works in literature focused on the analysis of primary structure in an attempt to identify possible signature of evolutionary editing. Conversely, in this work we compare a set of 762 natural proteins with an average length of 70 amino acids and an equal number of completely random ones of comparable length on the basis of their structural features. We use an ad hoc Evolutionary Neural Network Algorithm (ENNA) in order to assess whether and to what extent natural proteins are edited from random polypeptides employing 11 different structure-related variables (i.e. net charge, volume, surface area, coil, alpha helix, beta sheet, percentage of coil, percentage of alpha helix, percentage of beta sheet, percentage of secondary structure and surface hydrophobicity). The ENNA algorithm is capable to correctly distinguish natural proteins from random ones with an accuracy of 94.36%. Furthermore, we study the structural features of 32 random polypeptides misclassified as natural ones to unveil any structural similarity to natural proteins. Results show that random proteins misclassified by the ENNA algorithm exhibit a significant fold similarity to portions or subdomains of extant proteins at atomic resolution. Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features. Furthermore, we also show that the ENNA, employing simple structural descriptors, can predict whether a protein chain is natural or random.http://europepmc.org/articles/PMC3353917?pdf=render |
spellingShingle | Davide De Lucrezia Debora Slanzi Irene Poli Fabio Polticelli Giovanni Minervini Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. PLoS ONE |
title | Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. |
title_full | Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. |
title_fullStr | Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. |
title_full_unstemmed | Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. |
title_short | Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. |
title_sort | do natural proteins differ from random sequences polypeptides natural vs random proteins classification using an evolutionary neural network |
url | http://europepmc.org/articles/PMC3353917?pdf=render |
work_keys_str_mv | AT davidedelucrezia donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork AT deboraslanzi donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork AT irenepoli donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork AT fabiopolticelli donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork AT giovanniminervini donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork |