Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.

Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradict...

Full description

Bibliographic Details
Main Authors: Davide De Lucrezia, Debora Slanzi, Irene Poli, Fabio Polticelli, Giovanni Minervini
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3353917?pdf=render
_version_ 1818180559624470528
author Davide De Lucrezia
Debora Slanzi
Irene Poli
Fabio Polticelli
Giovanni Minervini
author_facet Davide De Lucrezia
Debora Slanzi
Irene Poli
Fabio Polticelli
Giovanni Minervini
author_sort Davide De Lucrezia
collection DOAJ
description Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradicting conclusions. Previous works in literature focused on the analysis of primary structure in an attempt to identify possible signature of evolutionary editing. Conversely, in this work we compare a set of 762 natural proteins with an average length of 70 amino acids and an equal number of completely random ones of comparable length on the basis of their structural features. We use an ad hoc Evolutionary Neural Network Algorithm (ENNA) in order to assess whether and to what extent natural proteins are edited from random polypeptides employing 11 different structure-related variables (i.e. net charge, volume, surface area, coil, alpha helix, beta sheet, percentage of coil, percentage of alpha helix, percentage of beta sheet, percentage of secondary structure and surface hydrophobicity). The ENNA algorithm is capable to correctly distinguish natural proteins from random ones with an accuracy of 94.36%. Furthermore, we study the structural features of 32 random polypeptides misclassified as natural ones to unveil any structural similarity to natural proteins. Results show that random proteins misclassified by the ENNA algorithm exhibit a significant fold similarity to portions or subdomains of extant proteins at atomic resolution. Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features. Furthermore, we also show that the ENNA, employing simple structural descriptors, can predict whether a protein chain is natural or random.
first_indexed 2024-12-11T21:21:42Z
format Article
id doaj.art-9e5c6f169fa84952b1bfae925f119404
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-11T21:21:42Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-9e5c6f169fa84952b1bfae925f1194042022-12-22T00:50:27ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0175e3663410.1371/journal.pone.0036634Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.Davide De LucreziaDebora SlanziIrene PoliFabio PolticelliGiovanni MinerviniAre extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradicting conclusions. Previous works in literature focused on the analysis of primary structure in an attempt to identify possible signature of evolutionary editing. Conversely, in this work we compare a set of 762 natural proteins with an average length of 70 amino acids and an equal number of completely random ones of comparable length on the basis of their structural features. We use an ad hoc Evolutionary Neural Network Algorithm (ENNA) in order to assess whether and to what extent natural proteins are edited from random polypeptides employing 11 different structure-related variables (i.e. net charge, volume, surface area, coil, alpha helix, beta sheet, percentage of coil, percentage of alpha helix, percentage of beta sheet, percentage of secondary structure and surface hydrophobicity). The ENNA algorithm is capable to correctly distinguish natural proteins from random ones with an accuracy of 94.36%. Furthermore, we study the structural features of 32 random polypeptides misclassified as natural ones to unveil any structural similarity to natural proteins. Results show that random proteins misclassified by the ENNA algorithm exhibit a significant fold similarity to portions or subdomains of extant proteins at atomic resolution. Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features. Furthermore, we also show that the ENNA, employing simple structural descriptors, can predict whether a protein chain is natural or random.http://europepmc.org/articles/PMC3353917?pdf=render
spellingShingle Davide De Lucrezia
Debora Slanzi
Irene Poli
Fabio Polticelli
Giovanni Minervini
Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
PLoS ONE
title Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
title_full Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
title_fullStr Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
title_full_unstemmed Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
title_short Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network.
title_sort do natural proteins differ from random sequences polypeptides natural vs random proteins classification using an evolutionary neural network
url http://europepmc.org/articles/PMC3353917?pdf=render
work_keys_str_mv AT davidedelucrezia donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork
AT deboraslanzi donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork
AT irenepoli donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork
AT fabiopolticelli donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork
AT giovanniminervini donaturalproteinsdifferfromrandomsequencespolypeptidesnaturalvsrandomproteinsclassificationusinganevolutionaryneuralnetwork