Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes

Bibliographic Details
Main Authors:	Albros Hermes Poot Velez, Fernando Fontove, Gabriel Del Rio
Format:	Article
Language:	English
Published:	MDPI AG 2020-07-01
Series:	International Journal of Molecular Sciences
Subjects:	residue cluster class; protein–protein interaction; machine learning
Online Access:	https://www.mdpi.com/1422-0067/21/13/4787

Description
Summary:	Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm–parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96–99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship.
ISSN:	1661-6596, 1422-0067

Similar Items