Protein language model-embedded geometric graphs power inter-protein contact prediction

Accurate prediction of contacting residue pairs between interacting proteins is very useful for structural characterization of protein–protein interactions. Although significant improvement has been made in inter-protein contact prediction recently, there is still a large room for improving the pred...

Full description

Bibliographic Details
Main Authors: Yunda Si, Chengfei Yan
Format: Article
Language:English
Published: eLife Sciences Publications Ltd 2024-04-01
Series:eLife
Subjects:
Online Access:https://elifesciences.org/articles/92184
_version_ 1797225667417341952
author Yunda Si
Chengfei Yan
author_facet Yunda Si
Chengfei Yan
author_sort Yunda Si
collection DOAJ
description Accurate prediction of contacting residue pairs between interacting proteins is very useful for structural characterization of protein–protein interactions. Although significant improvement has been made in inter-protein contact prediction recently, there is still a large room for improving the prediction accuracy. Here we present a new deep learning method referred to as PLMGraph-Inter for inter-protein contact prediction. Specifically, we employ rotationally and translationally invariant geometric graphs obtained from structures of interacting proteins to integrate multiple protein language models, which are successively transformed by graph encoders formed by geometric vector perceptrons and residual networks formed by dimensional hybrid residual blocks to predict inter-protein contacts. Extensive evaluation on multiple test sets illustrates that PLMGraph-Inter outperforms five top inter-protein contact prediction methods, including DeepHomo, GLINTER, CDPred, DeepHomo2, and DRN-1D2D_Inter, by large margins. In addition, we also show that the prediction of PLMGraph-Inter can complement the result of AlphaFold-Multimer. Finally, we show leveraging the contacts predicted by PLMGraph-Inter as constraints for protein–protein docking can dramatically improve its performance for protein complex structure prediction.
first_indexed 2024-04-24T14:12:39Z
format Article
id doaj.art-20faab102b134ac0a45c5b858a91fa76
institution Directory Open Access Journal
issn 2050-084X
language English
last_indexed 2024-04-24T14:12:39Z
publishDate 2024-04-01
publisher eLife Sciences Publications Ltd
record_format Article
series eLife
spelling doaj.art-20faab102b134ac0a45c5b858a91fa762024-04-03T08:59:21ZengeLife Sciences Publications LtdeLife2050-084X2024-04-011210.7554/eLife.92184Protein language model-embedded geometric graphs power inter-protein contact predictionYunda Si0Chengfei Yan1https://orcid.org/0000-0002-2010-6668School of Physics, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Physics, Huazhong University of Science and Technology, Wuhan, ChinaAccurate prediction of contacting residue pairs between interacting proteins is very useful for structural characterization of protein–protein interactions. Although significant improvement has been made in inter-protein contact prediction recently, there is still a large room for improving the prediction accuracy. Here we present a new deep learning method referred to as PLMGraph-Inter for inter-protein contact prediction. Specifically, we employ rotationally and translationally invariant geometric graphs obtained from structures of interacting proteins to integrate multiple protein language models, which are successively transformed by graph encoders formed by geometric vector perceptrons and residual networks formed by dimensional hybrid residual blocks to predict inter-protein contacts. Extensive evaluation on multiple test sets illustrates that PLMGraph-Inter outperforms five top inter-protein contact prediction methods, including DeepHomo, GLINTER, CDPred, DeepHomo2, and DRN-1D2D_Inter, by large margins. In addition, we also show that the prediction of PLMGraph-Inter can complement the result of AlphaFold-Multimer. Finally, we show leveraging the contacts predicted by PLMGraph-Inter as constraints for protein–protein docking can dramatically improve its performance for protein complex structure prediction.https://elifesciences.org/articles/92184protein–protein interactioninter-protein contact predictionprotein complex structure predictionprotein language modelsgeometric deep learningprotein–protein docking
spellingShingle Yunda Si
Chengfei Yan
Protein language model-embedded geometric graphs power inter-protein contact prediction
eLife
protein–protein interaction
inter-protein contact prediction
protein complex structure prediction
protein language models
geometric deep learning
protein–protein docking
title Protein language model-embedded geometric graphs power inter-protein contact prediction
title_full Protein language model-embedded geometric graphs power inter-protein contact prediction
title_fullStr Protein language model-embedded geometric graphs power inter-protein contact prediction
title_full_unstemmed Protein language model-embedded geometric graphs power inter-protein contact prediction
title_short Protein language model-embedded geometric graphs power inter-protein contact prediction
title_sort protein language model embedded geometric graphs power inter protein contact prediction
topic protein–protein interaction
inter-protein contact prediction
protein complex structure prediction
protein language models
geometric deep learning
protein–protein docking
url https://elifesciences.org/articles/92184
work_keys_str_mv AT yundasi proteinlanguagemodelembeddedgeometricgraphspowerinterproteincontactprediction
AT chengfeiyan proteinlanguagemodelembeddedgeometricgraphspowerinterproteincontactprediction