Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction

Efficient molecular featurization is one of the major issues for machine learning models in drug design. Here, we propose a persistent Ricci curvature (PRC), in particular, Ollivier PRC (OPRC), for the molecular featurization and feature engineering, for the first time. The filtration process propos...

Full description

Bibliographic Details
Main Authors: Wee, Junjie, Xia, Kelin
Other Authors: School of Physical and Mathematical Sciences
Format: Journal Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160294
_version_ 1811688690459607040
author Wee, Junjie
Xia, Kelin
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Wee, Junjie
Xia, Kelin
author_sort Wee, Junjie
collection NTU
description Efficient molecular featurization is one of the major issues for machine learning models in drug design. Here, we propose a persistent Ricci curvature (PRC), in particular, Ollivier PRC (OPRC), for the molecular featurization and feature engineering, for the first time. The filtration process proposed in the persistent homology is employed to generate a series of nested molecular graphs. Persistence and variation of Ollivier Ricci curvatures on these nested graphs are defined as OPRC. Moreover, persistent attributes, which are statistical and combinatorial properties of OPRCs during the filtration process, are used as molecular descriptors and further combined with machine learning models, in particular, gradient boosting tree (GBT). Our OPRC-GBT model is used in the prediction of the protein-ligand binding affinity, which is one of the key steps in drug design. Based on three of the most commonly used data sets from the well-established protein-ligand binding databank, that is, PDBbind, we intensively test our model and compare with existing models. It has been found that our model can achieve the state-of-the-art results and has advantages over traditional molecular descriptors.
first_indexed 2024-10-01T05:36:13Z
format Journal Article
id ntu-10356/160294
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:36:13Z
publishDate 2022
record_format dspace
spelling ntu-10356/1602942022-07-19T01:51:49Z Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction Wee, Junjie Xia, Kelin School of Physical and Mathematical Sciences Science::Mathematics Metric-Measure-Spaces Aided Drug Design Efficient molecular featurization is one of the major issues for machine learning models in drug design. Here, we propose a persistent Ricci curvature (PRC), in particular, Ollivier PRC (OPRC), for the molecular featurization and feature engineering, for the first time. The filtration process proposed in the persistent homology is employed to generate a series of nested molecular graphs. Persistence and variation of Ollivier Ricci curvatures on these nested graphs are defined as OPRC. Moreover, persistent attributes, which are statistical and combinatorial properties of OPRCs during the filtration process, are used as molecular descriptors and further combined with machine learning models, in particular, gradient boosting tree (GBT). Our OPRC-GBT model is used in the prediction of the protein-ligand binding affinity, which is one of the key steps in drug design. Based on three of the most commonly used data sets from the well-established protein-ligand binding databank, that is, PDBbind, we intensively test our model and compare with existing models. It has been found that our model can achieve the state-of-the-art results and has advantages over traditional molecular descriptors. Ministry of Education (MOE) Nanyang Technological University This work was supported in part by Nanyang Technological University Startup Grant M4081842.110, Singapore Ministry of Education Academic Research fund Tier 1 RG109/19 and Tier 2 MOE2018-T2-1-033. 2022-07-19T01:51:49Z 2022-07-19T01:51:49Z 2021 Journal Article Wee, J. & Xia, K. (2021). Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction. Journal of Chemical Information and Modeling, 61(4), 1617-1626. https://dx.doi.org/10.1021/acs.jcim.0c01415 1549-9596 https://hdl.handle.net/10356/160294 10.1021/acs.jcim.0c01415 33724038 2-s2.0-85103780511 4 61 1617 1626 en M4081842.110 RG109/19 MOE2018-T2-1-033 Journal of Chemical Information and Modeling © 2021 American Chemical Society. All rights reserved.
spellingShingle Science::Mathematics
Metric-Measure-Spaces
Aided Drug Design
Wee, Junjie
Xia, Kelin
Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title_full Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title_fullStr Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title_full_unstemmed Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title_short Ollivier persistent Ricci curvature-based machine learning for the protein-ligand binding affinity prediction
title_sort ollivier persistent ricci curvature based machine learning for the protein ligand binding affinity prediction
topic Science::Mathematics
Metric-Measure-Spaces
Aided Drug Design
url https://hdl.handle.net/10356/160294
work_keys_str_mv AT weejunjie ollivierpersistentriccicurvaturebasedmachinelearningfortheproteinligandbindingaffinityprediction
AT xiakelin ollivierpersistentriccicurvaturebasedmachinelearningfortheproteinligandbindingaffinityprediction