New machine learning and physics-based scoring functions for drug discovery

Abstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on p...

Full description

Bibliographic Details
Main Authors: Isabella A. Guedes, André M. S. Barreto, Diogo Marinho, Eduardo Krempser, Mélaine A. Kuenemann, Olivier Sperandio, Laurent E. Dardenne, Maria A. Miteva
Format: Article
Language:English
Published: Nature Portfolio 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-82410-1
_version_ 1818681917802807296
author Isabella A. Guedes
André M. S. Barreto
Diogo Marinho
Eduardo Krempser
Mélaine A. Kuenemann
Olivier Sperandio
Laurent E. Dardenne
Maria A. Miteva
author_facet Isabella A. Guedes
André M. S. Barreto
Diogo Marinho
Eduardo Krempser
Mélaine A. Kuenemann
Olivier Sperandio
Laurent E. Dardenne
Maria A. Miteva
author_sort Isabella A. Guedes
collection DOAJ
description Abstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on precise physics-based descriptors better representing protein–ligand recognition process are strongly needed. We developed a set of new empirical scoring functions, named DockTScore, by explicitly accounting for physics-based terms combined with machine learning. Target-specific scoring functions were developed for two important drug targets, proteases and protein–protein interactions, representing an original class of molecules for drug discovery. Multiple linear regression (MLR), support vector machine and random forest algorithms were employed to derive general and target-specific scoring functions involving optimized MMFF94S force-field terms, solvation and lipophilic interactions terms, and an improved term accounting for ligand torsional entropy contribution to ligand binding. DockTScore scoring functions demonstrated to be competitive with the current best-evaluated scoring functions in terms of binding energy prediction and ranking on four DUD-E datasets and will be useful for in silico drug design for diverse proteins as well as for specific targets such as proteases and protein–protein interactions. Currently, the MLR DockTScore is available at www.dockthor.lncc.br .
first_indexed 2024-12-17T10:10:34Z
format Article
id doaj.art-864f5a4a8d6545a5a092b4189c6f4888
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-12-17T10:10:34Z
publishDate 2021-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-864f5a4a8d6545a5a092b4189c6f48882022-12-21T21:53:02ZengNature PortfolioScientific Reports2045-23222021-02-0111111910.1038/s41598-021-82410-1New machine learning and physics-based scoring functions for drug discoveryIsabella A. Guedes0André M. S. Barreto1Diogo Marinho2Eduardo Krempser3Mélaine A. Kuenemann4Olivier Sperandio5Laurent E. Dardenne6Maria A. Miteva7Laboratório Nacional de Computação CientíficaLaboratório Nacional de Computação CientíficaLaboratório Nacional de Computação CientíficaFundação Oswaldo CruzInserm U973, Université Paris DiderotInserm U973, Université Paris DiderotLaboratório Nacional de Computação CientíficaInserm U973, Université Paris DiderotAbstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on precise physics-based descriptors better representing protein–ligand recognition process are strongly needed. We developed a set of new empirical scoring functions, named DockTScore, by explicitly accounting for physics-based terms combined with machine learning. Target-specific scoring functions were developed for two important drug targets, proteases and protein–protein interactions, representing an original class of molecules for drug discovery. Multiple linear regression (MLR), support vector machine and random forest algorithms were employed to derive general and target-specific scoring functions involving optimized MMFF94S force-field terms, solvation and lipophilic interactions terms, and an improved term accounting for ligand torsional entropy contribution to ligand binding. DockTScore scoring functions demonstrated to be competitive with the current best-evaluated scoring functions in terms of binding energy prediction and ranking on four DUD-E datasets and will be useful for in silico drug design for diverse proteins as well as for specific targets such as proteases and protein–protein interactions. Currently, the MLR DockTScore is available at www.dockthor.lncc.br .https://doi.org/10.1038/s41598-021-82410-1
spellingShingle Isabella A. Guedes
André M. S. Barreto
Diogo Marinho
Eduardo Krempser
Mélaine A. Kuenemann
Olivier Sperandio
Laurent E. Dardenne
Maria A. Miteva
New machine learning and physics-based scoring functions for drug discovery
Scientific Reports
title New machine learning and physics-based scoring functions for drug discovery
title_full New machine learning and physics-based scoring functions for drug discovery
title_fullStr New machine learning and physics-based scoring functions for drug discovery
title_full_unstemmed New machine learning and physics-based scoring functions for drug discovery
title_short New machine learning and physics-based scoring functions for drug discovery
title_sort new machine learning and physics based scoring functions for drug discovery
url https://doi.org/10.1038/s41598-021-82410-1
work_keys_str_mv AT isabellaaguedes newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT andremsbarreto newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT diogomarinho newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT eduardokrempser newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT melaineakuenemann newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT oliviersperandio newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT laurentedardenne newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery
AT mariaamiteva newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery