New machine learning and physics-based scoring functions for drug discovery
Abstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on p...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2021-02-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-82410-1 |
_version_ | 1818681917802807296 |
---|---|
author | Isabella A. Guedes André M. S. Barreto Diogo Marinho Eduardo Krempser Mélaine A. Kuenemann Olivier Sperandio Laurent E. Dardenne Maria A. Miteva |
author_facet | Isabella A. Guedes André M. S. Barreto Diogo Marinho Eduardo Krempser Mélaine A. Kuenemann Olivier Sperandio Laurent E. Dardenne Maria A. Miteva |
author_sort | Isabella A. Guedes |
collection | DOAJ |
description | Abstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on precise physics-based descriptors better representing protein–ligand recognition process are strongly needed. We developed a set of new empirical scoring functions, named DockTScore, by explicitly accounting for physics-based terms combined with machine learning. Target-specific scoring functions were developed for two important drug targets, proteases and protein–protein interactions, representing an original class of molecules for drug discovery. Multiple linear regression (MLR), support vector machine and random forest algorithms were employed to derive general and target-specific scoring functions involving optimized MMFF94S force-field terms, solvation and lipophilic interactions terms, and an improved term accounting for ligand torsional entropy contribution to ligand binding. DockTScore scoring functions demonstrated to be competitive with the current best-evaluated scoring functions in terms of binding energy prediction and ranking on four DUD-E datasets and will be useful for in silico drug design for diverse proteins as well as for specific targets such as proteases and protein–protein interactions. Currently, the MLR DockTScore is available at www.dockthor.lncc.br . |
first_indexed | 2024-12-17T10:10:34Z |
format | Article |
id | doaj.art-864f5a4a8d6545a5a092b4189c6f4888 |
institution | Directory Open Access Journal |
issn | 2045-2322 |
language | English |
last_indexed | 2024-12-17T10:10:34Z |
publishDate | 2021-02-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj.art-864f5a4a8d6545a5a092b4189c6f48882022-12-21T21:53:02ZengNature PortfolioScientific Reports2045-23222021-02-0111111910.1038/s41598-021-82410-1New machine learning and physics-based scoring functions for drug discoveryIsabella A. Guedes0André M. S. Barreto1Diogo Marinho2Eduardo Krempser3Mélaine A. Kuenemann4Olivier Sperandio5Laurent E. Dardenne6Maria A. Miteva7Laboratório Nacional de Computação CientíficaLaboratório Nacional de Computação CientíficaLaboratório Nacional de Computação CientíficaFundação Oswaldo CruzInserm U973, Université Paris DiderotInserm U973, Université Paris DiderotLaboratório Nacional de Computação CientíficaInserm U973, Université Paris DiderotAbstract Scoring functions are essential for modern in silico drug discovery. However, the accurate prediction of binding affinity by scoring functions remains a challenging task. The performance of scoring functions is very heterogeneous across different target classes. Scoring functions based on precise physics-based descriptors better representing protein–ligand recognition process are strongly needed. We developed a set of new empirical scoring functions, named DockTScore, by explicitly accounting for physics-based terms combined with machine learning. Target-specific scoring functions were developed for two important drug targets, proteases and protein–protein interactions, representing an original class of molecules for drug discovery. Multiple linear regression (MLR), support vector machine and random forest algorithms were employed to derive general and target-specific scoring functions involving optimized MMFF94S force-field terms, solvation and lipophilic interactions terms, and an improved term accounting for ligand torsional entropy contribution to ligand binding. DockTScore scoring functions demonstrated to be competitive with the current best-evaluated scoring functions in terms of binding energy prediction and ranking on four DUD-E datasets and will be useful for in silico drug design for diverse proteins as well as for specific targets such as proteases and protein–protein interactions. Currently, the MLR DockTScore is available at www.dockthor.lncc.br .https://doi.org/10.1038/s41598-021-82410-1 |
spellingShingle | Isabella A. Guedes André M. S. Barreto Diogo Marinho Eduardo Krempser Mélaine A. Kuenemann Olivier Sperandio Laurent E. Dardenne Maria A. Miteva New machine learning and physics-based scoring functions for drug discovery Scientific Reports |
title | New machine learning and physics-based scoring functions for drug discovery |
title_full | New machine learning and physics-based scoring functions for drug discovery |
title_fullStr | New machine learning and physics-based scoring functions for drug discovery |
title_full_unstemmed | New machine learning and physics-based scoring functions for drug discovery |
title_short | New machine learning and physics-based scoring functions for drug discovery |
title_sort | new machine learning and physics based scoring functions for drug discovery |
url | https://doi.org/10.1038/s41598-021-82410-1 |
work_keys_str_mv | AT isabellaaguedes newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT andremsbarreto newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT diogomarinho newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT eduardokrempser newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT melaineakuenemann newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT oliviersperandio newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT laurentedardenne newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery AT mariaamiteva newmachinelearningandphysicsbasedscoringfunctionsfordrugdiscovery |