Pareto Optimization to Accelerate Multi-Objective Virtual Screening

The discovery of therapeutic molecules is fundamentally a multi-objective optimization problem. One formulation of the problem is to identify molecules that simultaneously exhibit strong binding affinity for a target protein, minimal off-target interactions, and suitable pharmacokinetic properties....

Full beskrivning

Bibliografiska uppgifter
Huvudupphovsmän: Fromer, Jenna C., Graff, David E., Coley, Connor W.
Övriga upphovsmän: Massachusetts Institute of Technology. Department of Chemical Engineering
Materialtyp: Artikel
Publicerad: Royal Society of Chemistry 2024
Länkar:https://hdl.handle.net/1721.1/156714
_version_ 1826189934481899520
author Fromer, Jenna C.
Graff, David E.
Coley, Connor W.
author2 Massachusetts Institute of Technology. Department of Chemical Engineering
author_facet Massachusetts Institute of Technology. Department of Chemical Engineering
Fromer, Jenna C.
Graff, David E.
Coley, Connor W.
author_sort Fromer, Jenna C.
collection MIT
description The discovery of therapeutic molecules is fundamentally a multi-objective optimization problem. One formulation of the problem is to identify molecules that simultaneously exhibit strong binding affinity for a target protein, minimal off-target interactions, and suitable pharmacokinetic properties. Inspired by prior work that uses active learning to accelerate the identification of strong binders, we implement multi-objective Bayesian optimization to reduce the computational cost of multi-property virtual screening and apply it to the identification of ligands predicted to be selective based on docking scores to on- and off-targets. We demonstrate the superiority of Pareto optimization over scalarization across three case studies. Further, we use the developed optimization tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R, acquiring 100% of the molecules that form the library's Pareto front after exploring only 8% of the library. This workflow and associated open source software can reduce the screening burden of molecular design projects and is complementary to research aiming to improve the accuracy of binding predictions and other molecular properties.
first_indexed 2024-09-23T08:30:55Z
format Article
id mit-1721.1/156714
institution Massachusetts Institute of Technology
last_indexed 2025-02-19T04:17:02Z
publishDate 2024
publisher Royal Society of Chemistry
record_format dspace
spelling mit-1721.1/1567142024-12-23T06:19:02Z Pareto Optimization to Accelerate Multi-Objective Virtual Screening Fromer, Jenna C. Graff, David E. Coley, Connor W. Massachusetts Institute of Technology. Department of Chemical Engineering Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science The discovery of therapeutic molecules is fundamentally a multi-objective optimization problem. One formulation of the problem is to identify molecules that simultaneously exhibit strong binding affinity for a target protein, minimal off-target interactions, and suitable pharmacokinetic properties. Inspired by prior work that uses active learning to accelerate the identification of strong binders, we implement multi-objective Bayesian optimization to reduce the computational cost of multi-property virtual screening and apply it to the identification of ligands predicted to be selective based on docking scores to on- and off-targets. We demonstrate the superiority of Pareto optimization over scalarization across three case studies. Further, we use the developed optimization tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R, acquiring 100% of the molecules that form the library's Pareto front after exploring only 8% of the library. This workflow and associated open source software can reduce the screening burden of molecular design projects and is complementary to research aiming to improve the accuracy of binding predictions and other molecular properties. 2024-09-12T19:50:44Z 2024-09-12T19:50:44Z 2024-01-24 Article http://purl.org/eprint/type/JournalArticle 2635-098X https://hdl.handle.net/1721.1/156714 Digital Discovery, 2024, 3, 467-481 https://doi.org/10.1039/D3DD00227F Digital Discovery Creative Commons Attribution-Noncommercial https://creativecommons.org/licenses/by-nc/3.0/ application/pdf Royal Society of Chemistry Royal Society of Chemistry
spellingShingle Fromer, Jenna C.
Graff, David E.
Coley, Connor W.
Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title_full Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title_fullStr Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title_full_unstemmed Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title_short Pareto Optimization to Accelerate Multi-Objective Virtual Screening
title_sort pareto optimization to accelerate multi objective virtual screening
url https://hdl.handle.net/1721.1/156714
work_keys_str_mv AT fromerjennac paretooptimizationtoacceleratemultiobjectivevirtualscreening
AT graffdavide paretooptimizationtoacceleratemultiobjectivevirtualscreening
AT coleyconnorw paretooptimizationtoacceleratemultiobjectivevirtualscreening