Extracting medicinal chemistry intuition via preference machine learning

Abstract The lead optimization process in drug discovery campaigns is an arduous endeavour where the input of many medicinal chemists is weighed in order to reach a desired molecular property profile. Building the expertise to successfully drive such projects collaboratively is a very time-consuming...

Full description

Bibliographic Details
Main Authors: Oh-Hyeon Choung, Riccardo Vianello, Marwin Segler, Nikolaus Stiefl, José Jiménez-Luna
Format: Article
Language:English
Published: Nature Portfolio 2023-10-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-023-42242-1
Description
Summary:Abstract The lead optimization process in drug discovery campaigns is an arduous endeavour where the input of many medicinal chemists is weighed in order to reach a desired molecular property profile. Building the expertise to successfully drive such projects collaboratively is a very time-consuming process that typically spans many years within a chemist’s career. In this work we aim to replicate this process by applying artificial intelligence learning-to-rank techniques on feedback that was obtained from 35 chemists at Novartis over the course of several months. We exemplify the usefulness of the learned proxies in routine tasks such as compound prioritization, motif rationalization, and biased de novo drug design. Annotated response data is provided, and developed models and code made available through a permissive open-source license.
ISSN:2041-1723