Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration

Abstract The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical s...

Full description

Bibliographic Details
Main Authors: Lin Chen, Qing Shen, Jungang Lou
Format: Article
Language:English
Published: BMC 2023-04-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-023-05286-0
_version_ 1797836252481847296
author Lin Chen
Qing Shen
Jungang Lou
author_facet Lin Chen
Qing Shen
Jungang Lou
author_sort Lin Chen
collection DOAJ
description Abstract The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Recently, some work combined reinforcement learning strategies with recurrent neural network (RNN)-based models to optimize the property of generated small molecules, which notably improved a batch of critical factors for these candidates. However, a common problem among these RNN-based methods is that several generated molecules have difficulty in synthesizing despite owning higher desired properties such as binding affinity. However, RNN-based framework better reproduces the molecule distribution among the training set than other categories of models during molecule exploration tasks. Thus, to optimize the whole exploration process and make it contribute to the optimization of specified molecules, we devised a light-weighted pipeline called Magicmol; this pipeline has a re-mastered RNN network and utilize SELFIES presentation instead of SMILES. Our backbone model achieved extraordinary performance while reducing the training cost; moreover, we devised reward truncate strategies to eliminate the model collapse problem. Additionally, adopting SELFIES presentation made it possible to combine STONED-SELFIES as a post-processing procedure for specified molecule optimization and quick chemical space exploration.
first_indexed 2024-04-09T15:06:55Z
format Article
id doaj.art-f4adafa4fb92440d9220ee33e090424a
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-09T15:06:55Z
publishDate 2023-04-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-f4adafa4fb92440d9220ee33e090424a2023-04-30T11:30:11ZengBMCBMC Bioinformatics1471-21052023-04-0124111810.1186/s12859-023-05286-0Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space explorationLin Chen0Qing Shen1Jungang Lou2Yangtze Delta Region (Huzhou) Institute of Intelligent Transportation, Huzhou UniversityYangtze Delta Region (Huzhou) Institute of Intelligent Transportation, Huzhou UniversityYangtze Delta Region (Huzhou) Institute of Intelligent Transportation, Huzhou UniversityAbstract The flourishment of machine learning and deep learning methods has boosted the development of cheminformatics, especially regarding the application of drug discovery and new material exploration. Lower time and space expenses make it possible for scientists to search the enormous chemical space. Recently, some work combined reinforcement learning strategies with recurrent neural network (RNN)-based models to optimize the property of generated small molecules, which notably improved a batch of critical factors for these candidates. However, a common problem among these RNN-based methods is that several generated molecules have difficulty in synthesizing despite owning higher desired properties such as binding affinity. However, RNN-based framework better reproduces the molecule distribution among the training set than other categories of models during molecule exploration tasks. Thus, to optimize the whole exploration process and make it contribute to the optimization of specified molecules, we devised a light-weighted pipeline called Magicmol; this pipeline has a re-mastered RNN network and utilize SELFIES presentation instead of SMILES. Our backbone model achieved extraordinary performance while reducing the training cost; moreover, we devised reward truncate strategies to eliminate the model collapse problem. Additionally, adopting SELFIES presentation made it possible to combine STONED-SELFIES as a post-processing procedure for specified molecule optimization and quick chemical space exploration.https://doi.org/10.1186/s12859-023-05286-0Generative modelsReinforcement learningDeep learningSynthetic accessibilityDe novo drug design
spellingShingle Lin Chen
Qing Shen
Jungang Lou
Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
BMC Bioinformatics
Generative models
Reinforcement learning
Deep learning
Synthetic accessibility
De novo drug design
title Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_full Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_fullStr Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_full_unstemmed Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_short Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration
title_sort magicmol a light weighted pipeline for drug like molecule evolution and quick chemical space exploration
topic Generative models
Reinforcement learning
Deep learning
Synthetic accessibility
De novo drug design
url https://doi.org/10.1186/s12859-023-05286-0
work_keys_str_mv AT linchen magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration
AT qingshen magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration
AT junganglou magicmolalightweightedpipelinefordruglikemoleculeevolutionandquickchemicalspaceexploration