Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions

Nowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described...

Full description

Bibliographic Details
Main Authors: Assima Rakhimbekova, Timur I. Madzhidov, Ramil I. Nugmanov, Timur R. Gimadiev, Igor I. Baskin, Alexandre Varnek
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/21/15/5542
_version_ 1797560469794324480
author Assima Rakhimbekova
Timur I. Madzhidov
Ramil I. Nugmanov
Timur R. Gimadiev
Igor I. Baskin
Alexandre Varnek
author_facet Assima Rakhimbekova
Timur I. Madzhidov
Ramil I. Nugmanov
Timur R. Gimadiev
Igor I. Baskin
Alexandre Varnek
author_sort Assima Rakhimbekova
collection DOAJ
description Nowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described in the literature, no one for chemical reactions (Quantitative Reaction-Property Relationships (QRPR)) has been reported to date. The point is that a chemical reaction is a much more complex object than an individual molecule, and its yield, thermodynamic and kinetic characteristics depend not only on the structures of reactants and products but also on experimental conditions. The QRPR models’ performance largely depends on the way that chemical transformation is encoded. In this study, various AD definition methods extensively used in QSAR/QSPR studies of individual molecules, as well as several novel approaches suggested in this work for reactions, were benchmarked on several reaction datasets. The ability to exclude wrong reaction types, increase coverage, improve the model performance and detect Y-outliers were tested. As a result, several “best” AD definitions for the QRPR models predicting reaction characteristics have been revealed and tested on a previously published external dataset with a clear AD definition problem.
first_indexed 2024-03-10T18:01:00Z
format Article
id doaj.art-680f30657de04cb7bcbec989d52edcd6
institution Directory Open Access Journal
issn 1661-6596
1422-0067
language English
last_indexed 2024-03-10T18:01:00Z
publishDate 2020-08-01
publisher MDPI AG
record_format Article
series International Journal of Molecular Sciences
spelling doaj.art-680f30657de04cb7bcbec989d52edcd62023-11-20T08:53:22ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672020-08-012115554210.3390/ijms21155542Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical ReactionsAssima Rakhimbekova0Timur I. Madzhidov1Ramil I. Nugmanov2Timur R. Gimadiev3Igor I. Baskin4Alexandre Varnek5A.M. Butlerov Institute of Chemistry, Kazan Federal University, 420008 Kazan, RussiaA.M. Butlerov Institute of Chemistry, Kazan Federal University, 420008 Kazan, RussiaA.M. Butlerov Institute of Chemistry, Kazan Federal University, 420008 Kazan, RussiaInstitute for Chemical Reaction Design and Discovery, Hokkaido University, Sapporo 001-0021, JapanA.M. Butlerov Institute of Chemistry, Kazan Federal University, 420008 Kazan, RussiaInstitute for Chemical Reaction Design and Discovery, Hokkaido University, Sapporo 001-0021, JapanNowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described in the literature, no one for chemical reactions (Quantitative Reaction-Property Relationships (QRPR)) has been reported to date. The point is that a chemical reaction is a much more complex object than an individual molecule, and its yield, thermodynamic and kinetic characteristics depend not only on the structures of reactants and products but also on experimental conditions. The QRPR models’ performance largely depends on the way that chemical transformation is encoded. In this study, various AD definition methods extensively used in QSAR/QSPR studies of individual molecules, as well as several novel approaches suggested in this work for reactions, were benchmarked on several reaction datasets. The ability to exclude wrong reaction types, increase coverage, improve the model performance and detect Y-outliers were tested. As a result, several “best” AD definitions for the QRPR models predicting reaction characteristics have been revealed and tested on a previously published external dataset with a clear AD definition problem.https://www.mdpi.com/1422-0067/21/15/5542applicability domainQuantitative Reaction–Property RelationshipQSAR/QSPRchemical reactionschemoinformaticsmachine learning
spellingShingle Assima Rakhimbekova
Timur I. Madzhidov
Ramil I. Nugmanov
Timur R. Gimadiev
Igor I. Baskin
Alexandre Varnek
Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
International Journal of Molecular Sciences
applicability domain
Quantitative Reaction–Property Relationship
QSAR/QSPR
chemical reactions
chemoinformatics
machine learning
title Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_full Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_fullStr Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_full_unstemmed Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_short Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_sort comprehensive analysis of applicability domains of qspr models for chemical reactions
topic applicability domain
Quantitative Reaction–Property Relationship
QSAR/QSPR
chemical reactions
chemoinformatics
machine learning
url https://www.mdpi.com/1422-0067/21/15/5542
work_keys_str_mv AT assimarakhimbekova comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT timurimadzhidov comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT ramilinugmanov comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT timurrgimadiev comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT igoribaskin comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT alexandrevarnek comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions