MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation

(1) Background: Data accuracy plays a key role in determining the model performances and the field of metabolism prediction suffers from the lack of truly reliable data. To enhance the accuracy of metabolic data, we recently proposed a manually curated database collected by a meta-analysis of the sp...

Full description

Bibliographic Details
Main Authors: Angelica Mazzolari, Luca Sommaruga, Alessandro Pedretti, Giulio Vistoli
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/26/7/2098
_version_ 1797538711793041408
author Angelica Mazzolari
Luca Sommaruga
Alessandro Pedretti
Giulio Vistoli
author_facet Angelica Mazzolari
Luca Sommaruga
Alessandro Pedretti
Giulio Vistoli
author_sort Angelica Mazzolari
collection DOAJ
description (1) Background: Data accuracy plays a key role in determining the model performances and the field of metabolism prediction suffers from the lack of truly reliable data. To enhance the accuracy of metabolic data, we recently proposed a manually curated database collected by a meta-analysis of the specialized literature (MetaQSAR). Here we aim to further increase data accuracy by focusing on publications reporting exhaustive metabolic trees. This selection should indeed reduce the number of false negative data. (2) Methods: A new metabolic database (MetaTREE) was thus collected and utilized to extract a dataset for metabolic data concerning glutathione conjugation (MT-dataset). After proper pre-processing, this dataset, along with the corresponding dataset extracted from MetaQSAR (MQ-dataset), was utilized to develop binary classification models using a random forest algorithm. (3) Results: The comparison of the models generated by the two collected datasets reveals the better performances reached by the MT-dataset (MCC raised from 0.63 to 0.67, sensitivity from 0.56 to 0.58). The analysis of the applicability domain also confirms that the model based on the MT-dataset shows a more robust predictive power with a larger applicability domain. (4) Conclusions: These results confirm that focusing on metabolic trees represents a convenient approach to increase data accuracy by reducing the false negative cases. The encouraging performances shown by the models developed by the MT-dataset invites to use of MetaTREE for predictive studies in the field of xenobiotic metabolism.
first_indexed 2024-03-10T12:34:17Z
format Article
id doaj.art-e086f426da0648c8901a5aeeb96e036b
institution Directory Open Access Journal
issn 1420-3049
language English
last_indexed 2024-03-10T12:34:17Z
publishDate 2021-04-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj.art-e086f426da0648c8901a5aeeb96e036b2023-11-21T14:24:10ZengMDPI AGMolecules1420-30492021-04-01267209810.3390/molecules26072098MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione ConjugationAngelica Mazzolari0Luca Sommaruga1Alessandro Pedretti2Giulio Vistoli3Dipartimento di Scienze Farmaceutiche, Università degli Studi di Milano, Via Mangiagalli 25, I-20133 Milano, ItalyDipartimento di Scienze Farmaceutiche, Università degli Studi di Milano, Via Mangiagalli 25, I-20133 Milano, ItalyDipartimento di Scienze Farmaceutiche, Università degli Studi di Milano, Via Mangiagalli 25, I-20133 Milano, ItalyDipartimento di Scienze Farmaceutiche, Università degli Studi di Milano, Via Mangiagalli 25, I-20133 Milano, Italy(1) Background: Data accuracy plays a key role in determining the model performances and the field of metabolism prediction suffers from the lack of truly reliable data. To enhance the accuracy of metabolic data, we recently proposed a manually curated database collected by a meta-analysis of the specialized literature (MetaQSAR). Here we aim to further increase data accuracy by focusing on publications reporting exhaustive metabolic trees. This selection should indeed reduce the number of false negative data. (2) Methods: A new metabolic database (MetaTREE) was thus collected and utilized to extract a dataset for metabolic data concerning glutathione conjugation (MT-dataset). After proper pre-processing, this dataset, along with the corresponding dataset extracted from MetaQSAR (MQ-dataset), was utilized to develop binary classification models using a random forest algorithm. (3) Results: The comparison of the models generated by the two collected datasets reveals the better performances reached by the MT-dataset (MCC raised from 0.63 to 0.67, sensitivity from 0.56 to 0.58). The analysis of the applicability domain also confirms that the model based on the MT-dataset shows a more robust predictive power with a larger applicability domain. (4) Conclusions: These results confirm that focusing on metabolic trees represents a convenient approach to increase data accuracy by reducing the false negative cases. The encouraging performances shown by the models developed by the MT-dataset invites to use of MetaTREE for predictive studies in the field of xenobiotic metabolism.https://www.mdpi.com/1420-3049/26/7/2098drug metabolismglutathione conjugationdata accuracymetabolic treeMetaQSARclassification algorithms
spellingShingle Angelica Mazzolari
Luca Sommaruga
Alessandro Pedretti
Giulio Vistoli
MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
Molecules
drug metabolism
glutathione conjugation
data accuracy
metabolic tree
MetaQSAR
classification algorithms
title MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
title_full MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
title_fullStr MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
title_full_unstemmed MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
title_short MetaTREE, a Novel Database Focused on Metabolic Trees, Predicts an Important Detoxification Mechanism: The Glutathione Conjugation
title_sort metatree a novel database focused on metabolic trees predicts an important detoxification mechanism the glutathione conjugation
topic drug metabolism
glutathione conjugation
data accuracy
metabolic tree
MetaQSAR
classification algorithms
url https://www.mdpi.com/1420-3049/26/7/2098
work_keys_str_mv AT angelicamazzolari metatreeanoveldatabasefocusedonmetabolictreespredictsanimportantdetoxificationmechanismtheglutathioneconjugation
AT lucasommaruga metatreeanoveldatabasefocusedonmetabolictreespredictsanimportantdetoxificationmechanismtheglutathioneconjugation
AT alessandropedretti metatreeanoveldatabasefocusedonmetabolictreespredictsanimportantdetoxificationmechanismtheglutathioneconjugation
AT giuliovistoli metatreeanoveldatabasefocusedonmetabolictreespredictsanimportantdetoxificationmechanismtheglutathioneconjugation