Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem

Drug-induced liver injury is a major concern in the drug development process. Expensive and time-consuming <i>in vitro</i> and <i>in vivo</i> studies do not reflect the complexity of the phenomenon. Complementary to wet lab methods are <i>in silico</i> approaches,...

Full description

Bibliographic Details
Main Authors: Benjamin Bajželj, Viktor Drgan
Format: Article
Language:English
Published: MDPI AG 2020-01-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/25/3/481
_version_ 1818277616974561280
author Benjamin Bajželj
Viktor Drgan
author_facet Benjamin Bajželj
Viktor Drgan
author_sort Benjamin Bajželj
collection DOAJ
description Drug-induced liver injury is a major concern in the drug development process. Expensive and time-consuming <i>in vitro</i> and <i>in vivo</i> studies do not reflect the complexity of the phenomenon. Complementary to wet lab methods are <i>in silico</i> approaches, which present a cost-efficient method for toxicity prediction. The aim of our study was to explore the capabilities of counter-propagation artificial neural networks (CPANNs) for the classification of an imbalanced dataset related to idiosyncratic drug-induced liver injury and to develop a model for prediction of the hepatotoxic potential of drugs. Genetic algorithm optimization of CPANN models was used to build models for the classification of drugs into hepatotoxic and non-hepatotoxic class using molecular descriptors. For the classification of an imbalanced dataset, we modified the classical CPANN training algorithm by integrating random subsampling into the training procedure of CPANN to improve the classification ability of CPANN. According to the number of models accepted by internal validation and according to the prediction statistics on the external set, we concluded that using an imbalanced set with balanced subsampling in each learning epoch is a better approach compared to using a fixed balanced set in the case of the counter-propagation artificial neural network learning methodology.
first_indexed 2024-12-12T23:04:23Z
format Article
id doaj.art-b4513ba37f2e423e8906c9e4fbed32ac
institution Directory Open Access Journal
issn 1420-3049
language English
last_indexed 2024-12-12T23:04:23Z
publishDate 2020-01-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj.art-b4513ba37f2e423e8906c9e4fbed32ac2022-12-22T00:08:44ZengMDPI AGMolecules1420-30492020-01-0125348110.3390/molecules25030481molecules25030481Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification ProblemBenjamin Bajželj0Viktor Drgan1National Institute of Chemistry, Hajdrihova 19, 1001 Ljubljana, SloveniaNational Institute of Chemistry, Hajdrihova 19, 1001 Ljubljana, SloveniaDrug-induced liver injury is a major concern in the drug development process. Expensive and time-consuming <i>in vitro</i> and <i>in vivo</i> studies do not reflect the complexity of the phenomenon. Complementary to wet lab methods are <i>in silico</i> approaches, which present a cost-efficient method for toxicity prediction. The aim of our study was to explore the capabilities of counter-propagation artificial neural networks (CPANNs) for the classification of an imbalanced dataset related to idiosyncratic drug-induced liver injury and to develop a model for prediction of the hepatotoxic potential of drugs. Genetic algorithm optimization of CPANN models was used to build models for the classification of drugs into hepatotoxic and non-hepatotoxic class using molecular descriptors. For the classification of an imbalanced dataset, we modified the classical CPANN training algorithm by integrating random subsampling into the training procedure of CPANN to improve the classification ability of CPANN. According to the number of models accepted by internal validation and according to the prediction statistics on the external set, we concluded that using an imbalanced set with balanced subsampling in each learning epoch is a better approach compared to using a fixed balanced set in the case of the counter-propagation artificial neural network learning methodology.https://www.mdpi.com/1420-3049/25/3/481hepatotoxicitycounter-propagation artificial neural networksimbalanced datasetgenetic algorithmqsar
spellingShingle Benjamin Bajželj
Viktor Drgan
Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
Molecules
hepatotoxicity
counter-propagation artificial neural networks
imbalanced dataset
genetic algorithm
qsar
title Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
title_full Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
title_fullStr Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
title_full_unstemmed Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
title_short Hepatotoxicity Modeling Using Counter-Propagation Artificial Neural Networks: Handling an Imbalanced Classification Problem
title_sort hepatotoxicity modeling using counter propagation artificial neural networks handling an imbalanced classification problem
topic hepatotoxicity
counter-propagation artificial neural networks
imbalanced dataset
genetic algorithm
qsar
url https://www.mdpi.com/1420-3049/25/3/481
work_keys_str_mv AT benjaminbajzelj hepatotoxicitymodelingusingcounterpropagationartificialneuralnetworkshandlinganimbalancedclassificationproblem
AT viktordrgan hepatotoxicitymodelingusingcounterpropagationartificialneuralnetworkshandlinganimbalancedclassificationproblem