Adverse drug reaction prediction using voting ensemble training approach

Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested i...

Full description

Bibliographic Details
Main Authors: Milad Besharatifard, Zahra Ghorbanali, Fatemeh Zare Mirakabad
Format: Article
Language:English
Published: Amirkabir University of Technology 2024-01-01
Series:AUT Journal of Mathematics and Computing
Subjects:
Online Access:https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf
_version_ 1797307052596396032
author Milad Besharatifard
Zahra Ghorbanali
Fatemeh Zare Mirakabad
author_facet Milad Besharatifard
Zahra Ghorbanali
Fatemeh Zare Mirakabad
author_sort Milad Besharatifard
collection DOAJ
description Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.
first_indexed 2024-03-08T00:51:45Z
format Article
id doaj.art-3940f4007be24e42bd58b9cc2aeeda3b
institution Directory Open Access Journal
issn 2783-2449
2783-2287
language English
last_indexed 2024-03-08T00:51:45Z
publishDate 2024-01-01
publisher Amirkabir University of Technology
record_format Article
series AUT Journal of Mathematics and Computing
spelling doaj.art-3940f4007be24e42bd58b9cc2aeeda3b2024-02-14T19:42:47ZengAmirkabir University of TechnologyAUT Journal of Mathematics and Computing2783-24492783-22872024-01-0151456010.22060/ajmc.2023.21538.10915122Adverse drug reaction prediction using voting ensemble training approachMilad Besharatifard0Zahra Ghorbanali1Fatemeh Zare Mirakabad2Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranIdentifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdfadverse drug reactionmachine learningrandom forestrare adverse reactionsunbalanced dataset
spellingShingle Milad Besharatifard
Zahra Ghorbanali
Fatemeh Zare Mirakabad
Adverse drug reaction prediction using voting ensemble training approach
AUT Journal of Mathematics and Computing
adverse drug reaction
machine learning
random forest
rare adverse reactions
unbalanced dataset
title Adverse drug reaction prediction using voting ensemble training approach
title_full Adverse drug reaction prediction using voting ensemble training approach
title_fullStr Adverse drug reaction prediction using voting ensemble training approach
title_full_unstemmed Adverse drug reaction prediction using voting ensemble training approach
title_short Adverse drug reaction prediction using voting ensemble training approach
title_sort adverse drug reaction prediction using voting ensemble training approach
topic adverse drug reaction
machine learning
random forest
rare adverse reactions
unbalanced dataset
url https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf
work_keys_str_mv AT miladbesharatifard adversedrugreactionpredictionusingvotingensembletrainingapproach
AT zahraghorbanali adversedrugreactionpredictionusingvotingensembletrainingapproach
AT fatemehzaremirakabad adversedrugreactionpredictionusingvotingensembletrainingapproach