Adverse drug reaction prediction using voting ensemble training approach

Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested i...

Full description

Bibliographic Details
Main Authors:	Milad Besharatifard, Zahra Ghorbanali, Fatemeh Zare Mirakabad
Format:	Article
Language:	English
Published:	Amirkabir University of Technology 2024-01-01
Series:	AUT Journal of Mathematics and Computing
Subjects:	adverse drug reaction machine learning random forest rare adverse reactions unbalanced dataset
Online Access:	https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf

_version_	1797307052596396032
author	Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad
author_facet	Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad
author_sort	Milad Besharatifard
collection	DOAJ
description	Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.
first_indexed	2024-03-08T00:51:45Z
format	Article
id	doaj.art-3940f4007be24e42bd58b9cc2aeeda3b
institution	Directory Open Access Journal
issn	2783-2449 2783-2287
language	English
last_indexed	2024-03-08T00:51:45Z
publishDate	2024-01-01
publisher	Amirkabir University of Technology
record_format	Article
series	AUT Journal of Mathematics and Computing
spelling	doaj.art-3940f4007be24e42bd58b9cc2aeeda3b2024-02-14T19:42:47ZengAmirkabir University of TechnologyAUT Journal of Mathematics and Computing2783-24492783-22872024-01-0151456010.22060/ajmc.2023.21538.10915122Adverse drug reaction prediction using voting ensemble training approachMilad Besharatifard0Zahra Ghorbanali1Fatemeh Zare Mirakabad2Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranIdentifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdfadverse drug reactionmachine learningrandom forestrare adverse reactionsunbalanced dataset
spellingShingle	Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad Adverse drug reaction prediction using voting ensemble training approach AUT Journal of Mathematics and Computing adverse drug reaction machine learning random forest rare adverse reactions unbalanced dataset
title	Adverse drug reaction prediction using voting ensemble training approach
title_full	Adverse drug reaction prediction using voting ensemble training approach
title_fullStr	Adverse drug reaction prediction using voting ensemble training approach
title_full_unstemmed	Adverse drug reaction prediction using voting ensemble training approach
title_short	Adverse drug reaction prediction using voting ensemble training approach
title_sort	adverse drug reaction prediction using voting ensemble training approach
topic	adverse drug reaction machine learning random forest rare adverse reactions unbalanced dataset
url	https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf
work_keys_str_mv	AT miladbesharatifard adversedrugreactionpredictionusingvotingensembletrainingapproach AT zahraghorbanali adversedrugreactionpredictionusingvotingensembletrainingapproach AT fatemehzaremirakabad adversedrugreactionpredictionusingvotingensembletrainingapproach

Adverse drug reaction prediction using voting ensemble training approach

Similar Items