Adverse drug reaction prediction using voting ensemble training approach
Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested i...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Amirkabir University of Technology
2024-01-01
|
Series: | AUT Journal of Mathematics and Computing |
Subjects: | |
Online Access: | https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf |
_version_ | 1797307052596396032 |
---|---|
author | Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad |
author_facet | Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad |
author_sort | Milad Besharatifard |
collection | DOAJ |
description | Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs. |
first_indexed | 2024-03-08T00:51:45Z |
format | Article |
id | doaj.art-3940f4007be24e42bd58b9cc2aeeda3b |
institution | Directory Open Access Journal |
issn | 2783-2449 2783-2287 |
language | English |
last_indexed | 2024-03-08T00:51:45Z |
publishDate | 2024-01-01 |
publisher | Amirkabir University of Technology |
record_format | Article |
series | AUT Journal of Mathematics and Computing |
spelling | doaj.art-3940f4007be24e42bd58b9cc2aeeda3b2024-02-14T19:42:47ZengAmirkabir University of TechnologyAUT Journal of Mathematics and Computing2783-24492783-22872024-01-0151456010.22060/ajmc.2023.21538.10915122Adverse drug reaction prediction using voting ensemble training approachMilad Besharatifard0Zahra Ghorbanali1Fatemeh Zare Mirakabad2Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranComputational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), IranIdentifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdfadverse drug reactionmachine learningrandom forestrare adverse reactionsunbalanced dataset |
spellingShingle | Milad Besharatifard Zahra Ghorbanali Fatemeh Zare Mirakabad Adverse drug reaction prediction using voting ensemble training approach AUT Journal of Mathematics and Computing adverse drug reaction machine learning random forest rare adverse reactions unbalanced dataset |
title | Adverse drug reaction prediction using voting ensemble training approach |
title_full | Adverse drug reaction prediction using voting ensemble training approach |
title_fullStr | Adverse drug reaction prediction using voting ensemble training approach |
title_full_unstemmed | Adverse drug reaction prediction using voting ensemble training approach |
title_short | Adverse drug reaction prediction using voting ensemble training approach |
title_sort | adverse drug reaction prediction using voting ensemble training approach |
topic | adverse drug reaction machine learning random forest rare adverse reactions unbalanced dataset |
url | https://ajmc.aut.ac.ir/article_5122_9352bca128b578cfaf2b94905fc960d1.pdf |
work_keys_str_mv | AT miladbesharatifard adversedrugreactionpredictionusingvotingensembletrainingapproach AT zahraghorbanali adversedrugreactionpredictionusingvotingensembletrainingapproach AT fatemehzaremirakabad adversedrugreactionpredictionusingvotingensembletrainingapproach |