Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis

Our study aims to develop an effective integrated machine learning (ML) scheme to predict vascular events and bleeding in patients with nonvalvular atrial fibrillation taking dabigatran and identify important risk factors. This study is a post-hoc analysis from the Randomized Evaluation of Long-Term...

Full description

Bibliographic Details
Main Authors: Yung-Chuan Huang, Yu-Chen Cheng, Mao-Jhen Jhou, Mingchih Chen, Chi-Jie Lu
Format: Article
Language:English
Published: MDPI AG 2022-05-01
Series:Journal of Personalized Medicine
Subjects:
Online Access:https://www.mdpi.com/2075-4426/12/5/756
_version_ 1797498610509676544
author Yung-Chuan Huang
Yu-Chen Cheng
Mao-Jhen Jhou
Mingchih Chen
Chi-Jie Lu
author_facet Yung-Chuan Huang
Yu-Chen Cheng
Mao-Jhen Jhou
Mingchih Chen
Chi-Jie Lu
author_sort Yung-Chuan Huang
collection DOAJ
description Our study aims to develop an effective integrated machine learning (ML) scheme to predict vascular events and bleeding in patients with nonvalvular atrial fibrillation taking dabigatran and identify important risk factors. This study is a post-hoc analysis from the Randomized Evaluation of Long-Term Anticoagulant Therapy trial database. One traditional prediction method, logistic regression (LGR), and four ML techniques—naive Bayes, random forest (RF), classification and regression tree, and extreme gradient boosting (XGBoost)—were combined to construct our scheme. Area under the receiver operating characteristic curve (AUC) of RF (0.780) and XGBoost (0.717) was higher than that of LGR (0.674) in predicting vascular events. In predicting bleeding, AUC of RF (0.684) and XGBoost (0.618) showed higher values than those generated by LGR (0.605). Our integrated ML feature selection scheme based on the two convincing prediction techniques identified age, history of congestive heart failure and myocardial infarction, smoking, kidney function, and body mass index as major variables of vascular events; age, kidney function, smoking, bleeding history, concomitant use of specific drugs, and dabigatran dosage as major variables of bleeding. ML is an effective data analysis algorithm for solving complex medical data. Our results may provide preliminary direction for precision medicine.
first_indexed 2024-03-10T03:35:48Z
format Article
id doaj.art-851aaa8033a14ff08d2dc5d2bc0a6c07
institution Directory Open Access Journal
issn 2075-4426
language English
last_indexed 2024-03-10T03:35:48Z
publishDate 2022-05-01
publisher MDPI AG
record_format Article
series Journal of Personalized Medicine
spelling doaj.art-851aaa8033a14ff08d2dc5d2bc0a6c072023-11-23T11:44:14ZengMDPI AGJournal of Personalized Medicine2075-44262022-05-0112575610.3390/jpm12050756Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc AnalysisYung-Chuan Huang0Yu-Chen Cheng1Mao-Jhen Jhou2Mingchih Chen3Chi-Jie Lu4Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, TaiwanDepartment of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City 24352, TaiwanGraduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, TaiwanGraduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, TaiwanGraduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, TaiwanOur study aims to develop an effective integrated machine learning (ML) scheme to predict vascular events and bleeding in patients with nonvalvular atrial fibrillation taking dabigatran and identify important risk factors. This study is a post-hoc analysis from the Randomized Evaluation of Long-Term Anticoagulant Therapy trial database. One traditional prediction method, logistic regression (LGR), and four ML techniques—naive Bayes, random forest (RF), classification and regression tree, and extreme gradient boosting (XGBoost)—were combined to construct our scheme. Area under the receiver operating characteristic curve (AUC) of RF (0.780) and XGBoost (0.717) was higher than that of LGR (0.674) in predicting vascular events. In predicting bleeding, AUC of RF (0.684) and XGBoost (0.618) showed higher values than those generated by LGR (0.605). Our integrated ML feature selection scheme based on the two convincing prediction techniques identified age, history of congestive heart failure and myocardial infarction, smoking, kidney function, and body mass index as major variables of vascular events; age, kidney function, smoking, bleeding history, concomitant use of specific drugs, and dabigatran dosage as major variables of bleeding. ML is an effective data analysis algorithm for solving complex medical data. Our results may provide preliminary direction for precision medicine.https://www.mdpi.com/2075-4426/12/5/756arrhythmiacardioembolic strokenon-vitamin K antagonist oral anticoagulantsdabigatranmachine learning
spellingShingle Yung-Chuan Huang
Yu-Chen Cheng
Mao-Jhen Jhou
Mingchih Chen
Chi-Jie Lu
Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
Journal of Personalized Medicine
arrhythmia
cardioembolic stroke
non-vitamin K antagonist oral anticoagulants
dabigatran
machine learning
title Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
title_full Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
title_fullStr Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
title_full_unstemmed Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
title_short Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme—A Post Hoc Analysis
title_sort important risk factors in patients with nonvalvular atrial fibrillation taking dabigatran using integrated machine learning scheme a post hoc analysis
topic arrhythmia
cardioembolic stroke
non-vitamin K antagonist oral anticoagulants
dabigatran
machine learning
url https://www.mdpi.com/2075-4426/12/5/756
work_keys_str_mv AT yungchuanhuang importantriskfactorsinpatientswithnonvalvularatrialfibrillationtakingdabigatranusingintegratedmachinelearningschemeaposthocanalysis
AT yuchencheng importantriskfactorsinpatientswithnonvalvularatrialfibrillationtakingdabigatranusingintegratedmachinelearningschemeaposthocanalysis
AT maojhenjhou importantriskfactorsinpatientswithnonvalvularatrialfibrillationtakingdabigatranusingintegratedmachinelearningschemeaposthocanalysis
AT mingchihchen importantriskfactorsinpatientswithnonvalvularatrialfibrillationtakingdabigatranusingintegratedmachinelearningschemeaposthocanalysis
AT chijielu importantriskfactorsinpatientswithnonvalvularatrialfibrillationtakingdabigatranusingintegratedmachinelearningschemeaposthocanalysis