Early detection of coronary heart disease using ensemble techniques

Heart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of co...

Full description

Bibliographic Details
Main Author: Vardhan Shorewala
Format: Article
Language:English
Published: Elsevier 2021-01-01
Series:Informatics in Medicine Unlocked
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S235291482100143X
_version_ 1798035161076465664
author Vardhan Shorewala
author_facet Vardhan Shorewala
author_sort Vardhan Shorewala
collection DOAJ
description Heart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of coronary heart disease (CHD) using a risk factor approach. Predictive techniques such as K-Nearest Neighbors, Binary Logistic Classification, and Naive Bayes are evaluated on the basis of metrics such as accuracy, recall, and ROC curves. These base classifiers are compared against ensemble modelling techniques such as bagging, boosting, and, stacking. A comparitive analytical approach was used to determine how ensemble techniques can be used to improve prediction accuracy of coronary heart disease. The modelling technqiues are tested on the ‘Cardiovascular Disease Dataset,’ which contains 70,000 records of patient data for coronary heart disease. Bagged models are shown to have an averaged increased accuracy of 1.96% in comparison to their traditional counterparts. Boosted models had an average accuracy of 73.4% but had the highest AUC score of 0.73. The stacked model involving KNN, random forest classifier, and SVM proved to be the most effective with a final accuracy of 75.1%. In addition, the perfomance of the tested models was validated using data-analytic technqiues and K-Folds cross-validation.
first_indexed 2024-04-11T20:54:23Z
format Article
id doaj.art-19cb27056f11465d8310bd4f550f05ec
institution Directory Open Access Journal
issn 2352-9148
language English
last_indexed 2024-04-11T20:54:23Z
publishDate 2021-01-01
publisher Elsevier
record_format Article
series Informatics in Medicine Unlocked
spelling doaj.art-19cb27056f11465d8310bd4f550f05ec2022-12-22T04:03:44ZengElsevierInformatics in Medicine Unlocked2352-91482021-01-0126100655Early detection of coronary heart disease using ensemble techniquesVardhan Shorewala0Dhirubhai Ambani International School, Mumbai, IndiaHeart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of coronary heart disease (CHD) using a risk factor approach. Predictive techniques such as K-Nearest Neighbors, Binary Logistic Classification, and Naive Bayes are evaluated on the basis of metrics such as accuracy, recall, and ROC curves. These base classifiers are compared against ensemble modelling techniques such as bagging, boosting, and, stacking. A comparitive analytical approach was used to determine how ensemble techniques can be used to improve prediction accuracy of coronary heart disease. The modelling technqiues are tested on the ‘Cardiovascular Disease Dataset,’ which contains 70,000 records of patient data for coronary heart disease. Bagged models are shown to have an averaged increased accuracy of 1.96% in comparison to their traditional counterparts. Boosted models had an average accuracy of 73.4% but had the highest AUC score of 0.73. The stacked model involving KNN, random forest classifier, and SVM proved to be the most effective with a final accuracy of 75.1%. In addition, the perfomance of the tested models was validated using data-analytic technqiues and K-Folds cross-validation.http://www.sciencedirect.com/science/article/pii/S235291482100143XMachine learningHeart diseaseArtificial intelligenceHybrid modellingCoronary heart diseaseEnsemble techniques
spellingShingle Vardhan Shorewala
Early detection of coronary heart disease using ensemble techniques
Informatics in Medicine Unlocked
Machine learning
Heart disease
Artificial intelligence
Hybrid modelling
Coronary heart disease
Ensemble techniques
title Early detection of coronary heart disease using ensemble techniques
title_full Early detection of coronary heart disease using ensemble techniques
title_fullStr Early detection of coronary heart disease using ensemble techniques
title_full_unstemmed Early detection of coronary heart disease using ensemble techniques
title_short Early detection of coronary heart disease using ensemble techniques
title_sort early detection of coronary heart disease using ensemble techniques
topic Machine learning
Heart disease
Artificial intelligence
Hybrid modelling
Coronary heart disease
Ensemble techniques
url http://www.sciencedirect.com/science/article/pii/S235291482100143X
work_keys_str_mv AT vardhanshorewala earlydetectionofcoronaryheartdiseaseusingensembletechniques