Early detection of coronary heart disease using ensemble techniques
Heart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of co...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-01-01
|
Series: | Informatics in Medicine Unlocked |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S235291482100143X |
_version_ | 1798035161076465664 |
---|---|
author | Vardhan Shorewala |
author_facet | Vardhan Shorewala |
author_sort | Vardhan Shorewala |
collection | DOAJ |
description | Heart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of coronary heart disease (CHD) using a risk factor approach. Predictive techniques such as K-Nearest Neighbors, Binary Logistic Classification, and Naive Bayes are evaluated on the basis of metrics such as accuracy, recall, and ROC curves. These base classifiers are compared against ensemble modelling techniques such as bagging, boosting, and, stacking. A comparitive analytical approach was used to determine how ensemble techniques can be used to improve prediction accuracy of coronary heart disease. The modelling technqiues are tested on the ‘Cardiovascular Disease Dataset,’ which contains 70,000 records of patient data for coronary heart disease. Bagged models are shown to have an averaged increased accuracy of 1.96% in comparison to their traditional counterparts. Boosted models had an average accuracy of 73.4% but had the highest AUC score of 0.73. The stacked model involving KNN, random forest classifier, and SVM proved to be the most effective with a final accuracy of 75.1%. In addition, the perfomance of the tested models was validated using data-analytic technqiues and K-Folds cross-validation. |
first_indexed | 2024-04-11T20:54:23Z |
format | Article |
id | doaj.art-19cb27056f11465d8310bd4f550f05ec |
institution | Directory Open Access Journal |
issn | 2352-9148 |
language | English |
last_indexed | 2024-04-11T20:54:23Z |
publishDate | 2021-01-01 |
publisher | Elsevier |
record_format | Article |
series | Informatics in Medicine Unlocked |
spelling | doaj.art-19cb27056f11465d8310bd4f550f05ec2022-12-22T04:03:44ZengElsevierInformatics in Medicine Unlocked2352-91482021-01-0126100655Early detection of coronary heart disease using ensemble techniquesVardhan Shorewala0Dhirubhai Ambani International School, Mumbai, IndiaHeart disease is one the leading causes of death globally, making the early detection of it crucial. Emerging technologies such as machine learning and deep learning are now being actively used in biomedical care, healthcare, and disease prediction. The focus of this paper is on the prediction of coronary heart disease (CHD) using a risk factor approach. Predictive techniques such as K-Nearest Neighbors, Binary Logistic Classification, and Naive Bayes are evaluated on the basis of metrics such as accuracy, recall, and ROC curves. These base classifiers are compared against ensemble modelling techniques such as bagging, boosting, and, stacking. A comparitive analytical approach was used to determine how ensemble techniques can be used to improve prediction accuracy of coronary heart disease. The modelling technqiues are tested on the ‘Cardiovascular Disease Dataset,’ which contains 70,000 records of patient data for coronary heart disease. Bagged models are shown to have an averaged increased accuracy of 1.96% in comparison to their traditional counterparts. Boosted models had an average accuracy of 73.4% but had the highest AUC score of 0.73. The stacked model involving KNN, random forest classifier, and SVM proved to be the most effective with a final accuracy of 75.1%. In addition, the perfomance of the tested models was validated using data-analytic technqiues and K-Folds cross-validation.http://www.sciencedirect.com/science/article/pii/S235291482100143XMachine learningHeart diseaseArtificial intelligenceHybrid modellingCoronary heart diseaseEnsemble techniques |
spellingShingle | Vardhan Shorewala Early detection of coronary heart disease using ensemble techniques Informatics in Medicine Unlocked Machine learning Heart disease Artificial intelligence Hybrid modelling Coronary heart disease Ensemble techniques |
title | Early detection of coronary heart disease using ensemble techniques |
title_full | Early detection of coronary heart disease using ensemble techniques |
title_fullStr | Early detection of coronary heart disease using ensemble techniques |
title_full_unstemmed | Early detection of coronary heart disease using ensemble techniques |
title_short | Early detection of coronary heart disease using ensemble techniques |
title_sort | early detection of coronary heart disease using ensemble techniques |
topic | Machine learning Heart disease Artificial intelligence Hybrid modelling Coronary heart disease Ensemble techniques |
url | http://www.sciencedirect.com/science/article/pii/S235291482100143X |
work_keys_str_mv | AT vardhanshorewala earlydetectionofcoronaryheartdiseaseusingensembletechniques |