Risk factor identification for stroke prognosis using machine-learning algorithms

Stroke is a life-threatening condition causing the second-leading number of deaths worldwide. It is a challenging problem in the public health domain of the 21st century for healthcare professionals and researchers. So, proper monitoring of stroke can prevent and reduce its severity. Risk factor ana...

Full description

Bibliographic Details
Main Author: Tanvir Ahammad
Format: Article
Language:English
Published: Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT) 2022-09-01
Series:Jordanian Journal of Computers and Information Technology
Subjects:
Online Access:http://www.ejmanager.com/fulltextpdf.php?mno=35136
_version_ 1811185656247877632
author Tanvir Ahammad
author_facet Tanvir Ahammad
author_sort Tanvir Ahammad
collection DOAJ
description Stroke is a life-threatening condition causing the second-leading number of deaths worldwide. It is a challenging problem in the public health domain of the 21st century for healthcare professionals and researchers. So, proper monitoring of stroke can prevent and reduce its severity. Risk factor analysis is one of the promising approaches for identifying the presence of stroke disease. Numerous researches have focused on forecasting strokes for patients. The majority had a good accuracy ratio, around 90%, on the publicly available dataset. Combining several preprocessing tasks can considerably increase the quality of classifiers, an area of research need. Additionally, the researchers should pinpoint the major risk factors for stroke disease and use advanced classifiers to forecast the likelihood of stroke. This article presents an enhanced approach for identifying the potential risk factors and predicting the incidence of stroke on a publicly available clinical dataset. The method considers and resolves significant gaps in the previous studies. It incorporates ten classification models, including advanced boosting classifiers, to detect the presence of stroke. The performance of the classifiers is analyzed on all possible subsets of attribute/feature selections concerning five metrics to find the best-performing algorithms. The experimental results demonstrate that the proposed approach achieved the best accuracy on all feature classifications. Overall, this study's main achievement is obtaining a higher percentage (97% accuracy using boosting classifiers) of stroke prognosis than state-of-the-art approaches to stroke dataset. Hence, physicians can use gradient and ensemble boosting-tree-based models that are most suitable for predicting patients' strokes in the real world. Moreover, this investigation also reveals that age, heart disease, glucose level, hypertension, and marital status are the most significant risk factors. At the same time, the remaining attributes are also essential to obtaining the best performance. [JJCIT 2022; 8(3.000): 282-296]
first_indexed 2024-04-11T13:33:58Z
format Article
id doaj.art-1214f424b7914b5da0970d07e3a361bc
institution Directory Open Access Journal
issn 2413-9351
language English
last_indexed 2024-04-11T13:33:58Z
publishDate 2022-09-01
publisher Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
record_format Article
series Jordanian Journal of Computers and Information Technology
spelling doaj.art-1214f424b7914b5da0970d07e3a361bc2022-12-22T04:21:39ZengScientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)Jordanian Journal of Computers and Information Technology2413-93512022-09-018328229610.5455/jjcit.71-165272574635136Risk factor identification for stroke prognosis using machine-learning algorithmsTanvir Ahammad09-10 Chittaranjan Avenue, Dhaka-1100Stroke is a life-threatening condition causing the second-leading number of deaths worldwide. It is a challenging problem in the public health domain of the 21st century for healthcare professionals and researchers. So, proper monitoring of stroke can prevent and reduce its severity. Risk factor analysis is one of the promising approaches for identifying the presence of stroke disease. Numerous researches have focused on forecasting strokes for patients. The majority had a good accuracy ratio, around 90%, on the publicly available dataset. Combining several preprocessing tasks can considerably increase the quality of classifiers, an area of research need. Additionally, the researchers should pinpoint the major risk factors for stroke disease and use advanced classifiers to forecast the likelihood of stroke. This article presents an enhanced approach for identifying the potential risk factors and predicting the incidence of stroke on a publicly available clinical dataset. The method considers and resolves significant gaps in the previous studies. It incorporates ten classification models, including advanced boosting classifiers, to detect the presence of stroke. The performance of the classifiers is analyzed on all possible subsets of attribute/feature selections concerning five metrics to find the best-performing algorithms. The experimental results demonstrate that the proposed approach achieved the best accuracy on all feature classifications. Overall, this study's main achievement is obtaining a higher percentage (97% accuracy using boosting classifiers) of stroke prognosis than state-of-the-art approaches to stroke dataset. Hence, physicians can use gradient and ensemble boosting-tree-based models that are most suitable for predicting patients' strokes in the real world. Moreover, this investigation also reveals that age, heart disease, glucose level, hypertension, and marital status are the most significant risk factors. At the same time, the remaining attributes are also essential to obtaining the best performance. [JJCIT 2022; 8(3.000): 282-296]http://www.ejmanager.com/fulltextpdf.php?mno=35136stroke prediction; machine learning; classification; feature selection; stroke risk factors; healthcare
spellingShingle Tanvir Ahammad
Risk factor identification for stroke prognosis using machine-learning algorithms
Jordanian Journal of Computers and Information Technology
stroke prediction; machine learning; classification; feature selection; stroke risk factors; healthcare
title Risk factor identification for stroke prognosis using machine-learning algorithms
title_full Risk factor identification for stroke prognosis using machine-learning algorithms
title_fullStr Risk factor identification for stroke prognosis using machine-learning algorithms
title_full_unstemmed Risk factor identification for stroke prognosis using machine-learning algorithms
title_short Risk factor identification for stroke prognosis using machine-learning algorithms
title_sort risk factor identification for stroke prognosis using machine learning algorithms
topic stroke prediction; machine learning; classification; feature selection; stroke risk factors; healthcare
url http://www.ejmanager.com/fulltextpdf.php?mno=35136
work_keys_str_mv AT tanvirahammad riskfactoridentificationforstrokeprognosisusingmachinelearningalgorithms