Data pre-processing for cardiovascular disease classification: A systematic literature review

Bibliographic Details
Main Authors: Irfan Javid, Rozaida Ghazali, Muhammad Zulqarnain, Norlida Hassan
Format: Article
Language: English
Published: IOS Press 2023
Subjects: T Technology (General)
Online Access: http://eprints.uthm.edu.my/10369/1/J16041_b24337ed37f8b9671b8ddb6cb8b4efb5.pdf
collection UTHM
description Early detection of disease is an important task in the medical field. Heart disease is among the most challenging of all diseases: 17.3 million people die from it every year. Even a minor error in heart disease diagnosis puts an individual's life at risk, so precise diagnosis is critical. Different approaches, including data mining, have been used to predict heart disease. However, there are serious concerns about data quality, for example inconsistencies, missing values, noise, high dimensionality, and imbalanced data. To improve the accuracy of data-mining-based prediction systems, data preparation techniques have been applied to raise the quality of the data. The main objective of this paper is to highlight and summarize research on (i) the most commonly used data preparation techniques, (ii) the impact of pre-processing procedures on the accuracy of a heart disease prediction system, (iii) classifier performance with data pre-processing techniques, and (iv) a comparison of the accuracy of the different pre-processing models. A systematic literature review of the use of data pre-processing in heart disease diagnosis was carried out on material published from January 2001 to July 2021. Almost 30 studies were selected and examined against the above criteria. The review concludes that data reduction and data cleaning are the pre-processing techniques most often used in heart disease prediction systems. Overall, the study concludes that data pre-processing has improved the accuracy of the models used for heart disease prediction. Some hybrid models, including (ANN+CHI), (ANN+PCA), (DNN+CHI) and (SVM+PCA), have shown improved accuracy. However, due to the lack of clarification, there are a number of limitations and challenges in implementing these models in the real world.
id uthm.eprints-10369
institution Universiti Tun Hussein Onn Malaysia
spelling uthm.eprints-10369 2023-10-30T07:35:24Z http://eprints.uthm.edu.my/10369/ PeerReviewed. Irfan Javid, Rozaida Ghazali, Muhammad Zulqarnain and Norlida Hassan (2023) Data pre-processing for cardiovascular disease classification: A systematic literature review. Journal of Intelligent & Fuzzy Systems, 44. pp. 1525-1545. https://doi.org/10.3233/JIFS-220061
topic T Technology (General)