Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients

Introduction. Patients with diabetes are exposed to various cardiovascular risk factors, which lead to an increased risk of cardiac complications. Therefore, the development of a diagnostic system for diabetes and cardiovascular disease (CVD) is a relevant research task. In addition, the identificat...

Full description

Bibliographic Details
Main Authors: A. Mayya, H. Solieman
Format: Article
Language:Russian
Published: Saint Petersburg Electrotechnical University "LETI" 2022-09-01
Series:Известия высших учебных заведений России: Радиоэлектроника
Subjects:
Online Access:https://re.eltech.ru/jour/article/view/667
_version_ 1797874564609343488
author A. Mayya
H. Solieman
author_facet A. Mayya
H. Solieman
author_sort A. Mayya
collection DOAJ
description Introduction. Patients with diabetes are exposed to various cardiovascular risk factors, which lead to an increased risk of cardiac complications. Therefore, the development of a diagnostic system for diabetes and cardiovascular disease (CVD) is a relevant research task. In addition, the identification of the most significant indicators of both diseases may help physicians improve treatment, speed the diagnosis, and decrease its computational costs.Aim. To classify subjects with different diabetes types, predict the risk of cardiovascular diseases in diabetic patients using machine learning methods by finding the correlational indicators.Materials and methods. The NHANES database was used following preprocessing and balancing its data. Machine learning methods were used to classify diabetes based on physical examination data and laboratory data. Feature selection methods were used to derive the most significant indicators for predicting CVD risk in diabetic patients. Performance optimization of the developed classification and prediction models was carried out based on different evaluation metrics.Results. The developed model (Random Forest) achieved the accuracy of 93.1 % (based on laboratory data) and 88 % (based on pysicical examination plus laboratory data). The top five most common predictors in diabetes and prediabetes were found to be glycohemoglobin, basophil count, triglyceride level, waist size, and body mass index (BMI). These results seem logical, since glycohemoglobin is commonly used to check the amount of glucose (sugar) bound to the hemoglobin in the red blood cells. For CVD patients, the most common predictors inlcude eosinophil count (indicative of blood diseases), gamma-glutamyl transferase (GGT), glycohemoglobin, overall oral health, and hand stiffness.Conclusion. Balancing the dataset and deleting NaN values improved the performance of the developed models. The RFC and XGBoost models achieved higher accuracy using gradient descending order to minimize the loss function. The final prediction is made using a weighted majority vote of all the decisions. The result was an automated system for predicting CVD risk in diabetic patients.
first_indexed 2024-04-10T01:32:59Z
format Article
id doaj.art-c24d481e545d43fbb0662f41b8e1c34c
institution Directory Open Access Journal
issn 1993-8985
2658-4794
language Russian
last_indexed 2024-04-10T01:32:59Z
publishDate 2022-09-01
publisher Saint Petersburg Electrotechnical University "LETI"
record_format Article
series Известия высших учебных заведений России: Радиоэлектроника
spelling doaj.art-c24d481e545d43fbb0662f41b8e1c34c2023-03-13T09:20:24ZrusSaint Petersburg Electrotechnical University "LETI"Известия высших учебных заведений России: Радиоэлектроника1993-89852658-47942022-09-0125411612210.32603/1993-8985-2022-25-4-116-122464Machine Learning System for Predicting Cardiovascular Disorders in Diabetic PatientsA. Mayya0H. Solieman1Saint Petersburg Electrotechnical University; Tishreen UniversitySaint Petersburg Electrotechnical University; Tishreen UniversityIntroduction. Patients with diabetes are exposed to various cardiovascular risk factors, which lead to an increased risk of cardiac complications. Therefore, the development of a diagnostic system for diabetes and cardiovascular disease (CVD) is a relevant research task. In addition, the identification of the most significant indicators of both diseases may help physicians improve treatment, speed the diagnosis, and decrease its computational costs.Aim. To classify subjects with different diabetes types, predict the risk of cardiovascular diseases in diabetic patients using machine learning methods by finding the correlational indicators.Materials and methods. The NHANES database was used following preprocessing and balancing its data. Machine learning methods were used to classify diabetes based on physical examination data and laboratory data. Feature selection methods were used to derive the most significant indicators for predicting CVD risk in diabetic patients. Performance optimization of the developed classification and prediction models was carried out based on different evaluation metrics.Results. The developed model (Random Forest) achieved the accuracy of 93.1 % (based on laboratory data) and 88 % (based on pysicical examination plus laboratory data). The top five most common predictors in diabetes and prediabetes were found to be glycohemoglobin, basophil count, triglyceride level, waist size, and body mass index (BMI). These results seem logical, since glycohemoglobin is commonly used to check the amount of glucose (sugar) bound to the hemoglobin in the red blood cells. For CVD patients, the most common predictors inlcude eosinophil count (indicative of blood diseases), gamma-glutamyl transferase (GGT), glycohemoglobin, overall oral health, and hand stiffness.Conclusion. Balancing the dataset and deleting NaN values improved the performance of the developed models. The RFC and XGBoost models achieved higher accuracy using gradient descending order to minimize the loss function. The final prediction is made using a weighted majority vote of all the decisions. The result was an automated system for predicting CVD risk in diabetic patients.https://re.eltech.ru/jour/article/view/667cardiovascular disordersdiabetesmachine learningpreprocessingfeature selectionmethods evaluationcorrelational analysis
spellingShingle A. Mayya
H. Solieman
Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
Известия высших учебных заведений России: Радиоэлектроника
cardiovascular disorders
diabetes
machine learning
preprocessing
feature selection
methods evaluation
correlational analysis
title Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
title_full Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
title_fullStr Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
title_full_unstemmed Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
title_short Machine Learning System for Predicting Cardiovascular Disorders in Diabetic Patients
title_sort machine learning system for predicting cardiovascular disorders in diabetic patients
topic cardiovascular disorders
diabetes
machine learning
preprocessing
feature selection
methods evaluation
correlational analysis
url https://re.eltech.ru/jour/article/view/667
work_keys_str_mv AT amayya machinelearningsystemforpredictingcardiovasculardisordersindiabeticpatients
AT hsolieman machinelearningsystemforpredictingcardiovasculardisordersindiabeticpatients