Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables

Cardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-relat...

Full description

Bibliographic Details
Main Authors: Agustin Martin-Morales, Masaki Yamamoto, Mai Inoue, Thien Vu, Research Dawadi, Michihiro Araki
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Nutrients
Subjects:
Online Access:https://www.mdpi.com/2072-6643/15/18/3937
_version_ 1797578302222761984
author Agustin Martin-Morales
Masaki Yamamoto
Mai Inoue
Thien Vu
Research Dawadi
Michihiro Araki
author_facet Agustin Martin-Morales
Masaki Yamamoto
Mai Inoue
Thien Vu
Research Dawadi
Michihiro Araki
author_sort Agustin Martin-Morales
collection DOAJ
description Cardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-related health data, and a combination of both. Machine learning (ML) models, particularly the random forest algorithm, demonstrated robust consistency across health, nutrition, and mixed categories in predicting death from CVD. Shapley additive explanation (SHAP) values showed age, systolic blood pressure, and several other health factors as crucial variables, while fiber, calcium, and vitamin E, among others, were significant nutritional variables. Our research emphasizes the importance of comprehensive health evaluation and dietary intake in predicting CVD mortality. The inclusion of nutrition variables improved the performance of our models, underscoring the utility of dietary intake in ML-based data analysis. Further investigation using large datasets with recurring dietary recalls is necessary to enhance the effectiveness and interpretability of such models.
first_indexed 2024-03-10T22:20:00Z
format Article
id doaj.art-18fe8945d0914a83b75fee35473095c8
institution Directory Open Access Journal
issn 2072-6643
language English
last_indexed 2024-03-10T22:20:00Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Nutrients
spelling doaj.art-18fe8945d0914a83b75fee35473095c82023-11-19T12:17:58ZengMDPI AGNutrients2072-66432023-09-011518393710.3390/nu15183937Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition VariablesAgustin Martin-Morales0Masaki Yamamoto1Mai Inoue2Thien Vu3Research Dawadi4Michihiro Araki5Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanCardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-related health data, and a combination of both. Machine learning (ML) models, particularly the random forest algorithm, demonstrated robust consistency across health, nutrition, and mixed categories in predicting death from CVD. Shapley additive explanation (SHAP) values showed age, systolic blood pressure, and several other health factors as crucial variables, while fiber, calcium, and vitamin E, among others, were significant nutritional variables. Our research emphasizes the importance of comprehensive health evaluation and dietary intake in predicting CVD mortality. The inclusion of nutrition variables improved the performance of our models, underscoring the utility of dietary intake in ML-based data analysis. Further investigation using large datasets with recurring dietary recalls is necessary to enhance the effectiveness and interpretability of such models.https://www.mdpi.com/2072-6643/15/18/3937machine learningcardiovascular diseaseprediction modelnutritiondietary featuresSHAP
spellingShingle Agustin Martin-Morales
Masaki Yamamoto
Mai Inoue
Thien Vu
Research Dawadi
Michihiro Araki
Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
Nutrients
machine learning
cardiovascular disease
prediction model
nutrition
dietary features
SHAP
title Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
title_full Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
title_fullStr Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
title_full_unstemmed Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
title_short Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
title_sort predicting cardiovascular disease mortality leveraging machine learning for comprehensive assessment of health and nutrition variables
topic machine learning
cardiovascular disease
prediction model
nutrition
dietary features
SHAP
url https://www.mdpi.com/2072-6643/15/18/3937
work_keys_str_mv AT agustinmartinmorales predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables
AT masakiyamamoto predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables
AT maiinoue predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables
AT thienvu predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables
AT researchdawadi predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables
AT michihiroaraki predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables