Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables
Cardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-relat...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-09-01
|
Series: | Nutrients |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-6643/15/18/3937 |
_version_ | 1797578302222761984 |
---|---|
author | Agustin Martin-Morales Masaki Yamamoto Mai Inoue Thien Vu Research Dawadi Michihiro Araki |
author_facet | Agustin Martin-Morales Masaki Yamamoto Mai Inoue Thien Vu Research Dawadi Michihiro Araki |
author_sort | Agustin Martin-Morales |
collection | DOAJ |
description | Cardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-related health data, and a combination of both. Machine learning (ML) models, particularly the random forest algorithm, demonstrated robust consistency across health, nutrition, and mixed categories in predicting death from CVD. Shapley additive explanation (SHAP) values showed age, systolic blood pressure, and several other health factors as crucial variables, while fiber, calcium, and vitamin E, among others, were significant nutritional variables. Our research emphasizes the importance of comprehensive health evaluation and dietary intake in predicting CVD mortality. The inclusion of nutrition variables improved the performance of our models, underscoring the utility of dietary intake in ML-based data analysis. Further investigation using large datasets with recurring dietary recalls is necessary to enhance the effectiveness and interpretability of such models. |
first_indexed | 2024-03-10T22:20:00Z |
format | Article |
id | doaj.art-18fe8945d0914a83b75fee35473095c8 |
institution | Directory Open Access Journal |
issn | 2072-6643 |
language | English |
last_indexed | 2024-03-10T22:20:00Z |
publishDate | 2023-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Nutrients |
spelling | doaj.art-18fe8945d0914a83b75fee35473095c82023-11-19T12:17:58ZengMDPI AGNutrients2072-66432023-09-011518393710.3390/nu15183937Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition VariablesAgustin Martin-Morales0Masaki Yamamoto1Mai Inoue2Thien Vu3Research Dawadi4Michihiro Araki5Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanArtificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-shinmachi, Settsu 566-0002, JapanCardiovascular disease (CVD) is one of the primary causes of death around the world. This study aimed to identify risk factors associated with CVD mortality using data from the National Health and Nutrition Examination Survey (NHANES). We created three models focusing on dietary data, non-diet-related health data, and a combination of both. Machine learning (ML) models, particularly the random forest algorithm, demonstrated robust consistency across health, nutrition, and mixed categories in predicting death from CVD. Shapley additive explanation (SHAP) values showed age, systolic blood pressure, and several other health factors as crucial variables, while fiber, calcium, and vitamin E, among others, were significant nutritional variables. Our research emphasizes the importance of comprehensive health evaluation and dietary intake in predicting CVD mortality. The inclusion of nutrition variables improved the performance of our models, underscoring the utility of dietary intake in ML-based data analysis. Further investigation using large datasets with recurring dietary recalls is necessary to enhance the effectiveness and interpretability of such models.https://www.mdpi.com/2072-6643/15/18/3937machine learningcardiovascular diseaseprediction modelnutritiondietary featuresSHAP |
spellingShingle | Agustin Martin-Morales Masaki Yamamoto Mai Inoue Thien Vu Research Dawadi Michihiro Araki Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables Nutrients machine learning cardiovascular disease prediction model nutrition dietary features SHAP |
title | Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables |
title_full | Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables |
title_fullStr | Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables |
title_full_unstemmed | Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables |
title_short | Predicting Cardiovascular Disease Mortality: Leveraging Machine Learning for Comprehensive Assessment of Health and Nutrition Variables |
title_sort | predicting cardiovascular disease mortality leveraging machine learning for comprehensive assessment of health and nutrition variables |
topic | machine learning cardiovascular disease prediction model nutrition dietary features SHAP |
url | https://www.mdpi.com/2072-6643/15/18/3937 |
work_keys_str_mv | AT agustinmartinmorales predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables AT masakiyamamoto predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables AT maiinoue predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables AT thienvu predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables AT researchdawadi predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables AT michihiroaraki predictingcardiovasculardiseasemortalityleveragingmachinelearningforcomprehensiveassessmentofhealthandnutritionvariables |