Evaluation of nutritional status and clinical depression classification using an explainable machine learning method

IntroductionDepression is a prevalent disorder worldwide, with potentially severe implications. It contributes significantly to an increased risk of diseases associated with multiple risk factors. Early accurate diagnosis of depressive symptoms is a critical first step toward management, interventio...

Full description

Bibliographic Details
Main Authors:	Payam Hosseinzadeh Kasani, Jung Eun Lee, Chihyun Park, Cheol-Heui Yun, Jae-Won Jang, Sang-Ah Lee
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2023-05-01
Series:	Frontiers in Nutrition
Subjects:	depression nutrition machine learning classification interpretability clinical depression
Online Access:	https://www.frontiersin.org/articles/10.3389/fnut.2023.1165854/full

_version_	1797830997742452736
author	Payam Hosseinzadeh Kasani Payam Hosseinzadeh Kasani Jung Eun Lee Chihyun Park Chihyun Park Cheol-Heui Yun Cheol-Heui Yun Jae-Won Jang Jae-Won Jang Sang-Ah Lee Sang-Ah Lee
author_facet	Payam Hosseinzadeh Kasani Payam Hosseinzadeh Kasani Jung Eun Lee Chihyun Park Chihyun Park Cheol-Heui Yun Cheol-Heui Yun Jae-Won Jang Jae-Won Jang Sang-Ah Lee Sang-Ah Lee
author_sort	Payam Hosseinzadeh Kasani
collection	DOAJ
description	IntroductionDepression is a prevalent disorder worldwide, with potentially severe implications. It contributes significantly to an increased risk of diseases associated with multiple risk factors. Early accurate diagnosis of depressive symptoms is a critical first step toward management, intervention, and prevention. Various nutritional and dietary compounds have been suggested to be involved in the onset, maintenance, and severity of depressive disorders. Despite the challenges to better understanding the association between nutritional risk factors and the occurrence of depression, assessing the interplay of these markers through supervised machine learning remains to be fully explored.MethodsThis study aimed to determine the ability of machine learning-based decision support methods to identify the presence of depression using publicly available health data from the Korean National Health and Nutrition Examination Survey. Two exploration techniques, namely, uniform manifold approximation and projection and Pearson correlation, were performed for explanatory analysis among datasets. A grid search optimization with cross-validation was performed to fine-tune the models for classifying depression with the highest accuracy. Several performance measures, including accuracy, precision, recall, F1 score, confusion matrix, areas under the precision-recall and receiver operating characteristic curves, and calibration plot, were used to compare classifier performances. We further investigated the importance of the features provided: visualized interpretation using ELI5, partial dependence plots, and local interpretable using model-agnostic explanations and Shapley additive explanation for the prediction at both the population and individual levels.ResultsThe best model achieved an accuracy of 86.18% for XGBoost and an area under the curve of 84.96% for the random forest model in original dataset and the XGBoost algorithm with an accuracy of 86.02% and an area under the curve of 85.34% in the quantile-based dataset. The explainable results revealed a complementary observation of the relative changes in feature values, and, thus, the importance of emergent depression risks could be identified.DiscussionThe strength of our approach is the large sample size used for training with a fine-tuned model. The machine learning-based analysis showed that the hyper-tuned model has empirically higher accuracy in classifying patients with depressive disorder, as evidenced by the set of interpretable experiments, and can be an effective solution for disease control.
first_indexed	2024-04-09T13:45:26Z
format	Article
id	doaj.art-e75740c9e5e4471ea9b9215794a807b5
institution	Directory Open Access Journal
issn	2296-861X
language	English
last_indexed	2024-04-09T13:45:26Z
publishDate	2023-05-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Nutrition
spelling	doaj.art-e75740c9e5e4471ea9b9215794a807b52023-05-09T05:12:21ZengFrontiers Media S.A.Frontiers in Nutrition2296-861X2023-05-011010.3389/fnut.2023.11658541165854Evaluation of nutritional status and clinical depression classification using an explainable machine learning methodPayam Hosseinzadeh Kasani0Payam Hosseinzadeh Kasani1Jung Eun Lee2Chihyun Park3Chihyun Park4Cheol-Heui Yun5Cheol-Heui Yun6Jae-Won Jang7Jae-Won Jang8Sang-Ah Lee9Sang-Ah Lee10Department of Neurology, Kangwon National University Hospital, Chuncheon, Republic of KoreaInterdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of KoreaInterdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of KoreaInterdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of KoreaDepartment of Computer Science and Engineering, Kangwon National University, Chuncheon, Republic of KoreaDepartment of Agricultural Biotechnology, Seoul National University, Seoul, Republic of KoreaResearch Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of KoreaDepartment of Neurology, Kangwon National University Hospital, Chuncheon, Republic of KoreaDepartment of Neurology, Kangwon National University School of Medicine, Chuncheon, Republic of KoreaInterdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of KoreaDepartment of Preventive Medicine, College of Medicine, Kangwon National University, Chuncheon, Republic of KoreaIntroductionDepression is a prevalent disorder worldwide, with potentially severe implications. It contributes significantly to an increased risk of diseases associated with multiple risk factors. Early accurate diagnosis of depressive symptoms is a critical first step toward management, intervention, and prevention. Various nutritional and dietary compounds have been suggested to be involved in the onset, maintenance, and severity of depressive disorders. Despite the challenges to better understanding the association between nutritional risk factors and the occurrence of depression, assessing the interplay of these markers through supervised machine learning remains to be fully explored.MethodsThis study aimed to determine the ability of machine learning-based decision support methods to identify the presence of depression using publicly available health data from the Korean National Health and Nutrition Examination Survey. Two exploration techniques, namely, uniform manifold approximation and projection and Pearson correlation, were performed for explanatory analysis among datasets. A grid search optimization with cross-validation was performed to fine-tune the models for classifying depression with the highest accuracy. Several performance measures, including accuracy, precision, recall, F1 score, confusion matrix, areas under the precision-recall and receiver operating characteristic curves, and calibration plot, were used to compare classifier performances. We further investigated the importance of the features provided: visualized interpretation using ELI5, partial dependence plots, and local interpretable using model-agnostic explanations and Shapley additive explanation for the prediction at both the population and individual levels.ResultsThe best model achieved an accuracy of 86.18% for XGBoost and an area under the curve of 84.96% for the random forest model in original dataset and the XGBoost algorithm with an accuracy of 86.02% and an area under the curve of 85.34% in the quantile-based dataset. The explainable results revealed a complementary observation of the relative changes in feature values, and, thus, the importance of emergent depression risks could be identified.DiscussionThe strength of our approach is the large sample size used for training with a fine-tuned model. The machine learning-based analysis showed that the hyper-tuned model has empirically higher accuracy in classifying patients with depressive disorder, as evidenced by the set of interpretable experiments, and can be an effective solution for disease control.https://www.frontiersin.org/articles/10.3389/fnut.2023.1165854/fulldepressionnutritionmachine learningclassificationinterpretabilityclinical depression
spellingShingle	Payam Hosseinzadeh Kasani Payam Hosseinzadeh Kasani Jung Eun Lee Chihyun Park Chihyun Park Cheol-Heui Yun Cheol-Heui Yun Jae-Won Jang Jae-Won Jang Sang-Ah Lee Sang-Ah Lee Evaluation of nutritional status and clinical depression classification using an explainable machine learning method Frontiers in Nutrition depression nutrition machine learning classification interpretability clinical depression
title	Evaluation of nutritional status and clinical depression classification using an explainable machine learning method
title_full	Evaluation of nutritional status and clinical depression classification using an explainable machine learning method
title_fullStr	Evaluation of nutritional status and clinical depression classification using an explainable machine learning method
title_full_unstemmed	Evaluation of nutritional status and clinical depression classification using an explainable machine learning method
title_short	Evaluation of nutritional status and clinical depression classification using an explainable machine learning method
title_sort	evaluation of nutritional status and clinical depression classification using an explainable machine learning method
topic	depression nutrition machine learning classification interpretability clinical depression
url	https://www.frontiersin.org/articles/10.3389/fnut.2023.1165854/full
work_keys_str_mv	AT payamhosseinzadehkasani evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT payamhosseinzadehkasani evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT jungeunlee evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT chihyunpark evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT chihyunpark evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT cheolheuiyun evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT cheolheuiyun evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT jaewonjang evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT jaewonjang evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT sangahlee evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod AT sangahlee evaluationofnutritionalstatusandclinicaldepressionclassificationusinganexplainablemachinelearningmethod

Evaluation of nutritional status and clinical depression classification using an explainable machine learning method

Similar Items