Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine

IntroductionThe COVID-19 pandemic has exacerbated mental health challenges, particularly depression among college students. Detecting at-risk students early is crucial but remains challenging, particularly in developing countries. Utilizing data-driven predictive models presents a viable solution to...

Full description

Bibliographic Details
Main Authors: Lorena Cecilia López Steinmetz, Margarita Sison, Rustam Zhumagambetov, Juan Carlos Godoy, Stefan Haufe
Format: Article
Language:English
Published: Frontiers Media S.A. 2024-04-01
Series:Frontiers in Psychiatry
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpsyt.2024.1376784/full
_version_ 1827281136166895616
author Lorena Cecilia López Steinmetz
Lorena Cecilia López Steinmetz
Margarita Sison
Rustam Zhumagambetov
Juan Carlos Godoy
Stefan Haufe
Stefan Haufe
Stefan Haufe
Stefan Haufe
author_facet Lorena Cecilia López Steinmetz
Lorena Cecilia López Steinmetz
Margarita Sison
Rustam Zhumagambetov
Juan Carlos Godoy
Stefan Haufe
Stefan Haufe
Stefan Haufe
Stefan Haufe
author_sort Lorena Cecilia López Steinmetz
collection DOAJ
description IntroductionThe COVID-19 pandemic has exacerbated mental health challenges, particularly depression among college students. Detecting at-risk students early is crucial but remains challenging, particularly in developing countries. Utilizing data-driven predictive models presents a viable solution to address this pressing need.Aims1) To develop and compare machine learning (ML) models for predicting depression in Argentinean students during the pandemic. 2) To assess the performance of classification and regression models using appropriate metrics. 3) To identify key features driving depression prediction.MethodsA longitudinal dataset (N = 1492 college students) captured T1 and T2 measurements during the Argentinean COVID-19 quarantine. ML models, including linear logistic regression classifiers/ridge regression (LogReg/RR), random forest classifiers/regressors, and support vector machines/regressors (SVM/SVR), are employed. Assessed features encompass depression and anxiety scores (at T1), mental disorder/suicidal behavior history, quarantine sub-period information, sex, and age. For classification, models’ performance on test data is evaluated using Area Under the Precision-Recall Curve (AUPRC), Area Under the Receiver Operating Characteristic curve, Balanced Accuracy, F1 score, and Brier loss. For regression, R-squared (R2), Mean Absolute Error, and Mean Squared Error are assessed. Univariate analyses are conducted to assess the predictive strength of each individual feature with respect to the target variable. The performance of multi- vs univariate models is compared using the mean AUPRC score for classifiers and the R2 score for regressors.ResultsThe highest performance is achieved by SVM and LogReg (e.g., AUPRC: 0.76, 95% CI: 0.69, 0.81) and SVR and RR models (e.g., R2 for SVR and RR: 0.56, 95% CI: 0.45, 0.64 and 0.45, 0.63, respectively). Univariate models, particularly LogReg and SVM using depression (AUPRC: 0.72, 95% CI: 0.64, 0.79) or anxiety scores (AUPRC: 0.71, 95% CI: 0.64, 0.78) and RR using depression scores (R2: 0.48, 95% CI: 0.39, 0.57) exhibit performance levels close to those of the multivariate models, which include all features.DiscussionThese findings highlight the relevance of pre-existing depression and anxiety conditions in predicting depression during quarantine, underscoring their comorbidity. ML models, particularly SVM/SVR and LogReg/RR, demonstrate potential in the timely detection of at-risk students. However, further studies are needed before clinical implementation.
first_indexed 2024-04-24T08:54:10Z
format Article
id doaj.art-3d08726a04c1417ba78e056330d088e2
institution Directory Open Access Journal
issn 1664-0640
language English
last_indexed 2024-04-24T08:54:10Z
publishDate 2024-04-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Psychiatry
spelling doaj.art-3d08726a04c1417ba78e056330d088e22024-04-16T08:16:05ZengFrontiers Media S.A.Frontiers in Psychiatry1664-06402024-04-011510.3389/fpsyt.2024.13767841376784Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantineLorena Cecilia López Steinmetz0Lorena Cecilia López Steinmetz1Margarita Sison2Rustam Zhumagambetov3Juan Carlos Godoy4Stefan Haufe5Stefan Haufe6Stefan Haufe7Stefan Haufe8Inverse Modeling and Machine Learning, Chair of Uncertainty, Institute of Software Engineering and Theoretical Computer Science, Faculty IV Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, GermanyInstituto de Investigaciones Psicológicas (IIPsi), Facultad de Psicología, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de Córdoba (UNC), Córdoba, ArgentinaBerlin Center for Advanced Neuroimaging (BCAN), Charité – Universitätsmedizin Berlin, Berlin, GermanyWorking Group 8.44 Machine Learning and Uncertainty, Mathematical Modelling and Data Analysis Department, Physikalisch-Technische Bundesanstalt Braunschweig und Berlin, Berlin, GermanyInstituto de Investigaciones Psicológicas (IIPsi), Facultad de Psicología, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de Córdoba (UNC), Córdoba, ArgentinaInverse Modeling and Machine Learning, Chair of Uncertainty, Institute of Software Engineering and Theoretical Computer Science, Faculty IV Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, GermanyBerlin Center for Advanced Neuroimaging (BCAN), Charité – Universitätsmedizin Berlin, Berlin, GermanyWorking Group 8.44 Machine Learning and Uncertainty, Mathematical Modelling and Data Analysis Department, Physikalisch-Technische Bundesanstalt Braunschweig und Berlin, Berlin, GermanyInstitute for Medical Informatics, Charité – Universitätsmedizin Berlin, Berlin, GermanyIntroductionThe COVID-19 pandemic has exacerbated mental health challenges, particularly depression among college students. Detecting at-risk students early is crucial but remains challenging, particularly in developing countries. Utilizing data-driven predictive models presents a viable solution to address this pressing need.Aims1) To develop and compare machine learning (ML) models for predicting depression in Argentinean students during the pandemic. 2) To assess the performance of classification and regression models using appropriate metrics. 3) To identify key features driving depression prediction.MethodsA longitudinal dataset (N = 1492 college students) captured T1 and T2 measurements during the Argentinean COVID-19 quarantine. ML models, including linear logistic regression classifiers/ridge regression (LogReg/RR), random forest classifiers/regressors, and support vector machines/regressors (SVM/SVR), are employed. Assessed features encompass depression and anxiety scores (at T1), mental disorder/suicidal behavior history, quarantine sub-period information, sex, and age. For classification, models’ performance on test data is evaluated using Area Under the Precision-Recall Curve (AUPRC), Area Under the Receiver Operating Characteristic curve, Balanced Accuracy, F1 score, and Brier loss. For regression, R-squared (R2), Mean Absolute Error, and Mean Squared Error are assessed. Univariate analyses are conducted to assess the predictive strength of each individual feature with respect to the target variable. The performance of multi- vs univariate models is compared using the mean AUPRC score for classifiers and the R2 score for regressors.ResultsThe highest performance is achieved by SVM and LogReg (e.g., AUPRC: 0.76, 95% CI: 0.69, 0.81) and SVR and RR models (e.g., R2 for SVR and RR: 0.56, 95% CI: 0.45, 0.64 and 0.45, 0.63, respectively). Univariate models, particularly LogReg and SVM using depression (AUPRC: 0.72, 95% CI: 0.64, 0.79) or anxiety scores (AUPRC: 0.71, 95% CI: 0.64, 0.78) and RR using depression scores (R2: 0.48, 95% CI: 0.39, 0.57) exhibit performance levels close to those of the multivariate models, which include all features.DiscussionThese findings highlight the relevance of pre-existing depression and anxiety conditions in predicting depression during quarantine, underscoring their comorbidity. ML models, particularly SVM/SVR and LogReg/RR, demonstrate potential in the timely detection of at-risk students. However, further studies are needed before clinical implementation.https://www.frontiersin.org/articles/10.3389/fpsyt.2024.1376784/fulldepression predictionCOVID-19 pandemicmachine learningclassificationregressioncollege students
spellingShingle Lorena Cecilia López Steinmetz
Lorena Cecilia López Steinmetz
Margarita Sison
Rustam Zhumagambetov
Juan Carlos Godoy
Stefan Haufe
Stefan Haufe
Stefan Haufe
Stefan Haufe
Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
Frontiers in Psychiatry
depression prediction
COVID-19 pandemic
machine learning
classification
regression
college students
title Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
title_full Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
title_fullStr Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
title_full_unstemmed Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
title_short Machine learning models predict the emergence of depression in Argentinean college students during periods of COVID-19 quarantine
title_sort machine learning models predict the emergence of depression in argentinean college students during periods of covid 19 quarantine
topic depression prediction
COVID-19 pandemic
machine learning
classification
regression
college students
url https://www.frontiersin.org/articles/10.3389/fpsyt.2024.1376784/full
work_keys_str_mv AT lorenacecilialopezsteinmetz machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT lorenacecilialopezsteinmetz machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT margaritasison machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT rustamzhumagambetov machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT juancarlosgodoy machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT stefanhaufe machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT stefanhaufe machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT stefanhaufe machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine
AT stefanhaufe machinelearningmodelspredicttheemergenceofdepressioninargentineancollegestudentsduringperiodsofcovid19quarantine