Evaluating machine learning for predicting youth suicidal behavior up to 1 year after contact with mental-health specialty care

In this article, we assessed the performance of several predictive modeling algorithms of suicide attempt resulting in inpatient hospitalization or suicide among youths ages 9 to 18 (N = 34,528) after contact (6–12 months) with a mental-health specialist in Stockholm, Sweden, from 2006 to 2012. Usin...

Full description

Bibliographic Details
Main Authors: O’Reilly, LM, Fazel, S, Rickert, ME, Kuja-Halkola, R, Cederlöf, M, Hellner, C, Larsson, H, Lichtenstein, P, D’Onofrio, BM
Format: Journal article
Language:English
Published: SAGE Publications 2024
Description
Summary:In this article, we assessed the performance of several predictive modeling algorithms of suicide attempt resulting in inpatient hospitalization or suicide among youths ages 9 to 18 (N = 34,528) after contact (6–12 months) with a mental-health specialist in Stockholm, Sweden, from 2006 to 2012. Using 209 predictors across domains (e.g., clinical, demographic, family, neighborhood, social) identified from national registers, we applied standard logistic regression, regularized logistic regression, and machine-learning algorithms (i.e., random forests, gradient boosting, support vector machines). Standard logistic regression (area under the receiver operating characteristic curve [AUC] = 0.77, 95% confidence interval [CI] = [0.72, 0.82]) and random-forest models (AUC = 0.80, 95% CI = [0.74, 0.86]) demonstrated the highest AUCs. Sensitivities ranged from 0.33 (support vector machines) to 0.91 (standard logistic regression). Although the study was underpowered to detect a difference between logistic regression and machine-learning algorithms (outcome prevalence = 0.7%), performance metrics were similar across models. Logistic regression is not clearly worse than machine-learning approaches. Ongoing research is needed to examine how prediction models can augment clinical decision-making.