A model to forecast the two-year variation of subjective wellbeing in the elderly population

Abstract Background The ageing global population presents significant public health challenges, especially in relation to the subjective wellbeing of the elderly. In this study, our aim was to investigate the potential for developing a model to forecast the two-year variation of the perceived wellbe...

Full description

Bibliographic Details
Main Authors: Isotta Trescato, Chiara Roversi, Martina Vettoretti, Barbara Di Camillo, Andrea Facchinetti
Format: Article
Language:English
Published: BMC 2023-11-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-023-02360-8
Description
Summary:Abstract Background The ageing global population presents significant public health challenges, especially in relation to the subjective wellbeing of the elderly. In this study, our aim was to investigate the potential for developing a model to forecast the two-year variation of the perceived wellbeing of individuals aged over 50. We also aimed to identify the variables that predict changes in subjective wellbeing, as measured by the CASP-12 scale, over a two-year period. Methods Data from the European SHARE project were used, specifically the demographic, health, social and financial variables of 9422 subjects. The subjective wellbeing was measured through the CASP-12 scale. The study outcome was defined as binary, i.e., worsening/not worsening of the variation of CASP-12 in 2 years. Logistic regression, logistic regression with LASSO regularisation, and random forest were considered candidate models. Performance was assessed in terms of accuracy in correctly predicting the outcome, Area Under the Curve (AUC), and F1 score. Results The best-performing model was the random forest, achieving an accuracy of 65%, AUC = 0.659, and F1 = 0.710. All models proved to be able to generalise both across subjects and over time. The most predictive variables were the CASP-12 score at baseline, the presence of depression and financial difficulties. Conclusions While we identify the random forest model as the more suitable, given the similarity of performance, the models based on logistic regression or on logistic regression with LASSO regularisation are also possible options.
ISSN:1472-6947