Care-needs level prediction for elderly long-term care using insurance claims data

Background and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-ter...

Full description

Bibliographic Details
Main Authors: Hiroaki Fukunishi, Yasuki Kobayashi
Format: Article
Language:English
Published: Elsevier 2023-01-01
Series:Informatics in Medicine Unlocked
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352914823001673
_version_ 1797690922165600256
author Hiroaki Fukunishi
Yasuki Kobayashi
author_facet Hiroaki Fukunishi
Yasuki Kobayashi
author_sort Hiroaki Fukunishi
collection DOAJ
description Background and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-term care within the next three years for persons aged over 75 years. Methods: The prediction model was constructed using features extracted from long-term care and healthcare insurance claims data. The study subjects were a total of 47,862 elderly individuals who had not received long-term care services in a large city in Japan. The prediction classes for outcome variable were categorized according to the criteria of the Japanese long-term care system: class 0 (no required), class 1 (support levels 1 and 2), class 2 (care levels 1 and 2), and class 3 (care levels 3–5). As explanatory variables, a total of 516 features were used, including age, sex, and 514 diseases classified under ICD-10. In this study, we focused on constructing a prediction model with the interpretability and adopted multinomial logistic regression (MLR) with L2 regularization as a machine learning algorithm. MLR allowed us to identify the characteristics influencing each prediction class of care-needs levels. Results: In terms of overall predictive performance, MLR achieved weighted average precision, recall, F-value, and lift scores of 0.694, 0.505, 0.567, and 1.333, respectively. Compared to other machine learning algorithms, MLR demonstrated comparable performance to Support Vector Machine (SVM) and Random Forest (RF). From the factor analysis based on the magnitudes of coefficients of the MLR model, the top three features influencing each prediction class were as follows: class1: female sex, hypertension, and gonarthrosis; class 2: age, Alzheimer-type dementia, and neuromuscular dysfunction of the bladder; class 3: age, Alzheimer-type dementia, and type 2 diabetes mellitus. Conclusions: In practical terms, the care-needs level prediction can be applied by local governments to identify high-risk areas by comprehensively and routinely predicting insured persons under public health insurance and long-term care insurance systems.
first_indexed 2024-03-12T02:06:04Z
format Article
id doaj.art-57dd83372b3e450aa64c69741b490cc1
institution Directory Open Access Journal
issn 2352-9148
language English
last_indexed 2024-03-12T02:06:04Z
publishDate 2023-01-01
publisher Elsevier
record_format Article
series Informatics in Medicine Unlocked
spelling doaj.art-57dd83372b3e450aa64c69741b490cc12023-09-07T04:44:14ZengElsevierInformatics in Medicine Unlocked2352-91482023-01-0141101321Care-needs level prediction for elderly long-term care using insurance claims dataHiroaki Fukunishi0Yasuki Kobayashi1School of Computer Science, Tokyo University of Technology, Hachioji City, Japan; Corresponding author. 1404-1 Katakuramachi, Hachioji City, Tokyo, 192-0982, Japan.Department of Public Health, Graduate School of Medicine, The University of Tokyo, Tokyo, JapanBackground and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-term care within the next three years for persons aged over 75 years. Methods: The prediction model was constructed using features extracted from long-term care and healthcare insurance claims data. The study subjects were a total of 47,862 elderly individuals who had not received long-term care services in a large city in Japan. The prediction classes for outcome variable were categorized according to the criteria of the Japanese long-term care system: class 0 (no required), class 1 (support levels 1 and 2), class 2 (care levels 1 and 2), and class 3 (care levels 3–5). As explanatory variables, a total of 516 features were used, including age, sex, and 514 diseases classified under ICD-10. In this study, we focused on constructing a prediction model with the interpretability and adopted multinomial logistic regression (MLR) with L2 regularization as a machine learning algorithm. MLR allowed us to identify the characteristics influencing each prediction class of care-needs levels. Results: In terms of overall predictive performance, MLR achieved weighted average precision, recall, F-value, and lift scores of 0.694, 0.505, 0.567, and 1.333, respectively. Compared to other machine learning algorithms, MLR demonstrated comparable performance to Support Vector Machine (SVM) and Random Forest (RF). From the factor analysis based on the magnitudes of coefficients of the MLR model, the top three features influencing each prediction class were as follows: class1: female sex, hypertension, and gonarthrosis; class 2: age, Alzheimer-type dementia, and neuromuscular dysfunction of the bladder; class 3: age, Alzheimer-type dementia, and type 2 diabetes mellitus. Conclusions: In practical terms, the care-needs level prediction can be applied by local governments to identify high-risk areas by comprehensively and routinely predicting insured persons under public health insurance and long-term care insurance systems.http://www.sciencedirect.com/science/article/pii/S2352914823001673Long-term careClaims dataMachine learningInterpretable prediction modelCare-needs level prediction
spellingShingle Hiroaki Fukunishi
Yasuki Kobayashi
Care-needs level prediction for elderly long-term care using insurance claims data
Informatics in Medicine Unlocked
Long-term care
Claims data
Machine learning
Interpretable prediction model
Care-needs level prediction
title Care-needs level prediction for elderly long-term care using insurance claims data
title_full Care-needs level prediction for elderly long-term care using insurance claims data
title_fullStr Care-needs level prediction for elderly long-term care using insurance claims data
title_full_unstemmed Care-needs level prediction for elderly long-term care using insurance claims data
title_short Care-needs level prediction for elderly long-term care using insurance claims data
title_sort care needs level prediction for elderly long term care using insurance claims data
topic Long-term care
Claims data
Machine learning
Interpretable prediction model
Care-needs level prediction
url http://www.sciencedirect.com/science/article/pii/S2352914823001673
work_keys_str_mv AT hiroakifukunishi careneedslevelpredictionforelderlylongtermcareusinginsuranceclaimsdata
AT yasukikobayashi careneedslevelpredictionforelderlylongtermcareusinginsuranceclaimsdata