Care-needs level prediction for elderly long-term care using insurance claims data
Background and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-ter...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2023-01-01
|
Series: | Informatics in Medicine Unlocked |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352914823001673 |
_version_ | 1797690922165600256 |
---|---|
author | Hiroaki Fukunishi Yasuki Kobayashi |
author_facet | Hiroaki Fukunishi Yasuki Kobayashi |
author_sort | Hiroaki Fukunishi |
collection | DOAJ |
description | Background and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-term care within the next three years for persons aged over 75 years. Methods: The prediction model was constructed using features extracted from long-term care and healthcare insurance claims data. The study subjects were a total of 47,862 elderly individuals who had not received long-term care services in a large city in Japan. The prediction classes for outcome variable were categorized according to the criteria of the Japanese long-term care system: class 0 (no required), class 1 (support levels 1 and 2), class 2 (care levels 1 and 2), and class 3 (care levels 3–5). As explanatory variables, a total of 516 features were used, including age, sex, and 514 diseases classified under ICD-10. In this study, we focused on constructing a prediction model with the interpretability and adopted multinomial logistic regression (MLR) with L2 regularization as a machine learning algorithm. MLR allowed us to identify the characteristics influencing each prediction class of care-needs levels. Results: In terms of overall predictive performance, MLR achieved weighted average precision, recall, F-value, and lift scores of 0.694, 0.505, 0.567, and 1.333, respectively. Compared to other machine learning algorithms, MLR demonstrated comparable performance to Support Vector Machine (SVM) and Random Forest (RF). From the factor analysis based on the magnitudes of coefficients of the MLR model, the top three features influencing each prediction class were as follows: class1: female sex, hypertension, and gonarthrosis; class 2: age, Alzheimer-type dementia, and neuromuscular dysfunction of the bladder; class 3: age, Alzheimer-type dementia, and type 2 diabetes mellitus. Conclusions: In practical terms, the care-needs level prediction can be applied by local governments to identify high-risk areas by comprehensively and routinely predicting insured persons under public health insurance and long-term care insurance systems. |
first_indexed | 2024-03-12T02:06:04Z |
format | Article |
id | doaj.art-57dd83372b3e450aa64c69741b490cc1 |
institution | Directory Open Access Journal |
issn | 2352-9148 |
language | English |
last_indexed | 2024-03-12T02:06:04Z |
publishDate | 2023-01-01 |
publisher | Elsevier |
record_format | Article |
series | Informatics in Medicine Unlocked |
spelling | doaj.art-57dd83372b3e450aa64c69741b490cc12023-09-07T04:44:14ZengElsevierInformatics in Medicine Unlocked2352-91482023-01-0141101321Care-needs level prediction for elderly long-term care using insurance claims dataHiroaki Fukunishi0Yasuki Kobayashi1School of Computer Science, Tokyo University of Technology, Hachioji City, Japan; Corresponding author. 1404-1 Katakuramachi, Hachioji City, Tokyo, 192-0982, Japan.Department of Public Health, Graduate School of Medicine, The University of Tokyo, Tokyo, JapanBackground and objective: Owing to an aging population, the increase in the number of elderly people certified as requiring long-term care has become a critical social issue in Japan. This study aimed to construct a machine learning model predicting the maximum care-needs level required for long-term care within the next three years for persons aged over 75 years. Methods: The prediction model was constructed using features extracted from long-term care and healthcare insurance claims data. The study subjects were a total of 47,862 elderly individuals who had not received long-term care services in a large city in Japan. The prediction classes for outcome variable were categorized according to the criteria of the Japanese long-term care system: class 0 (no required), class 1 (support levels 1 and 2), class 2 (care levels 1 and 2), and class 3 (care levels 3–5). As explanatory variables, a total of 516 features were used, including age, sex, and 514 diseases classified under ICD-10. In this study, we focused on constructing a prediction model with the interpretability and adopted multinomial logistic regression (MLR) with L2 regularization as a machine learning algorithm. MLR allowed us to identify the characteristics influencing each prediction class of care-needs levels. Results: In terms of overall predictive performance, MLR achieved weighted average precision, recall, F-value, and lift scores of 0.694, 0.505, 0.567, and 1.333, respectively. Compared to other machine learning algorithms, MLR demonstrated comparable performance to Support Vector Machine (SVM) and Random Forest (RF). From the factor analysis based on the magnitudes of coefficients of the MLR model, the top three features influencing each prediction class were as follows: class1: female sex, hypertension, and gonarthrosis; class 2: age, Alzheimer-type dementia, and neuromuscular dysfunction of the bladder; class 3: age, Alzheimer-type dementia, and type 2 diabetes mellitus. Conclusions: In practical terms, the care-needs level prediction can be applied by local governments to identify high-risk areas by comprehensively and routinely predicting insured persons under public health insurance and long-term care insurance systems.http://www.sciencedirect.com/science/article/pii/S2352914823001673Long-term careClaims dataMachine learningInterpretable prediction modelCare-needs level prediction |
spellingShingle | Hiroaki Fukunishi Yasuki Kobayashi Care-needs level prediction for elderly long-term care using insurance claims data Informatics in Medicine Unlocked Long-term care Claims data Machine learning Interpretable prediction model Care-needs level prediction |
title | Care-needs level prediction for elderly long-term care using insurance claims data |
title_full | Care-needs level prediction for elderly long-term care using insurance claims data |
title_fullStr | Care-needs level prediction for elderly long-term care using insurance claims data |
title_full_unstemmed | Care-needs level prediction for elderly long-term care using insurance claims data |
title_short | Care-needs level prediction for elderly long-term care using insurance claims data |
title_sort | care needs level prediction for elderly long term care using insurance claims data |
topic | Long-term care Claims data Machine learning Interpretable prediction model Care-needs level prediction |
url | http://www.sciencedirect.com/science/article/pii/S2352914823001673 |
work_keys_str_mv | AT hiroakifukunishi careneedslevelpredictionforelderlylongtermcareusinginsuranceclaimsdata AT yasukikobayashi careneedslevelpredictionforelderlylongtermcareusinginsuranceclaimsdata |