Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study

BackgroundThe increasing number of patients treated with peritoneal dialysis (PD) and their consistently high rate of hospital admissions have placed a large burden on the health care system. Early clinical interventions and optimal management of patients at a high risk of prolonged length of stay (...

Full description

Bibliographic Details
Main Authors: Kong, Guilan, Wu, Jingyi, Chu, Hong, Yang, Chao, Lin, Yu, Lin, Ke, Shi, Ying, Wang, Haibo, Zhang, Luxia
Format: Article
Language:English
Published: JMIR Publications 2021-05-01
Series:JMIR Medical Informatics
Online Access:https://medinform.jmir.org/2021/5/e17886
_version_ 1819226892426805248
author Kong, Guilan
Wu, Jingyi
Chu, Hong
Yang, Chao
Lin, Yu
Lin, Ke
Shi, Ying
Wang, Haibo
Zhang, Luxia
author_facet Kong, Guilan
Wu, Jingyi
Chu, Hong
Yang, Chao
Lin, Yu
Lin, Ke
Shi, Ying
Wang, Haibo
Zhang, Luxia
author_sort Kong, Guilan
collection DOAJ
description BackgroundThe increasing number of patients treated with peritoneal dialysis (PD) and their consistently high rate of hospital admissions have placed a large burden on the health care system. Early clinical interventions and optimal management of patients at a high risk of prolonged length of stay (pLOS) may help improve the medical efficiency and prognosis of PD-treated patients. If timely clinical interventions are not provided, patients at a high risk of pLOS may face a poor prognosis and high medical expenses, which will also be a burden on hospitals. Therefore, physicians need an effective pLOS prediction model for PD-treated patients. ObjectiveThis study aimed to develop an optimal data-driven model for predicting the pLOS risk of PD-treated patients using basic admission data. MethodsPatient data collected using the Hospital Quality Monitoring System (HQMS) in China were used to develop pLOS prediction models. A stacking model was constructed with support vector machine, random forest (RF), and K-nearest neighbor algorithms as its base models and traditional logistic regression (LR) as its meta-model. The meta-model used the outputs of all 3 base models as input and generated the output of the stacking model. Another LR-based pLOS prediction model was built as the benchmark model. The prediction performance of the stacking model was compared with that of its base models and the benchmark model. Five-fold cross-validation was employed to develop and validate the models. Performance measures included the Brier score, area under the receiver operating characteristic curve (AUROC), estimated calibration index (ECI), accuracy, sensitivity, specificity, and geometric mean (Gm). In addition, a calibration plot was employed to visually demonstrate the calibration power of each model. ResultsThe final cohort extracted from the HQMS database consisted of 23,992 eligible PD-treated patients, among whom 30.3% had a pLOS (ie, longer than the average LOS, which was 16 days in our study). Among the models, the stacking model achieved the best calibration (ECI 8.691), balanced accuracy (Gm 0.690), accuracy (0.695), and specificity (0.701). Meanwhile, the stacking and RF models had the best overall performance (Brier score 0.174 for both) and discrimination (AUROC 0.757 for the stacking model and 0.756 for the RF model). Compared with the benchmark LR model, the stacking model was superior in all performance measures except sensitivity, but there was no significant difference in sensitivity between the 2 models. The 2-sided t tests revealed significant performance differences between the stacking and LR models in overall performance, discrimination, calibration, balanced accuracy, and accuracy. ConclusionsThis study is the first to develop data-driven pLOS prediction models for PD-treated patients using basic admission data from a national database. The results indicate the feasibility of utilizing a stacking-based pLOS prediction model for PD-treated patients. The pLOS prediction tools developed in this study have the potential to assist clinicians in identifying patients at a high risk of pLOS and to allocate resources optimally for PD-treated patients.
first_indexed 2024-12-23T10:32:43Z
format Article
id doaj.art-e3a4f99a74f146ff91c7d7e2ce390011
institution Directory Open Access Journal
issn 2291-9694
language English
last_indexed 2024-12-23T10:32:43Z
publishDate 2021-05-01
publisher JMIR Publications
record_format Article
series JMIR Medical Informatics
spelling doaj.art-e3a4f99a74f146ff91c7d7e2ce3900112022-12-21T17:50:24ZengJMIR PublicationsJMIR Medical Informatics2291-96942021-05-0195e1788610.2196/17886Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation StudyKong, GuilanWu, JingyiChu, HongYang, ChaoLin, YuLin, KeShi, YingWang, HaiboZhang, LuxiaBackgroundThe increasing number of patients treated with peritoneal dialysis (PD) and their consistently high rate of hospital admissions have placed a large burden on the health care system. Early clinical interventions and optimal management of patients at a high risk of prolonged length of stay (pLOS) may help improve the medical efficiency and prognosis of PD-treated patients. If timely clinical interventions are not provided, patients at a high risk of pLOS may face a poor prognosis and high medical expenses, which will also be a burden on hospitals. Therefore, physicians need an effective pLOS prediction model for PD-treated patients. ObjectiveThis study aimed to develop an optimal data-driven model for predicting the pLOS risk of PD-treated patients using basic admission data. MethodsPatient data collected using the Hospital Quality Monitoring System (HQMS) in China were used to develop pLOS prediction models. A stacking model was constructed with support vector machine, random forest (RF), and K-nearest neighbor algorithms as its base models and traditional logistic regression (LR) as its meta-model. The meta-model used the outputs of all 3 base models as input and generated the output of the stacking model. Another LR-based pLOS prediction model was built as the benchmark model. The prediction performance of the stacking model was compared with that of its base models and the benchmark model. Five-fold cross-validation was employed to develop and validate the models. Performance measures included the Brier score, area under the receiver operating characteristic curve (AUROC), estimated calibration index (ECI), accuracy, sensitivity, specificity, and geometric mean (Gm). In addition, a calibration plot was employed to visually demonstrate the calibration power of each model. ResultsThe final cohort extracted from the HQMS database consisted of 23,992 eligible PD-treated patients, among whom 30.3% had a pLOS (ie, longer than the average LOS, which was 16 days in our study). Among the models, the stacking model achieved the best calibration (ECI 8.691), balanced accuracy (Gm 0.690), accuracy (0.695), and specificity (0.701). Meanwhile, the stacking and RF models had the best overall performance (Brier score 0.174 for both) and discrimination (AUROC 0.757 for the stacking model and 0.756 for the RF model). Compared with the benchmark LR model, the stacking model was superior in all performance measures except sensitivity, but there was no significant difference in sensitivity between the 2 models. The 2-sided t tests revealed significant performance differences between the stacking and LR models in overall performance, discrimination, calibration, balanced accuracy, and accuracy. ConclusionsThis study is the first to develop data-driven pLOS prediction models for PD-treated patients using basic admission data from a national database. The results indicate the feasibility of utilizing a stacking-based pLOS prediction model for PD-treated patients. The pLOS prediction tools developed in this study have the potential to assist clinicians in identifying patients at a high risk of pLOS and to allocate resources optimally for PD-treated patients.https://medinform.jmir.org/2021/5/e17886
spellingShingle Kong, Guilan
Wu, Jingyi
Chu, Hong
Yang, Chao
Lin, Yu
Lin, Ke
Shi, Ying
Wang, Haibo
Zhang, Luxia
Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
JMIR Medical Informatics
title Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
title_full Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
title_fullStr Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
title_full_unstemmed Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
title_short Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study
title_sort predicting prolonged length of hospital stay for peritoneal dialysis treated patients using stacked generalization model development and validation study
url https://medinform.jmir.org/2021/5/e17886
work_keys_str_mv AT kongguilan predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT wujingyi predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT chuhong predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT yangchao predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT linyu predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT linke predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT shiying predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT wanghaibo predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy
AT zhangluxia predictingprolongedlengthofhospitalstayforperitonealdialysistreatedpatientsusingstackedgeneralizationmodeldevelopmentandvalidationstudy