Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost

Abstract Background Sepsis is a significant cause of mortality in-hospital, especially in ICU patients. Early prediction of sepsis is essential, as prompt and appropriate treatment can improve survival outcomes. Machine learning methods are flexible prediction algorithms with potential advantages ov...

Full description

Bibliographic Details
Main Authors: Nianzong Hou, Mingzhe Li, Lu He, Bing Xie, Lin Wang, Rumin Zhang, Yong Yu, Xiaodong Sun, Zhengsheng Pan, Kai Wang
Format: Article
Language:English
Published: BMC 2020-12-01
Series:Journal of Translational Medicine
Subjects:
Online Access:https://doi.org/10.1186/s12967-020-02620-5
_version_ 1818439418079346688
author Nianzong Hou
Mingzhe Li
Lu He
Bing Xie
Lin Wang
Rumin Zhang
Yong Yu
Xiaodong Sun
Zhengsheng Pan
Kai Wang
author_facet Nianzong Hou
Mingzhe Li
Lu He
Bing Xie
Lin Wang
Rumin Zhang
Yong Yu
Xiaodong Sun
Zhengsheng Pan
Kai Wang
author_sort Nianzong Hou
collection DOAJ
description Abstract Background Sepsis is a significant cause of mortality in-hospital, especially in ICU patients. Early prediction of sepsis is essential, as prompt and appropriate treatment can improve survival outcomes. Machine learning methods are flexible prediction algorithms with potential advantages over conventional regression and scoring system. The aims of this study were to develop a machine learning approach using XGboost to predict the 30-days mortality for MIMIC-III Patients with sepsis-3 and to determine whether such model performs better than traditional prediction models. Methods Using the MIMIC-III v1.4, we identified patients with sepsis-3. The data was split into two groups based on death or survival within 30 days and variables, selected based on clinical significance and availability by stepwise analysis, were displayed and compared between groups. Three predictive models including conventional logistic regression model, SAPS-II score prediction model and XGBoost algorithm model were constructed by R software. Then, the performances of the three models were tested and compared by AUCs of the receiver operating characteristic curves and decision curve analysis. At last, nomogram and clinical impact curve were used to validate the model. Results A total of 4559 sepsis-3 patients are included in the study, in which, 889 patients were death and 3670 survival within 30 days, respectively. According to the results of AUCs (0.819 [95% CI 0.800–0.838], 0.797 [95% CI 0.781–0.813] and 0.857 [95% CI 0.839–0.876]) and decision curve analysis for the three models, the XGboost model performs best. The risk nomogram and clinical impact curve verify that the XGboost model possesses significant predictive value. Conclusions Using machine learning technique by XGboost, more significant prediction model can be built. This XGboost model may prove clinically useful and assist clinicians in tailoring precise management and therapy for the patients with sepsis-3.
first_indexed 2024-12-14T17:56:08Z
format Article
id doaj.art-7f5fed9e15f34fccba407542deb52b8c
institution Directory Open Access Journal
issn 1479-5876
language English
last_indexed 2024-12-14T17:56:08Z
publishDate 2020-12-01
publisher BMC
record_format Article
series Journal of Translational Medicine
spelling doaj.art-7f5fed9e15f34fccba407542deb52b8c2022-12-21T22:52:33ZengBMCJournal of Translational Medicine1479-58762020-12-0118111410.1186/s12967-020-02620-5Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboostNianzong Hou0Mingzhe Li1Lu He2Bing Xie3Lin Wang4Rumin Zhang5Yong Yu6Xiaodong Sun7Zhengsheng Pan8Kai Wang9Department of Hand and Foot Surgery, Zibo Central Hospital, Shandong First Medical UniversityIndependent researcherInstitute of Medicine and Nursing, Hubei University of MedicineDepartment of Hand and Foot Surgery, Zibo Central Hospital, Shandong First Medical UniversityDepartment of Critical Care Medicine, Zibo Central Hospital, Shandong First Medical UniversityDepartment of Critical Care Medicine, Zibo Central Hospital, Shandong First Medical UniversityDepartment of Critical Care Medicine, Zibo Central Hospital, Shandong First Medical UniversityFengnan District Maternal and Child Health Care Hospital of Tangshan CityDepartment of Urology Surgery, Zibo Central Hospital, Shandong First Medical UniversityDepartment of Critical Care Medicine, Zibo Central Hospital, Shandong First Medical UniversityAbstract Background Sepsis is a significant cause of mortality in-hospital, especially in ICU patients. Early prediction of sepsis is essential, as prompt and appropriate treatment can improve survival outcomes. Machine learning methods are flexible prediction algorithms with potential advantages over conventional regression and scoring system. The aims of this study were to develop a machine learning approach using XGboost to predict the 30-days mortality for MIMIC-III Patients with sepsis-3 and to determine whether such model performs better than traditional prediction models. Methods Using the MIMIC-III v1.4, we identified patients with sepsis-3. The data was split into two groups based on death or survival within 30 days and variables, selected based on clinical significance and availability by stepwise analysis, were displayed and compared between groups. Three predictive models including conventional logistic regression model, SAPS-II score prediction model and XGBoost algorithm model were constructed by R software. Then, the performances of the three models were tested and compared by AUCs of the receiver operating characteristic curves and decision curve analysis. At last, nomogram and clinical impact curve were used to validate the model. Results A total of 4559 sepsis-3 patients are included in the study, in which, 889 patients were death and 3670 survival within 30 days, respectively. According to the results of AUCs (0.819 [95% CI 0.800–0.838], 0.797 [95% CI 0.781–0.813] and 0.857 [95% CI 0.839–0.876]) and decision curve analysis for the three models, the XGboost model performs best. The risk nomogram and clinical impact curve verify that the XGboost model possesses significant predictive value. Conclusions Using machine learning technique by XGboost, more significant prediction model can be built. This XGboost model may prove clinically useful and assist clinicians in tailoring precise management and therapy for the patients with sepsis-3.https://doi.org/10.1186/s12967-020-02620-5MIMIC-IIISepsis-3Machine learningXgboostLogistic regressionSAPS-II score
spellingShingle Nianzong Hou
Mingzhe Li
Lu He
Bing Xie
Lin Wang
Rumin Zhang
Yong Yu
Xiaodong Sun
Zhengsheng Pan
Kai Wang
Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
Journal of Translational Medicine
MIMIC-III
Sepsis-3
Machine learning
Xgboost
Logistic regression
SAPS-II score
title Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
title_full Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
title_fullStr Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
title_full_unstemmed Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
title_short Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
title_sort predicting 30 days mortality for mimic iii patients with sepsis 3 a machine learning approach using xgboost
topic MIMIC-III
Sepsis-3
Machine learning
Xgboost
Logistic regression
SAPS-II score
url https://doi.org/10.1186/s12967-020-02620-5
work_keys_str_mv AT nianzonghou predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT mingzheli predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT luhe predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT bingxie predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT linwang predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT ruminzhang predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT yongyu predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT xiaodongsun predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT zhengshengpan predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost
AT kaiwang predicting30daysmortalityformimiciiipatientswithsepsis3amachinelearningapproachusingxgboost