Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis

Background: Early recognition of sepsis and the prediction of mortality in patients with infection are important. This multi-center, ED-based study aimed to develop and validate a 28-day mortality prediction model for patients with infection using various machine learning (ML) algorithms. Methods: P...

Full description

Bibliographic Details
Main Authors: Hsiao-Yun Chao, Chin-Chieh Wu, Avichandra Singh, Andrew Shedd, Jon Wolfshohl, Eric H. Chou, Yhu-Chering Huang, Kuan-Fu Chen
Format: Article
Language:English
Published: MDPI AG 2022-03-01
Series:Biomedicines
Subjects:
Online Access:https://www.mdpi.com/2227-9059/10/4/802
_version_ 1797436841787392000
author Hsiao-Yun Chao
Chin-Chieh Wu
Avichandra Singh
Andrew Shedd
Jon Wolfshohl
Eric H. Chou
Yhu-Chering Huang
Kuan-Fu Chen
author_facet Hsiao-Yun Chao
Chin-Chieh Wu
Avichandra Singh
Andrew Shedd
Jon Wolfshohl
Eric H. Chou
Yhu-Chering Huang
Kuan-Fu Chen
author_sort Hsiao-Yun Chao
collection DOAJ
description Background: Early recognition of sepsis and the prediction of mortality in patients with infection are important. This multi-center, ED-based study aimed to develop and validate a 28-day mortality prediction model for patients with infection using various machine learning (ML) algorithms. Methods: Patients with acute infection requiring intravenous antibiotic treatment during the first 24 h of admission were prospectively recruited. Patient demographics, comorbidities, clinical signs and symptoms, laboratory test data, selected sepsis-related novel biomarkers, and 28-day mortality were collected and divided into training (70%) and testing (30%) datasets. Logistic regression and seven ML algorithms were used to develop the prediction models. The area under the receiver operating characteristic curve (AUROC) was used to compare different models. Results: A total of 555 patients were recruited with a full panel of biomarker tests. Among them, 18% fulfilled Sepsis-3 criteria, with a 28-day mortality rate of 8%. The wrapper algorithm selected 30 features, including disease severity scores, biochemical parameters, and conventional and few sepsis-related biomarkers. Random forest outperformed other ML models (AUROC: 0.96; 95% confidence interval: 0.93–0.98) and SOFA and early warning scores (AUROC: 0.64–0.84) in the prediction of 28-day mortality in patients with infection. Additionally, random forest remained the best-performing model, with an AUROC of 0.95 (95% CI: 0.91–0.98, <i>p</i> = 0.725) after removing five sepsis-related novel biomarkers. Conclusions: Our results demonstrated that ML models provide a more accurate prediction of 28-day mortality with an enhanced ability in dealing with multi-dimensional data than the logistic regression model.
first_indexed 2024-03-09T11:07:27Z
format Article
id doaj.art-baf2524c6e2b4c2a9a23f74db30d37d2
institution Directory Open Access Journal
issn 2227-9059
language English
last_indexed 2024-03-09T11:07:27Z
publishDate 2022-03-01
publisher MDPI AG
record_format Article
series Biomedicines
spelling doaj.art-baf2524c6e2b4c2a9a23f74db30d37d22023-12-01T00:53:33ZengMDPI AGBiomedicines2227-90592022-03-0110480210.3390/biomedicines10040802Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected SepsisHsiao-Yun Chao0Chin-Chieh Wu1Avichandra Singh2Andrew Shedd3Jon Wolfshohl4Eric H. Chou5Yhu-Chering Huang6Kuan-Fu Chen7Department of Emergency Medicine, Linkou Chang Gung Memorial Hospital, No. 5, Fu-Shin Street, Gueishan Village, Taoyuan 333423, TaiwanClinical Informatics and Medical Statistics Research Center, Chang Gung University, Taoyuan 33302, TaiwanDepartment of Emergency Medicine, Keelung Chang Gung Memorial Hospital, Keelung 20401, TaiwanDepartment of Emergency Medicine, Baylor Scott and White All Saints Medical Center, Fort Worth, TX 76104, USADepartment of Emergency Medicine, Baylor Scott and White All Saints Medical Center, Fort Worth, TX 76104, USADepartment of Emergency Medicine, Baylor Scott and White All Saints Medical Center, Fort Worth, TX 76104, USADivision of Pediatric Infectious Diseases, Linkou Chang Gung Memorial Hospital, No. 5, Fu-Shin Street, Gueishan Village, Taoyuan 333423, TaiwanDepartment of Emergency Medicine, Linkou Chang Gung Memorial Hospital, No. 5, Fu-Shin Street, Gueishan Village, Taoyuan 333423, TaiwanBackground: Early recognition of sepsis and the prediction of mortality in patients with infection are important. This multi-center, ED-based study aimed to develop and validate a 28-day mortality prediction model for patients with infection using various machine learning (ML) algorithms. Methods: Patients with acute infection requiring intravenous antibiotic treatment during the first 24 h of admission were prospectively recruited. Patient demographics, comorbidities, clinical signs and symptoms, laboratory test data, selected sepsis-related novel biomarkers, and 28-day mortality were collected and divided into training (70%) and testing (30%) datasets. Logistic regression and seven ML algorithms were used to develop the prediction models. The area under the receiver operating characteristic curve (AUROC) was used to compare different models. Results: A total of 555 patients were recruited with a full panel of biomarker tests. Among them, 18% fulfilled Sepsis-3 criteria, with a 28-day mortality rate of 8%. The wrapper algorithm selected 30 features, including disease severity scores, biochemical parameters, and conventional and few sepsis-related biomarkers. Random forest outperformed other ML models (AUROC: 0.96; 95% confidence interval: 0.93–0.98) and SOFA and early warning scores (AUROC: 0.64–0.84) in the prediction of 28-day mortality in patients with infection. Additionally, random forest remained the best-performing model, with an AUROC of 0.95 (95% CI: 0.91–0.98, <i>p</i> = 0.725) after removing five sepsis-related novel biomarkers. Conclusions: Our results demonstrated that ML models provide a more accurate prediction of 28-day mortality with an enhanced ability in dealing with multi-dimensional data than the logistic regression model.https://www.mdpi.com/2227-9059/10/4/802biomarkerlogistic regressionmachine learningmortality predictionsepsis
spellingShingle Hsiao-Yun Chao
Chin-Chieh Wu
Avichandra Singh
Andrew Shedd
Jon Wolfshohl
Eric H. Chou
Yhu-Chering Huang
Kuan-Fu Chen
Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
Biomedicines
biomarker
logistic regression
machine learning
mortality prediction
sepsis
title Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
title_full Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
title_fullStr Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
title_full_unstemmed Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
title_short Using Machine Learning to Develop and Validate an In-Hospital Mortality Prediction Model for Patients with Suspected Sepsis
title_sort using machine learning to develop and validate an in hospital mortality prediction model for patients with suspected sepsis
topic biomarker
logistic regression
machine learning
mortality prediction
sepsis
url https://www.mdpi.com/2227-9059/10/4/802
work_keys_str_mv AT hsiaoyunchao usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT chinchiehwu usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT avichandrasingh usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT andrewshedd usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT jonwolfshohl usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT erichchou usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT yhucheringhuang usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis
AT kuanfuchen usingmachinelearningtodevelopandvalidateaninhospitalmortalitypredictionmodelforpatientswithsuspectedsepsis