Machine learning for patient risk stratification for acute respiratory distress syndrome.

<h4>Background</h4>Existing prediction models for acute respiratory distress syndrome (ARDS) require manual chart abstraction and have only fair performance-limiting their suitability for driving clinical interventions. We sought to develop a machine learning approach for the prediction...

Full description

Bibliographic Details
Main Authors: Daniel Zeiberg, Tejas Prahlad, Brahmajee K Nallamothu, Theodore J Iwashyna, Jenna Wiens, Michael W Sjoding
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0214465
_version_ 1818899737327173632
author Daniel Zeiberg
Tejas Prahlad
Brahmajee K Nallamothu
Theodore J Iwashyna
Jenna Wiens
Michael W Sjoding
author_facet Daniel Zeiberg
Tejas Prahlad
Brahmajee K Nallamothu
Theodore J Iwashyna
Jenna Wiens
Michael W Sjoding
author_sort Daniel Zeiberg
collection DOAJ
description <h4>Background</h4>Existing prediction models for acute respiratory distress syndrome (ARDS) require manual chart abstraction and have only fair performance-limiting their suitability for driving clinical interventions. We sought to develop a machine learning approach for the prediction of ARDS that (a) leverages electronic health record (EHR) data, (b) is fully automated, and (c) can be applied at clinically relevant time points throughout a patient's stay.<h4>Methods and findings</h4>We trained a risk stratification model for ARDS using a cohort of 1,621 patients with moderate hypoxia from a single center in 2016, of which 51 patients developed ARDS. We tested the model in a temporally distinct cohort of 1,122 patients from 2017, of which 27 patients developed ARDS. Gold standard diagnosis of ARDS was made by intensive care trained physicians during retrospective chart review. We considered both linear and non-linear approaches to learning the model. The best model used L2-logistic regression with 984 features extracted from the EHR. For patients observed in the hospital at least six hours who then developed moderate hypoxia, the model achieved an area under the receiver operating characteristics curve (AUROC) of 0.81 (95% CI: 0.73-0.88). Selecting a threshold based on the 85th percentile of risk, the model had a sensitivity of 56% (95% CI: 35%, 74%), specificity of 86% (95% CI: 85%, 87%) and positive predictive value of 9% (95% CI: 5%, 14%), identifying a population at four times higher risk for ARDS than other patients with moderate hypoxia and 17 times the risk of hospitalized adults.<h4>Conclusions</h4>We developed an ARDS prediction model based on EHR data with good discriminative performance. Our results demonstrate the feasibility of a machine learning approach to risk stratifying patients for ARDS solely from data extracted automatically from the EHR.
first_indexed 2024-12-19T19:52:43Z
format Article
id doaj.art-d260ff0580e746cb9ef7440c9e0c6777
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-19T19:52:43Z
publishDate 2019-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-d260ff0580e746cb9ef7440c9e0c67772022-12-21T20:07:55ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01143e021446510.1371/journal.pone.0214465Machine learning for patient risk stratification for acute respiratory distress syndrome.Daniel ZeibergTejas PrahladBrahmajee K NallamothuTheodore J IwashynaJenna WiensMichael W Sjoding<h4>Background</h4>Existing prediction models for acute respiratory distress syndrome (ARDS) require manual chart abstraction and have only fair performance-limiting their suitability for driving clinical interventions. We sought to develop a machine learning approach for the prediction of ARDS that (a) leverages electronic health record (EHR) data, (b) is fully automated, and (c) can be applied at clinically relevant time points throughout a patient's stay.<h4>Methods and findings</h4>We trained a risk stratification model for ARDS using a cohort of 1,621 patients with moderate hypoxia from a single center in 2016, of which 51 patients developed ARDS. We tested the model in a temporally distinct cohort of 1,122 patients from 2017, of which 27 patients developed ARDS. Gold standard diagnosis of ARDS was made by intensive care trained physicians during retrospective chart review. We considered both linear and non-linear approaches to learning the model. The best model used L2-logistic regression with 984 features extracted from the EHR. For patients observed in the hospital at least six hours who then developed moderate hypoxia, the model achieved an area under the receiver operating characteristics curve (AUROC) of 0.81 (95% CI: 0.73-0.88). Selecting a threshold based on the 85th percentile of risk, the model had a sensitivity of 56% (95% CI: 35%, 74%), specificity of 86% (95% CI: 85%, 87%) and positive predictive value of 9% (95% CI: 5%, 14%), identifying a population at four times higher risk for ARDS than other patients with moderate hypoxia and 17 times the risk of hospitalized adults.<h4>Conclusions</h4>We developed an ARDS prediction model based on EHR data with good discriminative performance. Our results demonstrate the feasibility of a machine learning approach to risk stratifying patients for ARDS solely from data extracted automatically from the EHR.https://doi.org/10.1371/journal.pone.0214465
spellingShingle Daniel Zeiberg
Tejas Prahlad
Brahmajee K Nallamothu
Theodore J Iwashyna
Jenna Wiens
Michael W Sjoding
Machine learning for patient risk stratification for acute respiratory distress syndrome.
PLoS ONE
title Machine learning for patient risk stratification for acute respiratory distress syndrome.
title_full Machine learning for patient risk stratification for acute respiratory distress syndrome.
title_fullStr Machine learning for patient risk stratification for acute respiratory distress syndrome.
title_full_unstemmed Machine learning for patient risk stratification for acute respiratory distress syndrome.
title_short Machine learning for patient risk stratification for acute respiratory distress syndrome.
title_sort machine learning for patient risk stratification for acute respiratory distress syndrome
url https://doi.org/10.1371/journal.pone.0214465
work_keys_str_mv AT danielzeiberg machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome
AT tejasprahlad machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome
AT brahmajeeknallamothu machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome
AT theodorejiwashyna machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome
AT jennawiens machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome
AT michaelwsjoding machinelearningforpatientriskstratificationforacuterespiratorydistresssyndrome