Unfolding physiological state: mortality modelling in intensive care units

Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the...

Full description

Bibliographic Details
Main Authors: Ghassemi, Marzyeh, Doshi-Velez, Finale, Joshi, Rohit, Rumshisky, Anna, Szolovits, Peter, Naumann, Tristan, Brimmer, Nicole J.
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: Association for Computing Machinery (ACM) 2016
Online Access:http://hdl.handle.net/1721.1/101049
https://orcid.org/0000-0001-6349-7251
https://orcid.org/0000-0003-2150-1747
https://orcid.org/0000-0001-8411-6403
_version_ 1811079437436846080
author Ghassemi, Marzyeh
Doshi-Velez, Finale
Joshi, Rohit
Rumshisky, Anna
Szolovits, Peter
Naumann, Tristan
Brimmer, Nicole J.
author2 Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Ghassemi, Marzyeh
Doshi-Velez, Finale
Joshi, Rohit
Rumshisky, Anna
Szolovits, Peter
Naumann, Tristan
Brimmer, Nicole J.
author_sort Ghassemi, Marzyeh
collection MIT
description Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: in-hospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies.
first_indexed 2024-09-23T11:15:00Z
format Article
id mit-1721.1/101049
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T11:15:00Z
publishDate 2016
publisher Association for Computing Machinery (ACM)
record_format dspace
spelling mit-1721.1/1010492022-09-27T18:11:17Z Unfolding physiological state: mortality modelling in intensive care units Ghassemi, Marzyeh Doshi-Velez, Finale Joshi, Rohit Rumshisky, Anna Szolovits, Peter Naumann, Tristan Brimmer, Nicole J. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Ghassemi, Marzyeh Naumann, Tristan Brimmer, Nicole J. Joshi, Rohit Szolovits, Peter Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: in-hospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies. Intel Corporation. Science and Technology Center for Big Data National Library of Medicine (U.S.) (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22) 2016-02-02T01:25:38Z 2016-02-02T01:25:38Z 2014-08 Article http://purl.org/eprint/type/ConferencePaper 9781450329569 http://hdl.handle.net/1721.1/101049 Marzyeh Ghassemi, Tristan Naumann, Finale Doshi-Velez, Nicole Brimmer, Rohit Joshi, Anna Rumshisky, and Peter Szolovits. 2014. Unfolding physiological state: mortality modelling in intensive care units. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '14). ACM, New York, NY, USA, 75-84. https://orcid.org/0000-0001-6349-7251 https://orcid.org/0000-0003-2150-1747 https://orcid.org/0000-0001-8411-6403 en_US http://dx.doi.org/10.1145/2623330.2623742 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '14) Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computing Machinery (ACM) PMC
spellingShingle Ghassemi, Marzyeh
Doshi-Velez, Finale
Joshi, Rohit
Rumshisky, Anna
Szolovits, Peter
Naumann, Tristan
Brimmer, Nicole J.
Unfolding physiological state: mortality modelling in intensive care units
title Unfolding physiological state: mortality modelling in intensive care units
title_full Unfolding physiological state: mortality modelling in intensive care units
title_fullStr Unfolding physiological state: mortality modelling in intensive care units
title_full_unstemmed Unfolding physiological state: mortality modelling in intensive care units
title_short Unfolding physiological state: mortality modelling in intensive care units
title_sort unfolding physiological state mortality modelling in intensive care units
url http://hdl.handle.net/1721.1/101049
https://orcid.org/0000-0001-6349-7251
https://orcid.org/0000-0003-2150-1747
https://orcid.org/0000-0001-8411-6403
work_keys_str_mv AT ghassemimarzyeh unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT doshivelezfinale unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT joshirohit unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT rumshiskyanna unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT szolovitspeter unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT naumanntristan unfoldingphysiologicalstatemortalitymodellinginintensivecareunits
AT brimmernicolej unfoldingphysiologicalstatemortalitymodellinginintensivecareunits