Predictability Bounds of Electronic Health Records

The ability to intervene in disease progression given a person’s disease history has the potential to solve one of society’s most pressing issues: advancing health care delivery and reducing its cost. Controlling disease progression is inherently associated with the ability to predict possible futur...

Full description

Bibliographic Details
Main Authors: Dahlem, Dominik, Maniloff, Diego, Ratti, Carlo
Other Authors: Massachusetts Institute of Technology. Department of Urban Studies and Planning
Format: Article
Language:en_US
Published: Nature Publishing Group 2015
Online Access:http://hdl.handle.net/1721.1/97694
https://orcid.org/0000-0003-2026-5631
_version_ 1811072508374286336
author Dahlem, Dominik
Maniloff, Diego
Ratti, Carlo
author2 Massachusetts Institute of Technology. Department of Urban Studies and Planning
author_facet Massachusetts Institute of Technology. Department of Urban Studies and Planning
Dahlem, Dominik
Maniloff, Diego
Ratti, Carlo
author_sort Dahlem, Dominik
collection MIT
description The ability to intervene in disease progression given a person’s disease history has the potential to solve one of society’s most pressing issues: advancing health care delivery and reducing its cost. Controlling disease progression is inherently associated with the ability to predict possible future diseases given a patient’s medical history. We invoke an information-theoretic methodology to quantify the level of predictability inherent in disease histories of a large electronic health records dataset with over half a million patients. In our analysis, we progress from zeroth order through temporal informed statistics, both from an individual patient’s standpoint and also considering the collective effects. Our findings confirm our intuition that knowledge of common disease progressions results in higher predictability bounds than treating disease histories independently. We complement this result by showing the point at which the temporal dependence structure vanishes with increasing orders of the time-correlated statistic. Surprisingly, we also show that shuffling individual disease histories only marginally degrades the predictability bounds. This apparent contradiction with respect to the importance of time-ordered information is indicative of the complexities involved in capturing the health-care process and the difficulties associated with utilising this information in universal prediction algorithms.
first_indexed 2024-09-23T09:07:05Z
format Article
id mit-1721.1/97694
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T09:07:05Z
publishDate 2015
publisher Nature Publishing Group
record_format dspace
spelling mit-1721.1/976942022-09-26T10:34:25Z Predictability Bounds of Electronic Health Records Dahlem, Dominik Maniloff, Diego Ratti, Carlo Massachusetts Institute of Technology. Department of Urban Studies and Planning Massachusetts Institute of Technology. SENSEable City Laboratory Dahlem, Dominik Maniloff, Diego Ratti, Carlo The ability to intervene in disease progression given a person’s disease history has the potential to solve one of society’s most pressing issues: advancing health care delivery and reducing its cost. Controlling disease progression is inherently associated with the ability to predict possible future diseases given a patient’s medical history. We invoke an information-theoretic methodology to quantify the level of predictability inherent in disease histories of a large electronic health records dataset with over half a million patients. In our analysis, we progress from zeroth order through temporal informed statistics, both from an individual patient’s standpoint and also considering the collective effects. Our findings confirm our intuition that knowledge of common disease progressions results in higher predictability bounds than treating disease histories independently. We complement this result by showing the point at which the temporal dependence structure vanishes with increasing orders of the time-correlated statistic. Surprisingly, we also show that shuffling individual disease histories only marginally degrades the predictability bounds. This apparent contradiction with respect to the importance of time-ordered information is indicative of the complexities involved in capturing the health-care process and the difficulties associated with utilising this information in universal prediction algorithms. General Electric Company AT&T Foundation National Science Foundation (U.S.) American Society for Engineering Education. National Defense Science and Engineering Graduate Fellowship Audi Volkswagen 2015-07-07T15:42:11Z 2015-07-07T15:42:11Z 2015-07 2013-12 Article http://purl.org/eprint/type/JournalArticle 2045-2322 http://hdl.handle.net/1721.1/97694 Dahlem, Dominik, Diego Maniloff, and Carlo Ratti. “Predictability Bounds of Electronic Health Records.” Scientific Reports 5 (July 7, 2015): 11865. https://orcid.org/0000-0003-2026-5631 en_US http://dx.doi.org/10.1038/srep11865 Scientific Reports Creative Commons Attribution http://creativecommons.org/licenses/by/4.0/ application/pdf Nature Publishing Group Nature
spellingShingle Dahlem, Dominik
Maniloff, Diego
Ratti, Carlo
Predictability Bounds of Electronic Health Records
title Predictability Bounds of Electronic Health Records
title_full Predictability Bounds of Electronic Health Records
title_fullStr Predictability Bounds of Electronic Health Records
title_full_unstemmed Predictability Bounds of Electronic Health Records
title_short Predictability Bounds of Electronic Health Records
title_sort predictability bounds of electronic health records
url http://hdl.handle.net/1721.1/97694
https://orcid.org/0000-0003-2026-5631
work_keys_str_mv AT dahlemdominik predictabilityboundsofelectronichealthrecords
AT maniloffdiego predictabilityboundsofelectronichealthrecords
AT ratticarlo predictabilityboundsofelectronichealthrecords