Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study

BackgroundDelirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. ObjectiveWe sought to...

Full description

Bibliographic Details
Main Authors: Wendong Ge, Haitham Alabsi, Aayushee Jain, Elissa Ye, Haoqi Sun, Marta Fernandes, Colin Magdamo, Ryan A Tesh, Sarah I Collens, Amy Newhouse, Lidia MVR Moura, Sahar Zafar, John Hsu, Oluwaseun Akeju, Gregory K Robbins, Shibani S Mukerji, Sudeshna Das, M Brandon Westover
Format: Article
Language:English
Published: JMIR Publications 2022-06-01
Series:JMIR Formative Research
Online Access:https://formative.jmir.org/2022/6/e33834
_version_ 1827858230829645824
author Wendong Ge
Haitham Alabsi
Aayushee Jain
Elissa Ye
Haoqi Sun
Marta Fernandes
Colin Magdamo
Ryan A Tesh
Sarah I Collens
Amy Newhouse
Lidia MVR Moura
Sahar Zafar
John Hsu
Oluwaseun Akeju
Gregory K Robbins
Shibani S Mukerji
Sudeshna Das
M Brandon Westover
author_facet Wendong Ge
Haitham Alabsi
Aayushee Jain
Elissa Ye
Haoqi Sun
Marta Fernandes
Colin Magdamo
Ryan A Tesh
Sarah I Collens
Amy Newhouse
Lidia MVR Moura
Sahar Zafar
John Hsu
Oluwaseun Akeju
Gregory K Robbins
Shibani S Mukerji
Sudeshna Das
M Brandon Westover
author_sort Wendong Ge
collection DOAJ
description BackgroundDelirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. ObjectiveWe sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes. MethodsWe collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the φ coefficient to measure associations with other delirium indicators. ResultsThe transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (φ) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs –0.028). ConclusionsClinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.
first_indexed 2024-03-12T12:51:57Z
format Article
id doaj.art-bb6606f0948349e6b2113cfa0a866124
institution Directory Open Access Journal
issn 2561-326X
language English
last_indexed 2024-03-12T12:51:57Z
publishDate 2022-06-01
publisher JMIR Publications
record_format Article
series JMIR Formative Research
spelling doaj.art-bb6606f0948349e6b2113cfa0a8661242023-08-28T22:22:05ZengJMIR PublicationsJMIR Formative Research2561-326X2022-06-0166e3383410.2196/33834Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational StudyWendong Gehttps://orcid.org/0000-0003-1557-5336Haitham Alabsihttps://orcid.org/0000-0001-6354-4679Aayushee Jainhttps://orcid.org/0000-0002-5018-3234Elissa Yehttps://orcid.org/0000-0003-4851-6543Haoqi Sunhttps://orcid.org/0000-0002-5041-8312Marta Fernandeshttps://orcid.org/0000-0002-7203-2832Colin Magdamohttps://orcid.org/0000-0001-8965-4630Ryan A Teshhttps://orcid.org/0000-0002-6154-6248Sarah I Collenshttps://orcid.org/0000-0001-7010-7266Amy Newhousehttps://orcid.org/0000-0002-7392-4242Lidia MVR Mourahttps://orcid.org/0000-0002-1191-1315Sahar Zafarhttps://orcid.org/0000-0001-5252-5376John Hsuhttps://orcid.org/0000-0001-8244-231XOluwaseun Akejuhttps://orcid.org/0000-0002-6740-1250Gregory K Robbinshttps://orcid.org/0000-0001-7545-5817Shibani S Mukerjihttps://orcid.org/0000-0002-5677-6954Sudeshna Dashttps://orcid.org/0000-0002-9486-6811M Brandon Westoverhttps://orcid.org/0000-0003-4803-312X BackgroundDelirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. ObjectiveWe sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes. MethodsWe collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the φ coefficient to measure associations with other delirium indicators. ResultsThe transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (φ) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs –0.028). ConclusionsClinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.https://formative.jmir.org/2022/6/e33834
spellingShingle Wendong Ge
Haitham Alabsi
Aayushee Jain
Elissa Ye
Haoqi Sun
Marta Fernandes
Colin Magdamo
Ryan A Tesh
Sarah I Collens
Amy Newhouse
Lidia MVR Moura
Sahar Zafar
John Hsu
Oluwaseun Akeju
Gregory K Robbins
Shibani S Mukerji
Sudeshna Das
M Brandon Westover
Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
JMIR Formative Research
title Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
title_full Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
title_fullStr Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
title_full_unstemmed Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
title_short Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study
title_sort identifying patients with delirium based on unstructured clinical notes observational study
url https://formative.jmir.org/2022/6/e33834
work_keys_str_mv AT wendongge identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT haithamalabsi identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT aayusheejain identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT elissaye identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT haoqisun identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT martafernandes identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT colinmagdamo identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT ryanatesh identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT sarahicollens identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT amynewhouse identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT lidiamvrmoura identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT saharzafar identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT johnhsu identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT oluwaseunakeju identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT gregorykrobbins identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT shibanismukerji identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT sudeshnadas identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy
AT mbrandonwestover identifyingpatientswithdeliriumbasedonunstructuredclinicalnotesobservationalstudy