2182

OBJECTIVES/SPECIFIC AIMS: An accurate method to identify bleeding in large populations does not exist. Our goal was to explore bleeding representation in clinical text in order to develop a natural language processing (NLP) approach to automatically identify bleeding events from clinical notes. METH...

Bibliographic Details
Main Authors:	Rashmee Shah, Benjamin Steinberg, Brian Bucher, Alec Chapman, Donald Lloyd-Jones, Matthew Rondina, Wendy Chapman
Format:	Article
Language:	English
Published:	Cambridge University Press 2017-09-01
Series:	Journal of Clinical and Translational Science
Online Access:	https://www.cambridge.org/core/product/identifier/S2059866117000607/type/journal_article

_version_	1827995505380032512
author	Rashmee Shah Benjamin Steinberg Brian Bucher Alec Chapman Donald Lloyd-Jones Matthew Rondina Wendy Chapman
author_facet	Rashmee Shah Benjamin Steinberg Brian Bucher Alec Chapman Donald Lloyd-Jones Matthew Rondina Wendy Chapman
author_sort	Rashmee Shah
collection	DOAJ
description	OBJECTIVES/SPECIFIC AIMS: An accurate method to identify bleeding in large populations does not exist. Our goal was to explore bleeding representation in clinical text in order to develop a natural language processing (NLP) approach to automatically identify bleeding events from clinical notes. METHODS/STUDY POPULATION: We used publicly available notes for ICU patients at high risk of bleeding (n=98,586 notes). Two physicians reviewed randomly selected notes and annotated all direct references to bleeding as “bleeding present” (BP) or “bleeding absent” (BA). Annotations were made at the mention level (if 1 specific sentence/phrase indicated BP or BA) and note level (if the overall note indicated BP or BA). A third physician adjudicated discordant annotations. RESULTS/ANTICIPATED RESULTS: In 120 randomly selected notes, bleeding was mentioned 406 times with 76 distinct words. Inter-annotator agreement was 89% by the last batch of 30 notes. In total, 10 terms accounted for 65% of all bleeding mentions. We aggregated these results into 16 common stems (e.g., “hemorr” for hemorrhagic and hemorrhage), which accounted for 90% of all 406 mentions. Of all 120 notes, 60% were classified as BP. The median number of stems was 5 (IQR 2, 9) in BP versus 0 (IQR 0, 1) in BA notes. Zero bleeding mentions in a note was associated with BA (OR 28, 95% CI 6.5, 127). With 40 true negatives and 2 false negatives, the negative predictive value (NPV) of zero bleeding mentions was 95%. DISCUSSION/SIGNIFICANCE OF IMPACT: Few bleeding-related terms are used in clinical practice. Absence of these terms has a high NPV for the absence of bleeding. These results suggest that a high-throughput, rules-based NLP tool to identify bleeding is feasible.
first_indexed	2024-04-10T04:56:29Z
format	Article
id	doaj.art-f89abe8d2ac64c86be37942c18bcd77a
institution	Directory Open Access Journal
issn	2059-8661
language	English
last_indexed	2024-04-10T04:56:29Z
publishDate	2017-09-01
publisher	Cambridge University Press
record_format	Article
series	Journal of Clinical and Translational Science
spelling	doaj.art-f89abe8d2ac64c86be37942c18bcd77a; 2023-03-09T12:30:07Z; eng; Cambridge University Press; Journal of Clinical and Translational Science; 2059-8661; 2017-09-01; 112121; 10.1017/cts.2017.60; 2182; Rashmee Shah [0]; Benjamin Steinberg [1]; Brian Bucher [2]; Alec Chapman [3]; Donald Lloyd-Jones [4]; Matthew Rondina [5]; Wendy Chapman [6]; [0-6] The University of Utah School of Medicine, Salt Lake City, UT, USA; https://www.cambridge.org/core/product/identifier/S2059866117000607/type/journal_article
spellingShingle	Rashmee Shah Benjamin Steinberg Brian Bucher Alec Chapman Donald Lloyd-Jones Matthew Rondina Wendy Chapman 2182 Journal of Clinical and Translational Science
title	2182
title_full	2182
title_fullStr	2182
title_full_unstemmed	2182
title_short	2182
title_sort	2182
url	https://www.cambridge.org/core/product/identifier/S2059866117000607/type/journal_article
work_keys_str_mv	AT rashmeeshah 2182 AT benjaminsteinberg 2182 AT brianbucher 2182 AT alecchapman 2182 AT donaldlloydjones 2182 AT matthewrondina 2182 AT wendychapman 2182
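
The stem-based counting and the NPV figure reported in the description can be illustrated with a minimal Python sketch. This is not the authors' tool. The stem list is hypothetical apart from "hemorr", which is the only stem the abstract names, and the function names `count_stem_mentions` and `negative_predictive_value` are made up for this example. The counts of 40 true negatives and 2 false negatives are taken from the abstract.

```python
import re

# Hypothetical stem list: only "hemorr" appears in the abstract;
# "bleed" and "hematoma" are placeholders for illustration.
STEMS = ["hemorr", "bleed", "hematoma"]


def count_stem_mentions(note: str, stems=STEMS) -> int:
    """Count words in the note that begin with any of the stems (case-insensitive)."""
    alternatives = "|".join(re.escape(s) for s in stems)
    pattern = re.compile(r"\b(?:" + alternatives + r")\w*", re.IGNORECASE)
    return len(pattern.findall(note))


def negative_predictive_value(true_neg: int, false_neg: int) -> float:
    """NPV = TN / (TN + FN)."""
    return true_neg / (true_neg + false_neg)


# Made-up example sentence, not taken from the study's notes.
note = "No evidence of hemorrhage. Hemorrhagic stroke ruled out; small hematoma noted."
mentions = count_stem_mentions(note)

# Counts reported in the abstract: 40 true negatives, 2 false negatives.
npv = negative_predictive_value(40, 2)
print(f"stem mentions: {mentions}, NPV: {npv:.1%}")
```

This sketch only counts stem matches. It does not distinguish "bleeding present" from "bleeding absent" mentions, which the study handled through physician annotation, so a negated phrase such as "no evidence of hemorrhage" still counts as a mention here.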

Similar Items