Automatically identifying social isolation from clinical narratives for patients with prostate Cancer

Abstract Background Social isolation is an important social determinant that impacts health outcomes and mortality among patients. The National Academy of Medicine recently recommended that social isolation be documented in electronic health records (EHR). However, social isolation usually is not re...

Full description

Bibliographic Details
Main Authors: Vivienne J Zhu, Leslie A Lenert, Brian E Bunnell, Jihad S Obeid, Melanie Jefferson, Chanita Hughes Halbert
Format: Article
Language:English
Published: BMC 2019-03-01
Series:BMC Medical Informatics and Decision Making
Online Access:http://link.springer.com/article/10.1186/s12911-019-0795-y
_version_ 1818953319654096896
author Vivienne J Zhu
Leslie A Lenert
Brian E Bunnell
Jihad S Obeid
Melanie Jefferson
Chanita Hughes Halbert
author_facet Vivienne J Zhu
Leslie A Lenert
Brian E Bunnell
Jihad S Obeid
Melanie Jefferson
Chanita Hughes Halbert
author_sort Vivienne J Zhu
collection DOAJ
description Abstract Background Social isolation is an important social determinant that impacts health outcomes and mortality among patients. The National Academy of Medicine recently recommended that social isolation be documented in electronic health records (EHR). However, social isolation usually is not recorded or obtained as coded data but rather collected from patient self-report or documented in clinical narratives. This study explores the feasibility and effectiveness of natural language processing (NLP) strategy for identifying patients who are socially isolated from clinical narratives. Method We used data from the Medical University of South Carolina (MUSC) Research Data Warehouse. Patients 18 years-of-age or older who were diagnosed with prostate cancer between January 1, 2014 and May 31, 2017 were eligible for this study. NLP pipelines identifying social isolation were developed via extraction of notes on progress, history and physical, consult, emergency department provider, telephone encounter, discharge summary, plan of care, and radiation oncology. Of 4195 eligible prostate cancer patients, we randomly sampled 3138 patients (75%) as a training dataset. The remaining 1057 patients (25%) were used as a test dataset to evaluate NLP algorithm performance. Standard performance measures for the NLP algorithm, including precision, recall, and F-measure, were assessed by expert manual review using the test dataset. Results A total of 55,516 clinical notes from 3138 patients were included to develop the lexicon and NLP pipelines for social isolation. Of those, 35 unique patients (1.2%) had social isolation mention(s) in 217 notes. Among 24 terms relevant to social isolation, the most prevalent were “lack of social support,” “lonely,” “social isolation,” “no friends,” and “loneliness”. Among 1057 patients in the test dataset, 17 patients (1.6%) were identified as having social isolation mention(s) in 40 clinical notes. Manual review identified four false positive mentions of social isolation and one false negatives in 154 notes from randomly selected 52 controls. The NLP pipeline demonstrated 90% precision, 97% recall, and 93% F-measure. The major reasons for a false positive included the ambiguities of the experiencer of social isolation, negation, and alternate meaning of words. Conclusions Our NLP algorithms demonstrate a highly accurate approach to identify social isolation.
first_indexed 2024-12-20T10:04:23Z
format Article
id doaj.art-1fd9561a87b944c89440e78fd1fed7f9
institution Directory Open Access Journal
issn 1472-6947
language English
last_indexed 2024-12-20T10:04:23Z
publishDate 2019-03-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj.art-1fd9561a87b944c89440e78fd1fed7f92022-12-21T19:44:15ZengBMCBMC Medical Informatics and Decision Making1472-69472019-03-011911910.1186/s12911-019-0795-yAutomatically identifying social isolation from clinical narratives for patients with prostate CancerVivienne J Zhu0Leslie A Lenert1Brian E Bunnell2Jihad S Obeid3Melanie Jefferson4Chanita Hughes Halbert5Biomedical Informatics Center at Medical University of South CarolinaBiomedical Informatics Center at Medical University of South CarolinaBiomedical Informatics Center at Medical University of South CarolinaBiomedical Informatics Center at Medical University of South CarolinaHolling Cancer Center and Department of Psychiatry and Behavioral Sciences at Medical University of South CarolinaHolling Cancer Center and Department of Psychiatry and Behavioral Sciences at Medical University of South CarolinaAbstract Background Social isolation is an important social determinant that impacts health outcomes and mortality among patients. The National Academy of Medicine recently recommended that social isolation be documented in electronic health records (EHR). However, social isolation usually is not recorded or obtained as coded data but rather collected from patient self-report or documented in clinical narratives. This study explores the feasibility and effectiveness of natural language processing (NLP) strategy for identifying patients who are socially isolated from clinical narratives. Method We used data from the Medical University of South Carolina (MUSC) Research Data Warehouse. Patients 18 years-of-age or older who were diagnosed with prostate cancer between January 1, 2014 and May 31, 2017 were eligible for this study. NLP pipelines identifying social isolation were developed via extraction of notes on progress, history and physical, consult, emergency department provider, telephone encounter, discharge summary, plan of care, and radiation oncology. Of 4195 eligible prostate cancer patients, we randomly sampled 3138 patients (75%) as a training dataset. The remaining 1057 patients (25%) were used as a test dataset to evaluate NLP algorithm performance. Standard performance measures for the NLP algorithm, including precision, recall, and F-measure, were assessed by expert manual review using the test dataset. Results A total of 55,516 clinical notes from 3138 patients were included to develop the lexicon and NLP pipelines for social isolation. Of those, 35 unique patients (1.2%) had social isolation mention(s) in 217 notes. Among 24 terms relevant to social isolation, the most prevalent were “lack of social support,” “lonely,” “social isolation,” “no friends,” and “loneliness”. Among 1057 patients in the test dataset, 17 patients (1.6%) were identified as having social isolation mention(s) in 40 clinical notes. Manual review identified four false positive mentions of social isolation and one false negatives in 154 notes from randomly selected 52 controls. The NLP pipeline demonstrated 90% precision, 97% recall, and 93% F-measure. The major reasons for a false positive included the ambiguities of the experiencer of social isolation, negation, and alternate meaning of words. Conclusions Our NLP algorithms demonstrate a highly accurate approach to identify social isolation.http://link.springer.com/article/10.1186/s12911-019-0795-y
spellingShingle Vivienne J Zhu
Leslie A Lenert
Brian E Bunnell
Jihad S Obeid
Melanie Jefferson
Chanita Hughes Halbert
Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
BMC Medical Informatics and Decision Making
title Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
title_full Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
title_fullStr Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
title_full_unstemmed Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
title_short Automatically identifying social isolation from clinical narratives for patients with prostate Cancer
title_sort automatically identifying social isolation from clinical narratives for patients with prostate cancer
url http://link.springer.com/article/10.1186/s12911-019-0795-y
work_keys_str_mv AT viviennejzhu automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer
AT lesliealenert automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer
AT brianebunnell automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer
AT jihadsobeid automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer
AT melaniejefferson automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer
AT chanitahugheshalbert automaticallyidentifyingsocialisolationfromclinicalnarrativesforpatientswithprostatecancer