Word sense disambiguation of acronyms in clinical narratives

Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associa...

Full description

Bibliographic Details
Main Authors: Daphné Chopard, Padraig Corcoran, Irena Spasić
Format: Article
Language:English
Published: Frontiers Media S.A. 2024-02-01
Series:Frontiers in Digital Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/full
_version_ 1827339779565420544
author Daphné Chopard
Padraig Corcoran
Irena Spasić
author_facet Daphné Chopard
Padraig Corcoran
Irena Spasić
author_sort Daphné Chopard
collection DOAJ
description Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories.
first_indexed 2024-03-07T20:07:35Z
format Article
id doaj.art-0631eecbf4cb44608b4b52920bd2108c
institution Directory Open Access Journal
issn 2673-253X
language English
last_indexed 2024-03-07T20:07:35Z
publishDate 2024-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Digital Health
spelling doaj.art-0631eecbf4cb44608b4b52920bd2108c2024-02-28T04:40:00ZengFrontiers Media S.A.Frontiers in Digital Health2673-253X2024-02-01610.3389/fdgth.2024.12820431282043Word sense disambiguation of acronyms in clinical narrativesDaphné ChopardPadraig CorcoranIrena SpasićClinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories.https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/fullnatural language processingword sense disambiguationacronym disambiguationmachine learningdeep learningsilver standard
spellingShingle Daphné Chopard
Padraig Corcoran
Irena Spasić
Word sense disambiguation of acronyms in clinical narratives
Frontiers in Digital Health
natural language processing
word sense disambiguation
acronym disambiguation
machine learning
deep learning
silver standard
title Word sense disambiguation of acronyms in clinical narratives
title_full Word sense disambiguation of acronyms in clinical narratives
title_fullStr Word sense disambiguation of acronyms in clinical narratives
title_full_unstemmed Word sense disambiguation of acronyms in clinical narratives
title_short Word sense disambiguation of acronyms in clinical narratives
title_sort word sense disambiguation of acronyms in clinical narratives
topic natural language processing
word sense disambiguation
acronym disambiguation
machine learning
deep learning
silver standard
url https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/full
work_keys_str_mv AT daphnechopard wordsensedisambiguationofacronymsinclinicalnarratives
AT padraigcorcoran wordsensedisambiguationofacronymsinclinicalnarratives
AT irenaspasic wordsensedisambiguationofacronymsinclinicalnarratives