Word sense disambiguation of acronyms in clinical narratives
Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associa...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2024-02-01
|
Series: | Frontiers in Digital Health |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/full |
_version_ | 1827339779565420544 |
---|---|
author | Daphné Chopard Padraig Corcoran Irena Spasić |
author_facet | Daphné Chopard Padraig Corcoran Irena Spasić |
author_sort | Daphné Chopard |
collection | DOAJ |
description | Clinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories. |
first_indexed | 2024-03-07T20:07:35Z |
format | Article |
id | doaj.art-0631eecbf4cb44608b4b52920bd2108c |
institution | Directory Open Access Journal |
issn | 2673-253X |
language | English |
last_indexed | 2024-03-07T20:07:35Z |
publishDate | 2024-02-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Digital Health |
spelling | doaj.art-0631eecbf4cb44608b4b52920bd2108c2024-02-28T04:40:00ZengFrontiers Media S.A.Frontiers in Digital Health2673-253X2024-02-01610.3389/fdgth.2024.12820431282043Word sense disambiguation of acronyms in clinical narrativesDaphné ChopardPadraig CorcoranIrena SpasićClinical narratives commonly use acronyms without explicitly defining their long forms. This makes it difficult to automatically interpret their sense as acronyms tend to be highly ambiguous. Supervised learning approaches to their disambiguation in the clinical domain are hindered by issues associated with patient privacy and manual annotation, which limit the size and diversity of training data. In this study, we demonstrate how scientific abstracts can be utilised to overcome these issues by creating a large automatically annotated dataset of artificially simulated global acronyms. A neural network trained on such a dataset achieved the F1-score of 95% on disambiguation of acronym mentions in scientific abstracts. This network was integrated with multi-word term recognition to extract a sense inventory of acronyms from a corpus of clinical narratives on the fly. Acronym sense extraction achieved the F1-score of 74% on a corpus of radiology reports. In clinical practice, the suggested approach can be used to facilitate development of institution-specific inventories.https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/fullnatural language processingword sense disambiguationacronym disambiguationmachine learningdeep learningsilver standard |
spellingShingle | Daphné Chopard Padraig Corcoran Irena Spasić Word sense disambiguation of acronyms in clinical narratives Frontiers in Digital Health natural language processing word sense disambiguation acronym disambiguation machine learning deep learning silver standard |
title | Word sense disambiguation of acronyms in clinical narratives |
title_full | Word sense disambiguation of acronyms in clinical narratives |
title_fullStr | Word sense disambiguation of acronyms in clinical narratives |
title_full_unstemmed | Word sense disambiguation of acronyms in clinical narratives |
title_short | Word sense disambiguation of acronyms in clinical narratives |
title_sort | word sense disambiguation of acronyms in clinical narratives |
topic | natural language processing word sense disambiguation acronym disambiguation machine learning deep learning silver standard |
url | https://www.frontiersin.org/articles/10.3389/fdgth.2024.1282043/full |
work_keys_str_mv | AT daphnechopard wordsensedisambiguationofacronymsinclinicalnarratives AT padraigcorcoran wordsensedisambiguationofacronymsinclinicalnarratives AT irenaspasic wordsensedisambiguationofacronymsinclinicalnarratives |