Improving graph embeddings via entity linking: A case study on Italian clinical notes

The ever-increasing availability of Electronic Health Records (EHRs) is the key enabling factor of precision medicine, which aims to provide therapies and diagnoses based not only on medical literature, but also on clinical experience and individual information of patients (e.g. genomics, lifestyle,...

Full description

Bibliographic Details
Main Authors: Daniela D'Auria, Vincenzo Moscato, Marco Postiglione, Giuseppe Romito, Giancarlo Sperlí
Format: Article
Language:English
Published: Elsevier 2023-02-01
Series:Intelligent Systems with Applications
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2667305322000989
_version_ 1811170978669002752
author Daniela D'Auria
Vincenzo Moscato
Marco Postiglione
Giuseppe Romito
Giancarlo Sperlí
author_facet Daniela D'Auria
Vincenzo Moscato
Marco Postiglione
Giuseppe Romito
Giancarlo Sperlí
author_sort Daniela D'Auria
collection DOAJ
description The ever-increasing availability of Electronic Health Records (EHRs) is the key enabling factor of precision medicine, which aims to provide therapies and diagnoses based not only on medical literature, but also on clinical experience and individual information of patients (e.g. genomics, lifestyle, health history). The unstructured nature of EHRs has posed several challenges on their effective analysis, and heterogeneous graphs are the most suitable solution to handle the heterogeneity of information contained in EHRs. However, while EHRs are an extremely valuable data source, information from current medical literature has yet to be considered in clinical decision support systems. In this work, we build an heterogeneous graph from Italian EHRs provided by the Hospital of Naples Federico II, and we define a methodological workflow allowing us to predict the presence of a link between patients and diagnosed diseases. We empirically demonstrate that linking concepts to biomedical ontologies (e.g. UMLS, DBpedia) — which allow us to extract entities and relationships from medical literature — is significantly beneficial to our link-prediction workflow in terms of Area Under the ROC curve (AUC) and Mean Reciprocal Rank (MRR).
first_indexed 2024-04-10T17:05:48Z
format Article
id doaj.art-01a708583b38493a8bd0e4a833301554
institution Directory Open Access Journal
issn 2667-3053
language English
last_indexed 2024-04-10T17:05:48Z
publishDate 2023-02-01
publisher Elsevier
record_format Article
series Intelligent Systems with Applications
spelling doaj.art-01a708583b38493a8bd0e4a8333015542023-02-06T04:06:23ZengElsevierIntelligent Systems with Applications2667-30532023-02-0117200161Improving graph embeddings via entity linking: A case study on Italian clinical notesDaniela D'Auria0Vincenzo Moscato1Marco Postiglione2Giuseppe Romito3Giancarlo Sperlí4Faculty of Computer Science, Free University of Bozen-Bolzano, Bozen-Bolzano, 39100, ItalyUniversity of Naples “Federico II”, Dept. of Electrical Engineering and Information Technology (DIETI), Via Claudio 21, 80125, ItalyUniversity of Naples “Federico II”, Dept. of Electrical Engineering and Information Technology (DIETI), Via Claudio 21, 80125, ItalyUniversity of Naples “Federico II”, Dept. of Electrical Engineering and Information Technology (DIETI), Via Claudio 21, 80125, ItalyUniversity of Naples “Federico II”, Dept. of Electrical Engineering and Information Technology (DIETI), Via Claudio 21, 80125, Italy; Corresponding author.The ever-increasing availability of Electronic Health Records (EHRs) is the key enabling factor of precision medicine, which aims to provide therapies and diagnoses based not only on medical literature, but also on clinical experience and individual information of patients (e.g. genomics, lifestyle, health history). The unstructured nature of EHRs has posed several challenges on their effective analysis, and heterogeneous graphs are the most suitable solution to handle the heterogeneity of information contained in EHRs. However, while EHRs are an extremely valuable data source, information from current medical literature has yet to be considered in clinical decision support systems. In this work, we build an heterogeneous graph from Italian EHRs provided by the Hospital of Naples Federico II, and we define a methodological workflow allowing us to predict the presence of a link between patients and diagnosed diseases. We empirically demonstrate that linking concepts to biomedical ontologies (e.g. UMLS, DBpedia) — which allow us to extract entities and relationships from medical literature — is significantly beneficial to our link-prediction workflow in terms of Area Under the ROC curve (AUC) and Mean Reciprocal Rank (MRR).http://www.sciencedirect.com/science/article/pii/S2667305322000989Entity linkingGraph embeddingLink predictionHealth analyticsHealthcare
spellingShingle Daniela D'Auria
Vincenzo Moscato
Marco Postiglione
Giuseppe Romito
Giancarlo Sperlí
Improving graph embeddings via entity linking: A case study on Italian clinical notes
Intelligent Systems with Applications
Entity linking
Graph embedding
Link prediction
Health analytics
Healthcare
title Improving graph embeddings via entity linking: A case study on Italian clinical notes
title_full Improving graph embeddings via entity linking: A case study on Italian clinical notes
title_fullStr Improving graph embeddings via entity linking: A case study on Italian clinical notes
title_full_unstemmed Improving graph embeddings via entity linking: A case study on Italian clinical notes
title_short Improving graph embeddings via entity linking: A case study on Italian clinical notes
title_sort improving graph embeddings via entity linking a case study on italian clinical notes
topic Entity linking
Graph embedding
Link prediction
Health analytics
Healthcare
url http://www.sciencedirect.com/science/article/pii/S2667305322000989
work_keys_str_mv AT danieladauria improvinggraphembeddingsviaentitylinkingacasestudyonitalianclinicalnotes
AT vincenzomoscato improvinggraphembeddingsviaentitylinkingacasestudyonitalianclinicalnotes
AT marcopostiglione improvinggraphembeddingsviaentitylinkingacasestudyonitalianclinicalnotes
AT giusepperomito improvinggraphembeddingsviaentitylinkingacasestudyonitalianclinicalnotes
AT giancarlosperli improvinggraphembeddingsviaentitylinkingacasestudyonitalianclinicalnotes