Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label

The construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and extern...

Full description

Bibliographic Details
Main Authors: Guangya Yu, Qi Ye, Tong Ruan
Format: Article
Language:English
Published: MDPI AG 2024-02-01
Series:Bioengineering
Subjects:
Online Access:https://www.mdpi.com/2306-5354/11/3/225
_version_ 1797241997040287744
author Guangya Yu
Qi Ye
Tong Ruan
author_facet Guangya Yu
Qi Ye
Tong Ruan
author_sort Guangya Yu
collection DOAJ
description The construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and external labels of entities in MKGs to improve their quality. Nevertheless, due to the cost of manual annotation and imperfect automatic algorithms, precise entity labels in MKGs cannot be readily obtained. To address these issues, we propose an approach named Enhancing error detection on Medical knowledge graphs via intrinsic labEL (EMKGEL). Considering the absence of hyper-view KG, we establish a hyper-view KG and a triplet-level KG for implicit label information and neighborhood information, respectively. Inspired by the success of graph attention networks (GATs), we introduce the hyper-view GAT to incorporate label messages and neighborhood information into representation learning. We leverage a confidence score that combines local and global trustworthiness to estimate the triplets. To validate the effectiveness of our approach, we conducted experiments on three publicly available MKGs, namely PharmKG-8k, DiseaseKG, and DiaKG. Compared with the baseline models, the Precision@K value improved by 0.7%, 6.1%, and 3.6%, respectively, on these datasets. Furthermore, our method empirically showed that it significantly outperformed the baseline on a general knowledge graph, Nell-995.
first_indexed 2024-04-24T18:32:12Z
format Article
id doaj.art-536a75f02a5b4ce784aef1f321d0572c
institution Directory Open Access Journal
issn 2306-5354
language English
last_indexed 2024-04-24T18:32:12Z
publishDate 2024-02-01
publisher MDPI AG
record_format Article
series Bioengineering
spelling doaj.art-536a75f02a5b4ce784aef1f321d0572c2024-03-27T13:21:48ZengMDPI AGBioengineering2306-53542024-02-0111322510.3390/bioengineering11030225Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic LabelGuangya Yu0Qi Ye1Tong Ruan2Zhejiang Laboratory, Hangzhou 311121, ChinaSchool of Information Science and Technology, East China University of Science and Technology, Shanghai 200237, ChinaSchool of Information Science and Technology, East China University of Science and Technology, Shanghai 200237, ChinaThe construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and external labels of entities in MKGs to improve their quality. Nevertheless, due to the cost of manual annotation and imperfect automatic algorithms, precise entity labels in MKGs cannot be readily obtained. To address these issues, we propose an approach named Enhancing error detection on Medical knowledge graphs via intrinsic labEL (EMKGEL). Considering the absence of hyper-view KG, we establish a hyper-view KG and a triplet-level KG for implicit label information and neighborhood information, respectively. Inspired by the success of graph attention networks (GATs), we introduce the hyper-view GAT to incorporate label messages and neighborhood information into representation learning. We leverage a confidence score that combines local and global trustworthiness to estimate the triplets. To validate the effectiveness of our approach, we conducted experiments on three publicly available MKGs, namely PharmKG-8k, DiseaseKG, and DiaKG. Compared with the baseline models, the Precision@K value improved by 0.7%, 6.1%, and 3.6%, respectively, on these datasets. Furthermore, our method empirically showed that it significantly outperformed the baseline on a general knowledge graph, Nell-995.https://www.mdpi.com/2306-5354/11/3/225medical knowledge grapherror detectionconfidence scoregraph attention network
spellingShingle Guangya Yu
Qi Ye
Tong Ruan
Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
Bioengineering
medical knowledge graph
error detection
confidence score
graph attention network
title Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_full Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_fullStr Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_full_unstemmed Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_short Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_sort enhancing error detection on medical knowledge graphs via intrinsic label
topic medical knowledge graph
error detection
confidence score
graph attention network
url https://www.mdpi.com/2306-5354/11/3/225
work_keys_str_mv AT guangyayu enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel
AT qiye enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel
AT tongruan enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel