Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label

The construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and extern...

Full description

Bibliographic Details
Main Authors:	Guangya Yu, Qi Ye, Tong Ruan
Format:	Article
Language:	English
Published:	MDPI AG 2024-02-01
Series:	Bioengineering
Subjects:	medical knowledge graph error detection confidence score graph attention network
Online Access:	https://www.mdpi.com/2306-5354/11/3/225

_version_	1827307017447931904
author	Guangya Yu Qi Ye Tong Ruan
author_facet	Guangya Yu Qi Ye Tong Ruan
author_sort	Guangya Yu
collection	DOAJ
description	The construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and external labels of entities in MKGs to improve their quality. Nevertheless, due to the cost of manual annotation and imperfect automatic algorithms, precise entity labels in MKGs cannot be readily obtained. To address these issues, we propose an approach named Enhancing error detection on Medical knowledge graphs via intrinsic labEL (EMKGEL). Considering the absence of hyper-view KG, we establish a hyper-view KG and a triplet-level KG for implicit label information and neighborhood information, respectively. Inspired by the success of graph attention networks (GATs), we introduce the hyper-view GAT to incorporate label messages and neighborhood information into representation learning. We leverage a confidence score that combines local and global trustworthiness to estimate the triplets. To validate the effectiveness of our approach, we conducted experiments on three publicly available MKGs, namely PharmKG-8k, DiseaseKG, and DiaKG. Compared with the baseline models, the Precision@K value improved by 0.7%, 6.1%, and 3.6%, respectively, on these datasets. Furthermore, our method empirically showed that it significantly outperformed the baseline on a general knowledge graph, Nell-995.
first_indexed	2024-04-24T18:32:12Z
format	Article
id	doaj.art-536a75f02a5b4ce784aef1f321d0572c
institution	Directory Open Access Journal
issn	2306-5354
language	English
last_indexed	2024-04-24T18:32:12Z
publishDate	2024-02-01
publisher	MDPI AG
record_format	Article
series	Bioengineering
spelling	doaj.art-536a75f02a5b4ce784aef1f321d0572c2024-03-27T13:21:48ZengMDPI AGBioengineering2306-53542024-02-0111322510.3390/bioengineering11030225Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic LabelGuangya Yu0Qi Ye1Tong Ruan2Zhejiang Laboratory, Hangzhou 311121, ChinaSchool of Information Science and Technology, East China University of Science and Technology, Shanghai 200237, ChinaSchool of Information Science and Technology, East China University of Science and Technology, Shanghai 200237, ChinaThe construction of medical knowledge graphs (MKGs) is steadily progressing from manual to automatic methods, which inevitably introduce noise, which could impair the performance of downstream healthcare applications. Existing error detection approaches depend on the topological structure and external labels of entities in MKGs to improve their quality. Nevertheless, due to the cost of manual annotation and imperfect automatic algorithms, precise entity labels in MKGs cannot be readily obtained. To address these issues, we propose an approach named Enhancing error detection on Medical knowledge graphs via intrinsic labEL (EMKGEL). Considering the absence of hyper-view KG, we establish a hyper-view KG and a triplet-level KG for implicit label information and neighborhood information, respectively. Inspired by the success of graph attention networks (GATs), we introduce the hyper-view GAT to incorporate label messages and neighborhood information into representation learning. We leverage a confidence score that combines local and global trustworthiness to estimate the triplets. To validate the effectiveness of our approach, we conducted experiments on three publicly available MKGs, namely PharmKG-8k, DiseaseKG, and DiaKG. Compared with the baseline models, the Precision@K value improved by 0.7%, 6.1%, and 3.6%, respectively, on these datasets. Furthermore, our method empirically showed that it significantly outperformed the baseline on a general knowledge graph, Nell-995.https://www.mdpi.com/2306-5354/11/3/225medical knowledge grapherror detectionconfidence scoregraph attention network
spellingShingle	Guangya Yu Qi Ye Tong Ruan Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label Bioengineering medical knowledge graph error detection confidence score graph attention network
title	Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_full	Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_fullStr	Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_full_unstemmed	Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_short	Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label
title_sort	enhancing error detection on medical knowledge graphs via intrinsic label
topic	medical knowledge graph error detection confidence score graph attention network
url	https://www.mdpi.com/2306-5354/11/3/225
work_keys_str_mv	AT guangyayu enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel AT qiye enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel AT tongruan enhancingerrordetectiononmedicalknowledgegraphsviaintrinsiclabel

Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic Label

Similar Items