Rule-based data augmentation for knowledge graph embedding

Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observ...

Full description

Bibliographic Details
Main Authors:	Guangyao Li, Zequn Sun, Lei Qian, Qiang Guo, Wei Hu
Format:	Article
Language:	English
Published:	KeAi Communications Co. Ltd. 2021-01-01
Series:	AI Open
Subjects:	Knowledge graph embedding Data augmentation Logical rules
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666651021000267

_version_	1831651703106043904
author	Guangyao Li Zequn Sun Lei Qian Qiang Guo Wei Hu
author_facet	Guangyao Li Zequn Sun Lei Qian Qiang Guo Wei Hu
author_sort	Guangyao Li
collection	DOAJ
description	Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human-understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding-based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
first_indexed	2024-12-19T15:33:49Z
format	Article
id	doaj.art-35b8d65e446149a3b02c28e8a09c76c2
institution	Directory Open Access Journal
issn	2666-6510
language	English
last_indexed	2024-12-19T15:33:49Z
publishDate	2021-01-01
publisher	KeAi Communications Co. Ltd.
record_format	Article
series	AI Open
spelling	doaj.art-35b8d65e446149a3b02c28e8a09c76c22022-12-21T20:15:39ZengKeAi Communications Co. Ltd.AI Open2666-65102021-01-012186196Rule-based data augmentation for knowledge graph embeddingGuangyao Li0Zequn Sun1Lei Qian2Qiang Guo3Wei Hu4State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, ChinaState Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, ChinaState Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, ChinaState Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, ChinaState Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China; State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, China; Corresponding author. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China.Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human-understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding-based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.http://www.sciencedirect.com/science/article/pii/S2666651021000267Knowledge graph embeddingData augmentationLogical rules
spellingShingle	Guangyao Li Zequn Sun Lei Qian Qiang Guo Wei Hu Rule-based data augmentation for knowledge graph embedding AI Open Knowledge graph embedding Data augmentation Logical rules
title	Rule-based data augmentation for knowledge graph embedding
title_full	Rule-based data augmentation for knowledge graph embedding
title_fullStr	Rule-based data augmentation for knowledge graph embedding
title_full_unstemmed	Rule-based data augmentation for knowledge graph embedding
title_short	Rule-based data augmentation for knowledge graph embedding
title_sort	rule based data augmentation for knowledge graph embedding
topic	Knowledge graph embedding Data augmentation Logical rules
url	http://www.sciencedirect.com/science/article/pii/S2666651021000267
work_keys_str_mv	AT guangyaoli rulebaseddataaugmentationforknowledgegraphembedding AT zequnsun rulebaseddataaugmentationforknowledgegraphembedding AT leiqian rulebaseddataaugmentationforknowledgegraphembedding AT qiangguo rulebaseddataaugmentationforknowledgegraphembedding AT weihu rulebaseddataaugmentationforknowledgegraphembedding

Rule-based data augmentation for knowledge graph embedding

Similar Items