Rule-based data augmentation for knowledge graph embedding

Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observ...

Full description

Bibliographic Details
Main Authors: Guangyao Li, Zequn Sun, Lei Qian, Qiang Guo, Wei Hu
Format: Article
Language:English
Published: KeAi Communications Co. Ltd. 2021-01-01
Series:AI Open
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666651021000267
_version_ 1831651703106043904
author Guangyao Li
Zequn Sun
Lei Qian
Qiang Guo
Wei Hu
author_facet Guangyao Li
Zequn Sun
Lei Qian
Qiang Guo
Wei Hu
author_sort Guangyao Li
collection DOAJ
description Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human-understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding-based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
first_indexed 2024-12-19T15:33:49Z
format Article
id doaj.art-35b8d65e446149a3b02c28e8a09c76c2
institution Directory Open Access Journal
issn 2666-6510
language English
last_indexed 2024-12-19T15:33:49Z
publishDate 2021-01-01
publisher KeAi Communications Co. Ltd.
record_format Article
series AI Open
spelling doaj.art-35b8d65e446149a3b02c28e8a09c76c22022-12-21T20:15:39ZengKeAi Communications Co. Ltd.AI Open2666-65102021-01-012186196Rule-based data augmentation for knowledge graph embeddingGuangyao Li0Zequn Sun1Lei Qian2Qiang Guo3Wei Hu4State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, ChinaState Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, ChinaState Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, ChinaState Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, ChinaState Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China; State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, China; Corresponding author. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China.Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human-understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding-based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.http://www.sciencedirect.com/science/article/pii/S2666651021000267Knowledge graph embeddingData augmentationLogical rules
spellingShingle Guangyao Li
Zequn Sun
Lei Qian
Qiang Guo
Wei Hu
Rule-based data augmentation for knowledge graph embedding
AI Open
Knowledge graph embedding
Data augmentation
Logical rules
title Rule-based data augmentation for knowledge graph embedding
title_full Rule-based data augmentation for knowledge graph embedding
title_fullStr Rule-based data augmentation for knowledge graph embedding
title_full_unstemmed Rule-based data augmentation for knowledge graph embedding
title_short Rule-based data augmentation for knowledge graph embedding
title_sort rule based data augmentation for knowledge graph embedding
topic Knowledge graph embedding
Data augmentation
Logical rules
url http://www.sciencedirect.com/science/article/pii/S2666651021000267
work_keys_str_mv AT guangyaoli rulebaseddataaugmentationforknowledgegraphembedding
AT zequnsun rulebaseddataaugmentationforknowledgegraphembedding
AT leiqian rulebaseddataaugmentationforknowledgegraphembedding
AT qiangguo rulebaseddataaugmentationforknowledgegraphembedding
AT weihu rulebaseddataaugmentationforknowledgegraphembedding