A semantic union model for open domain Chinese knowledge base question answering


Bibliographic Details
Main Authors: Huibin Hao, Xiang-e Sun, Jian Wei
Format: Article
Language: English
Published: Nature Portfolio 2023-07-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-023-39252-w
Description
Summary: In Open-domain Chinese Knowledge Base Question Answering (ODCKBQA), most common simple questions can be answered by a single relational fact in the knowledge base (KB). Abbreviations, aliases, and nesting of entities in Chinese questions, together with the gap between those questions and the structured semantics of the KB, make it difficult for a system to return accurate answers. This study proposes a semantic union model (SUM) that concatenates candidate entities and candidate relations, uses a contrastive learning algorithm to learn semantic vector representations of the question and of each candidate entity-relation pair, and computes the cosine similarity between them to complete entity disambiguation and relation matching simultaneously. Because the relations attached to an entity supply information for disambiguating it, the model avoids error propagation and improves system performance. Experimental results show that the system achieves an average F1 of 85.94% on the dataset provided by the NLPCC-ICCPOL 2016 KBQA task.
ISSN: 2045-2322
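
The joint scoring step described in the summary can be illustrated with a minimal sketch. It is not the authors' implementation. The `embed` function is a toy character n-gram stand-in for the learned encoder, the names `rank_pairs` and `info_nce` are hypothetical, and the InfoNCE-style loss is only one common contrastive objective (the summary does not say which one the paper uses). The example question and candidates are invented for illustration.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in encoder (assumption): bag of character unigrams and bigrams.
    The paper learns this representation with contrastive training instead."""
    chars = [c for c in text if not c.isspace()]
    grams = chars + [a + b for a, b in zip(chars, chars[1:])]
    return Counter(grams)


def cosine(u: Counter, v: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_pairs(question: str, candidates: list[tuple[str, str]]):
    """Score each concatenated (entity, relation) pair against the question,
    so entity disambiguation and relation matching happen in one step."""
    q = embed(question)
    scored = [(cosine(q, embed(entity + relation)), entity, relation)
              for entity, relation in candidates]
    return sorted(scored, reverse=True)


def info_nce(pos_sim: float, neg_sims: list[float], temperature: float = 0.05) -> float:
    """InfoNCE-style contrastive loss (assumed objective): small when the
    correct pair is more similar to the question than the negatives."""
    logits = [pos_sim / temperature] + [s / temperature for s in neg_sims]
    peak = max(logits)  # subtract the max for numerical stability
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_sum - logits[0]


if __name__ == "__main__":
    question = "长江的长度是多少"  # "How long is the Yangtze River?"
    candidates = [("长江", "长度"), ("长江", "流域面积"), ("长江大桥", "长度")]
    for score, entity, relation in rank_pairs(question, candidates):
        print(f"{score:.3f}  {entity}  {relation}")
```

With this toy encoder the pair (长江, 长度) ranks above the nested entity 长江大桥, because the relation text contributes to the score of the whole pair. A learned encoder would replace `embed`, and the rest of the ranking logic would stay the same.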