Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture

The domestic population has paid increasing attention to ancient Chinese history and culture with the continuous improvement of people’s living standards, the rapid economic growth, and the rapid advancement of information science and technology. The use of information technology has been proven to...

Full description

Bibliographic Details
Main Authors: Shuang Liu, Hui Yang, Jiayi Li, Simon Kolmanič
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/11/4/186
_version_ 1797571919889825792
author Shuang Liu
Hui Yang
Jiayi Li
Simon Kolmanič
author_facet Shuang Liu
Hui Yang
Jiayi Li
Simon Kolmanič
author_sort Shuang Liu
collection DOAJ
description The domestic population has paid increasing attention to ancient Chinese history and culture with the continuous improvement of people’s living standards, the rapid economic growth, and the rapid advancement of information science and technology. The use of information technology has been proven to promote the spread and development of historical culture, and it is becoming a necessary means to promote our traditional culture. This paper will build a knowledge graph of ancient Chinese history and culture in order to facilitate the public to more quickly and accurately understand the relevant knowledge of ancient Chinese history and culture. The construction process is as follows: firstly, use crawler technology to obtain text and table data related to ancient history and culture on Baidu Encyclopedia (similar to Wikipedia) and ancient Chinese history and culture related pages. Among them, the crawler technology crawls the semi-structured data in the information box (InfoBox) in the Baidu Encyclopedia to directly construct the triples required for the knowledge graph, crawls the introductory text information of the entries in Baidu Encyclopedia, and specialized historical and cultural websites (history Chunqiu.com, On History.com) to extract unstructured entities and relationships. Secondly, entity recognition and relationship extraction are performed on an unstructured text. The entity recognition part uses the Bidirectional Long Short-Term Memory-Convolutional Neural Networks-Conditions Random Field (BiLSTM-CNN-CRF) model for entity extraction. The relationship extraction between entities is performed by using the open source tool DeepKE (information extraction tool with language recognition ability developed by Zhejiang University) to extract the relationships between entities. After obtaining the entity and the relationship between the entities, supplement it with the triple data that were constructed from the semi-structured data in the existing knowledge base and Baidu Encyclopedia information box. Subsequently, the ontology construction and the quality evaluation of the entire constructed knowledge graph are performed to form the final knowledge graph of ancient Chinese history and culture.
first_indexed 2024-03-10T20:48:25Z
format Article
id doaj.art-ff50a69e0d9a4b9695994e7fb6de1fee
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-10T20:48:25Z
publishDate 2020-03-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-ff50a69e0d9a4b9695994e7fb6de1fee2023-11-19T20:11:44ZengMDPI AGInformation2078-24892020-03-0111418610.3390/info11040186Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and CultureShuang Liu0Hui Yang1Jiayi Li2Simon Kolmanič3School of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, ChinaSchool of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, ChinaSchool of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, ChinaFaculty of Electrical Engineering and Computer Science, University of Maribor, Koroska cesta 46, SI-2000 Maribor, SloveniaThe domestic population has paid increasing attention to ancient Chinese history and culture with the continuous improvement of people’s living standards, the rapid economic growth, and the rapid advancement of information science and technology. The use of information technology has been proven to promote the spread and development of historical culture, and it is becoming a necessary means to promote our traditional culture. This paper will build a knowledge graph of ancient Chinese history and culture in order to facilitate the public to more quickly and accurately understand the relevant knowledge of ancient Chinese history and culture. The construction process is as follows: firstly, use crawler technology to obtain text and table data related to ancient history and culture on Baidu Encyclopedia (similar to Wikipedia) and ancient Chinese history and culture related pages. Among them, the crawler technology crawls the semi-structured data in the information box (InfoBox) in the Baidu Encyclopedia to directly construct the triples required for the knowledge graph, crawls the introductory text information of the entries in Baidu Encyclopedia, and specialized historical and cultural websites (history Chunqiu.com, On History.com) to extract unstructured entities and relationships. Secondly, entity recognition and relationship extraction are performed on an unstructured text. The entity recognition part uses the Bidirectional Long Short-Term Memory-Convolutional Neural Networks-Conditions Random Field (BiLSTM-CNN-CRF) model for entity extraction. The relationship extraction between entities is performed by using the open source tool DeepKE (information extraction tool with language recognition ability developed by Zhejiang University) to extract the relationships between entities. After obtaining the entity and the relationship between the entities, supplement it with the triple data that were constructed from the semi-structured data in the existing knowledge base and Baidu Encyclopedia information box. Subsequently, the ontology construction and the quality evaluation of the entire constructed knowledge graph are performed to form the final knowledge graph of ancient Chinese history and culture.https://www.mdpi.com/2078-2489/11/4/186knowledge graphancient history and cultureknowledge extractionnamed entity recognitionvisual display
spellingShingle Shuang Liu
Hui Yang
Jiayi Li
Simon Kolmanič
Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
Information
knowledge graph
ancient history and culture
knowledge extraction
named entity recognition
visual display
title Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
title_full Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
title_fullStr Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
title_full_unstemmed Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
title_short Preliminary Study on the Knowledge Graph Construction of Chinese Ancient History and Culture
title_sort preliminary study on the knowledge graph construction of chinese ancient history and culture
topic knowledge graph
ancient history and culture
knowledge extraction
named entity recognition
visual display
url https://www.mdpi.com/2078-2489/11/4/186
work_keys_str_mv AT shuangliu preliminarystudyontheknowledgegraphconstructionofchineseancienthistoryandculture
AT huiyang preliminarystudyontheknowledgegraphconstructionofchineseancienthistoryandculture
AT jiayili preliminarystudyontheknowledgegraphconstructionofchineseancienthistoryandculture
AT simonkolmanic preliminarystudyontheknowledgegraphconstructionofchineseancienthistoryandculture