Few-Shot Relation Extraction on Ancient Chinese Documents

Traditional humanity scholars’ inefficient method of utilizing numerous unstructured data has hampered studies on ancient Chinese writings for several years. In this work, we aim to develop a relation extractor for ancient Chinese documents to automatically extract the relations by using unstructure...

Full description

Bibliographic Details
Main Authors: Bo Li, Jiyu Wei, Yang Liu, Yuze Chen, Xi Fang, Bin Jiang
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/24/12060
_version_ 1827674272920764416
author Bo Li
Jiyu Wei
Yang Liu
Yuze Chen
Xi Fang
Bin Jiang
author_facet Bo Li
Jiyu Wei
Yang Liu
Yuze Chen
Xi Fang
Bin Jiang
author_sort Bo Li
collection DOAJ
description Traditional humanity scholars’ inefficient method of utilizing numerous unstructured data has hampered studies on ancient Chinese writings for several years. In this work, we aim to develop a relation extractor for ancient Chinese documents to automatically extract the relations by using unstructured data. To achieve this goal, we proposed a tiny ancient Chinese document relation classification (TinyACD-RC) dataset annotated by historians and contains 32 types of general relations in ShihChi (a famous Chinese history book). We also explored several methods and proposed a novel model that works well on sufficient and insufficient data scenarios, the proposed sentence encoder can simultaneously capture local and global features for a certain period. The paired attention network enhances and extracts relations between support and query instances. Experimental results show that our model achieved promising performance with scarce corpus. We also examined our model on the FewRel dataset and found that outperformed the state-of-the-art no pretraining-based models by 2.27%.
first_indexed 2024-03-10T04:37:12Z
format Article
id doaj.art-fbfe33caad3149179c9805e4e1de784d
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T04:37:12Z
publishDate 2021-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-fbfe33caad3149179c9805e4e1de784d2023-11-23T03:42:37ZengMDPI AGApplied Sciences2076-34172021-12-0111241206010.3390/app112412060Few-Shot Relation Extraction on Ancient Chinese DocumentsBo Li0Jiyu Wei1Yang Liu2Yuze Chen3Xi Fang4Bin Jiang5School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaTraditional humanity scholars’ inefficient method of utilizing numerous unstructured data has hampered studies on ancient Chinese writings for several years. In this work, we aim to develop a relation extractor for ancient Chinese documents to automatically extract the relations by using unstructured data. To achieve this goal, we proposed a tiny ancient Chinese document relation classification (TinyACD-RC) dataset annotated by historians and contains 32 types of general relations in ShihChi (a famous Chinese history book). We also explored several methods and proposed a novel model that works well on sufficient and insufficient data scenarios, the proposed sentence encoder can simultaneously capture local and global features for a certain period. The paired attention network enhances and extracts relations between support and query instances. Experimental results show that our model achieved promising performance with scarce corpus. We also examined our model on the FewRel dataset and found that outperformed the state-of-the-art no pretraining-based models by 2.27%.https://www.mdpi.com/2076-3417/11/24/12060ancient Chinese documentrelation extractionfew-shot learningsentence encoderdigital humanity
spellingShingle Bo Li
Jiyu Wei
Yang Liu
Yuze Chen
Xi Fang
Bin Jiang
Few-Shot Relation Extraction on Ancient Chinese Documents
Applied Sciences
ancient Chinese document
relation extraction
few-shot learning
sentence encoder
digital humanity
title Few-Shot Relation Extraction on Ancient Chinese Documents
title_full Few-Shot Relation Extraction on Ancient Chinese Documents
title_fullStr Few-Shot Relation Extraction on Ancient Chinese Documents
title_full_unstemmed Few-Shot Relation Extraction on Ancient Chinese Documents
title_short Few-Shot Relation Extraction on Ancient Chinese Documents
title_sort few shot relation extraction on ancient chinese documents
topic ancient Chinese document
relation extraction
few-shot learning
sentence encoder
digital humanity
url https://www.mdpi.com/2076-3417/11/24/12060
work_keys_str_mv AT boli fewshotrelationextractiononancientchinesedocuments
AT jiyuwei fewshotrelationextractiononancientchinesedocuments
AT yangliu fewshotrelationextractiononancientchinesedocuments
AT yuzechen fewshotrelationextractiononancientchinesedocuments
AT xifang fewshotrelationextractiononancientchinesedocuments
AT binjiang fewshotrelationextractiononancientchinesedocuments