A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea

Toponymic entity recognition is currently a critical research hotspot in knowledge graphs. Under the guidance of the national ancient book protection policy and the promotion of the wave of digital humanities research, this paper proposes a toponymic entity recognition model (ALBERT-Conv1D-BiLSTM-CR...

Full description

Bibliographic Details
Main Authors: Yinwei Wei, Yihong Li, Xiaoyi Zhou
Format: Article
Language:English
Published: MDPI AG 2023-12-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/13/1/4
_version_ 1797358934987636736
author Yinwei Wei
Yihong Li
Xiaoyi Zhou
author_facet Yinwei Wei
Yihong Li
Xiaoyi Zhou
author_sort Yinwei Wei
collection DOAJ
description Toponymic entity recognition is currently a critical research hotspot in knowledge graphs. Under the guidance of the national ancient book protection policy and the promotion of the wave of digital humanities research, this paper proposes a toponymic entity recognition model (ALBERT-Conv1D-BiLSTM-CRF) based on the fusion of a pre-trained language model and local features to address the problems of toponymic ambiguity and the differences in ancient and modern grammatical structures in the field of the Genglubu. This model extracts global features with the ALBERT module, fuses global and local features with the Conv1D module, performs sequence modeling with the BiLSTM module to capture deep semantics and long-distance dependency information, and finally, completes sequence annotation with the CRF module. The experiments show that while taking into account the computational resources and cost, this improved model is significantly improved compared with the benchmark model (ALBERT-BiLSTM-CRF), and the precision, recall, and F1 are increased by 0.74%, 1.28%, and 1.01% to 98.08%, 96.67%, and 97.37%, respectively. The model achieved good results in the field of Genglubu.
first_indexed 2024-03-08T15:09:21Z
format Article
id doaj.art-600c021fa8d2487bb0e63355669a5eab
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-08T15:09:21Z
publishDate 2023-12-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-600c021fa8d2487bb0e63355669a5eab2024-01-10T14:54:04ZengMDPI AGElectronics2079-92922023-12-01131410.3390/electronics13010004A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China SeaYinwei Wei0Yihong Li1Xiaoyi Zhou2School of Cyberspace Security, Hainan University, Haikou 570228, ChinaSchool of Cyberspace Security, Hainan University, Haikou 570228, ChinaSchool of Cyberspace Security, Hainan University, Haikou 570228, ChinaToponymic entity recognition is currently a critical research hotspot in knowledge graphs. Under the guidance of the national ancient book protection policy and the promotion of the wave of digital humanities research, this paper proposes a toponymic entity recognition model (ALBERT-Conv1D-BiLSTM-CRF) based on the fusion of a pre-trained language model and local features to address the problems of toponymic ambiguity and the differences in ancient and modern grammatical structures in the field of the Genglubu. This model extracts global features with the ALBERT module, fuses global and local features with the Conv1D module, performs sequence modeling with the BiLSTM module to capture deep semantics and long-distance dependency information, and finally, completes sequence annotation with the CRF module. The experiments show that while taking into account the computational resources and cost, this improved model is significantly improved compared with the benchmark model (ALBERT-BiLSTM-CRF), and the precision, recall, and F1 are increased by 0.74%, 1.28%, and 1.01% to 98.08%, 96.67%, and 97.37%, respectively. The model achieved good results in the field of Genglubu.https://www.mdpi.com/2079-9292/13/1/4toponymic entity recognitionGenglubu corpuspre-trained language modellocal featuredigital humanities
spellingShingle Yinwei Wei
Yihong Li
Xiaoyi Zhou
A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
Electronics
toponymic entity recognition
Genglubu corpus
pre-trained language model
local feature
digital humanities
title A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
title_full A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
title_fullStr A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
title_full_unstemmed A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
title_short A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea
title_sort study on toponymic entity recognition based on pre trained models fused with local features for genglubu in the south china sea
topic toponymic entity recognition
Genglubu corpus
pre-trained language model
local feature
digital humanities
url https://www.mdpi.com/2079-9292/13/1/4
work_keys_str_mv AT yinweiwei astudyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea
AT yihongli astudyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea
AT xiaoyizhou astudyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea
AT yinweiwei studyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea
AT yihongli studyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea
AT xiaoyizhou studyontoponymicentityrecognitionbasedonpretrainedmodelsfusedwithlocalfeaturesforgenglubuinthesouthchinasea