Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer

The task of named entity recognition (NER) is to identify entities in the text and predict their categories. In real-life scenarios, the context of the text is often complex, and there may exist nested entities within an entity. This kind of entity is called a nested entity, and the task of recogniz...

Full description

Bibliographic Details
Main Authors: Weijun Li, Jintong Liu, Yuxiao Gao, Xinyong Zhang, Jianlai Gu
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Applied System Innovation
Subjects:
Online Access:https://www.mdpi.com/2571-5577/7/1/8
_version_ 1797298981577949184
author Weijun Li
Jintong Liu
Yuxiao Gao
Xinyong Zhang
Jianlai Gu
author_facet Weijun Li
Jintong Liu
Yuxiao Gao
Xinyong Zhang
Jianlai Gu
author_sort Weijun Li
collection DOAJ
description The task of named entity recognition (NER) is to identify entities in the text and predict their categories. In real-life scenarios, the context of the text is often complex, and there may exist nested entities within an entity. This kind of entity is called a nested entity, and the task of recognizing entities with nested structures is referred to as nested named entity recognition. Most existing NER models can only handle flat entities, and there has been limited research progress in Chinese nested named entity recognition, resulting in relatively few models in this direction. General NER models have limited semantic extraction capabilities and cannot capture deep semantic information between nested entities in the text. To address these issues, this paper proposes a model that uses the GlobalPointer module to identify nested entities in the text and constructs the IDCNNLR semantic extraction module to extract deep semantic information. Furthermore, multiple-head self-attention mechanisms are incorporated into the model at multiple positions to achieve data denoising, enhancing the quality of semantic features. The proposed model considers each possible entity boundary through the GlobalPointer module, and the IDCNNLR semantic extraction module and multi-position attention mechanism are introduced to enhance the model’s semantic extraction capability. Experimental results demonstrate that the proposed model achieves <i>F1</i> scores of 69.617% and 79.285% on the CMeEE Chinese nested entity recognition dataset and CLUENER2020 Chinese fine-grained entity recognition dataset, respectively. The model exhibits improvement compared to baseline models, and each innovation point shows effective performance enhancement in ablative experiments.
first_indexed 2024-03-07T22:42:54Z
format Article
id doaj.art-b67970492f124d0a8571f60ac3c8aceb
institution Directory Open Access Journal
issn 2571-5577
language English
last_indexed 2024-03-07T22:42:54Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Applied System Innovation
spelling doaj.art-b67970492f124d0a8571f60ac3c8aceb2024-02-23T15:06:57ZengMDPI AGApplied System Innovation2571-55772024-01-0171810.3390/asi7010008Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointerWeijun Li0Jintong Liu1Yuxiao Gao2Xinyong Zhang3Jianlai Gu4School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, ChinaSchool of Computer Science and Engineering, North Minzu University, Yinchuan 750021, ChinaSchool of Computer Science and Engineering, North Minzu University, Yinchuan 750021, ChinaSchool of Computer Science and Engineering, North Minzu University, Yinchuan 750021, ChinaSchool of Computer Science and Engineering, North Minzu University, Yinchuan 750021, ChinaThe task of named entity recognition (NER) is to identify entities in the text and predict their categories. In real-life scenarios, the context of the text is often complex, and there may exist nested entities within an entity. This kind of entity is called a nested entity, and the task of recognizing entities with nested structures is referred to as nested named entity recognition. Most existing NER models can only handle flat entities, and there has been limited research progress in Chinese nested named entity recognition, resulting in relatively few models in this direction. General NER models have limited semantic extraction capabilities and cannot capture deep semantic information between nested entities in the text. To address these issues, this paper proposes a model that uses the GlobalPointer module to identify nested entities in the text and constructs the IDCNNLR semantic extraction module to extract deep semantic information. Furthermore, multiple-head self-attention mechanisms are incorporated into the model at multiple positions to achieve data denoising, enhancing the quality of semantic features. The proposed model considers each possible entity boundary through the GlobalPointer module, and the IDCNNLR semantic extraction module and multi-position attention mechanism are introduced to enhance the model’s semantic extraction capability. Experimental results demonstrate that the proposed model achieves <i>F1</i> scores of 69.617% and 79.285% on the CMeEE Chinese nested entity recognition dataset and CLUENER2020 Chinese fine-grained entity recognition dataset, respectively. The model exhibits improvement compared to baseline models, and each innovation point shows effective performance enhancement in ablative experiments.https://www.mdpi.com/2571-5577/7/1/8named entity recognitionnatural language processingdeep neural networksfeature extractionknowledge graph
spellingShingle Weijun Li
Jintong Liu
Yuxiao Gao
Xinyong Zhang
Jianlai Gu
Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
Applied System Innovation
named entity recognition
natural language processing
deep neural networks
feature extraction
knowledge graph
title Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
title_full Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
title_fullStr Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
title_full_unstemmed Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
title_short Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
title_sort research on chinese nested entity recognition based on idcnnlr and globalpointer
topic named entity recognition
natural language processing
deep neural networks
feature extraction
knowledge graph
url https://www.mdpi.com/2571-5577/7/1/8
work_keys_str_mv AT weijunli researchonchinesenestedentityrecognitionbasedonidcnnlrandglobalpointer
AT jintongliu researchonchinesenestedentityrecognitionbasedonidcnnlrandglobalpointer
AT yuxiaogao researchonchinesenestedentityrecognitionbasedonidcnnlrandglobalpointer
AT xinyongzhang researchonchinesenestedentityrecognitionbasedonidcnnlrandglobalpointer
AT jianlaigu researchonchinesenestedentityrecognitionbasedonidcnnlrandglobalpointer