Chinese Named Entity Recognition Model Based on Multi-Task Learning

Compared to English, Chinese named entity recognition has lower performance due to the greater ambiguity in entity boundaries in Chinese text, making boundary prediction more difficult. While traditional models have attempted to enhance the definition of Chinese entity boundaries by incorporating ex...

Full description

Bibliographic Details
Main Authors: Qin Fang, Yane Li, Hailin Feng, Yaoping Ruan
Format: Article
Language:English
Published: MDPI AG 2023-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/8/4770
_version_ 1827746116801658880
author Qin Fang
Yane Li
Hailin Feng
Yaoping Ruan
author_facet Qin Fang
Yane Li
Hailin Feng
Yaoping Ruan
author_sort Qin Fang
collection DOAJ
description Compared to English, Chinese named entity recognition has lower performance due to the greater ambiguity in entity boundaries in Chinese text, making boundary prediction more difficult. While traditional models have attempted to enhance the definition of Chinese entity boundaries by incorporating external features such as lexicons or glyphs, they have rarely disentangled the entity boundary prediction problem for separate study. In order to leverage entity boundary information, the named entity recognition task has been decomposed into two subtasks: boundary annotation and type annotation, and a multi-task learning network (MTL-BERT) has been proposed that combines a bidirectional encoder (BERT) model. This network performs joint encoding and specific decoding of the subtasks, enhancing the model’s feature extraction abilities by reinforcing the feature associations between subtasks. Multiple sets of experiments conducted on Weibo NER, MSRA, and OntoNote4.0 public datasets show that the F1 values of MTL-BERT reach 73.8%, 96.5%, and 86.7%, respectively, effectively improving the performance and efficiency of Chinese named entity recognition tasks.
first_indexed 2024-03-11T05:16:57Z
format Article
id doaj.art-7838861899a448d098573eac2002cca5
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T05:16:57Z
publishDate 2023-04-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-7838861899a448d098573eac2002cca52023-11-17T18:08:59ZengMDPI AGApplied Sciences2076-34172023-04-01138477010.3390/app13084770Chinese Named Entity Recognition Model Based on Multi-Task LearningQin Fang0Yane Li1Hailin Feng2Yaoping Ruan3College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, ChinaCollege of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, ChinaCollege of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, ChinaCollege of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, ChinaCompared to English, Chinese named entity recognition has lower performance due to the greater ambiguity in entity boundaries in Chinese text, making boundary prediction more difficult. While traditional models have attempted to enhance the definition of Chinese entity boundaries by incorporating external features such as lexicons or glyphs, they have rarely disentangled the entity boundary prediction problem for separate study. In order to leverage entity boundary information, the named entity recognition task has been decomposed into two subtasks: boundary annotation and type annotation, and a multi-task learning network (MTL-BERT) has been proposed that combines a bidirectional encoder (BERT) model. This network performs joint encoding and specific decoding of the subtasks, enhancing the model’s feature extraction abilities by reinforcing the feature associations between subtasks. Multiple sets of experiments conducted on Weibo NER, MSRA, and OntoNote4.0 public datasets show that the F1 values of MTL-BERT reach 73.8%, 96.5%, and 86.7%, respectively, effectively improving the performance and efficiency of Chinese named entity recognition tasks.https://www.mdpi.com/2076-3417/13/8/4770multi-task learningChinese named entity recognitionjoint learningfeature interactionbi-directional encoder (BERT)
spellingShingle Qin Fang
Yane Li
Hailin Feng
Yaoping Ruan
Chinese Named Entity Recognition Model Based on Multi-Task Learning
Applied Sciences
multi-task learning
Chinese named entity recognition
joint learning
feature interaction
bi-directional encoder (BERT)
title Chinese Named Entity Recognition Model Based on Multi-Task Learning
title_full Chinese Named Entity Recognition Model Based on Multi-Task Learning
title_fullStr Chinese Named Entity Recognition Model Based on Multi-Task Learning
title_full_unstemmed Chinese Named Entity Recognition Model Based on Multi-Task Learning
title_short Chinese Named Entity Recognition Model Based on Multi-Task Learning
title_sort chinese named entity recognition model based on multi task learning
topic multi-task learning
Chinese named entity recognition
joint learning
feature interaction
bi-directional encoder (BERT)
url https://www.mdpi.com/2076-3417/13/8/4770
work_keys_str_mv AT qinfang chinesenamedentityrecognitionmodelbasedonmultitasklearning
AT yaneli chinesenamedentityrecognitionmodelbasedonmultitasklearning
AT hailinfeng chinesenamedentityrecognitionmodelbasedonmultitasklearning
AT yaopingruan chinesenamedentityrecognitionmodelbasedonmultitasklearning