Enhancing Korean Named Entity Recognition With Linguistic Tokenization Strategies

Tokenization is a significant primary step for the training of the Pre-trained Language Model (PLM), which alleviates the challenging Out-of-Vocabulary problem in the area of Natural Language Processing. As tokenization strategies can change linguistic understanding, it is essential to consider the...

Full description

Bibliographic Details
Main Authors: Gyeongmin Kim, Junyoung Son, Jinsung Kim, Hyunhee Lee, Heuiseok Lim
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9610031/