Tokenizers for African Languages

Despite incredible development in the field of natural language processing (NLP), there has been a huge gap in the performance of NLP tasks between high-resource languages (HRLs) and low-resource languages (LRLs). African languages belong mainly to the LRLs, and one of the major contributing factors...

Full description

Bibliographic Details
Main Authors: Goodwill Erasmo Ndomba, Medard Edmund Mswahili, Young-Seob Jeong
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10815724/