Machine learning based framework for fine-grained word segmentation and enhanced text normalization for low resourced language

In text applications, pre-processing is deemed as a significant parameter to enhance the outcomes of natural language processing (NLP) chores. Text normalization and tokenization are two pivotal procedures of text pre-processing that cannot be overstated. Text normalization refers to transforming ra...

Full description

Bibliographic Details
Main Authors: Shahzad Nazir, Muhammad Asif, Mariam Rehman, Shahbaz Ahmad
Format: Article
Language:English
Published: PeerJ Inc. 2024-01-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1704.pdf