COTA 2.0: an Automatic Corrector of Tunisian Arabic Social Media Texts

In written text, orthographic noise is a common concern for NLP, especially when processing social network comments and raw documents. This is mainly due to its orthographic conventions and morphological ambiguity. We propose to automatically normalize social media dialect corpora by following CO...

Bibliographic Details
Main Authors: Asma Mekki, Inès Zribi, Mariem Ellouze, Lamia Hadrich Belguith
Format: Article
Language: English
Published: Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT) 2022-12-01
Series: Jordanian Journal of Computers and Information Technology
Online Access: http://www.ejmanager.com/fulltextpdf.php?mno=61784