COTA 2.0: an Automatic Corrector of Tunisian Arabic Social Media Texts
In written text, orthographic noise is a common concern for NLP, especially when processing social network comments and raw documents. This is mainly due to the dialect's orthographic conventions and morphological ambiguity. We propose to automatically normalize social media dialect corpora by following CO...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT), 2022-12-01 |
Series: | Jordanian Journal of Computers and Information Technology |
Online Access: | http://www.ejmanager.com/fulltextpdf.php?mno=61784 |