COTA 2.0: an Automatic Corrector of Tunisian Arabic Social Media Texts
In written text, orthographic noise is a common concern for NLP, especially when operating social network comments and raw documents. This is mainly due to its orthographic conventions and morphological ambiguity. We propose to automatically normalize the social media dialect corpora by following CO...
Main Authors: | Asma Mekki, Inès Zribi, Mariem Ellouze, Lamia Hadrich Belguith |
---|---|
Format: | Article |
Language: | English |
Published: |
Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
2022-12-01
|
Series: | Jordanian Journal of Computers and Information Technology |
Subjects: | |
Online Access: | http://www.ejmanager.com/fulltextpdf.php?mno=61784 |
Similar Items
-
The Effect of L1 Persian on the Acquisition of English L2 Orthographic System on the Shared Grounds
by: Ali Akbar Jabbari, et al.
Published: (2014-06-01) -
Cotas étnico-raciais e cotas epistêmicas: bases para uma antropologia antirracista e descolonizadora
by: José Jorge de Carvalho
Published: (2022-12-01) -
Effects of Orthographic Consistency on Bilingual Reading: Human and Computer Simulation Data
by: Eraldo Paulesu, et al.
Published: (2021-06-01) -
Sistema de cotas
by: Fabíola Cristina de Oliveira Bento Aquino, et al.
Published: (2023-09-01) -
Market Structure and Performance of Tunisian Banks
by: Ines Ayadi, et al.
Published: (2013-06-01)