Multilingual character recognition dataset for Moroccan official documents

This article focuses on the construction of a dataset for multilingual character recognition in Moroccan official documents. The dataset covers languages such as Arabic, French, and Tamazight and are built programmatically to ensure data diversity. It consists of sub-datasets such as Uppercase alpha...

Full description

Bibliographic Details
Main Authors: Ali Benaissa, Abdelkhalak Bahri, Ahmad El Allaoui
Format: Article
Language:English
Published: Elsevier 2024-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340923009848