Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition

This study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical mark...

Full description

Bibliographic Details
Main Authors: Sarab AlMuhaideb, Najwa Altwaijry, Ahad D. AlGhamdy, Daad AlKhulaiwi, Raghad AlHassan, Haya AlOmran, Aliyah M. AlSalem
Format: Article
Language:English
Published: MDPI AG 2024-03-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/6/2332
_version_ 1797242213394022400
author Sarab AlMuhaideb
Najwa Altwaijry
Ahad D. AlGhamdy
Daad AlKhulaiwi
Raghad AlHassan
Haya AlOmran
Aliyah M. AlSalem
author_facet Sarab AlMuhaideb
Najwa Altwaijry
Ahad D. AlGhamdy
Daad AlKhulaiwi
Raghad AlHassan
Haya AlOmran
Aliyah M. AlSalem
author_sort Sarab AlMuhaideb
collection DOAJ
description This study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical marks, the domain demands specialized attention. While prior research has largely concentrated on adult handwriting, the spotlight here is on children’s handwritten Arabic characters, an area marked by its distinct challenges, such as variations in writing quality and increased distortions. To this end, we introduce a novel dataset, “Dhad”, refined for enhanced quality and quantity. Our investigation employs a tri-fold experimental approach, encompassing the exploration of pre-trained deep learning models (i.e., MobileNet, ResNet50, and DenseNet121), custom-designed Convolutional Neural Network (CNN) architecture, and traditional classifiers (i.e., Support Vector Machine (SVM), Random Forest (RF), and Multilayer Perceptron (MLP)), leveraging deep visual features. The results illuminate the efficacy of fine-tuned pre-existing models, the potential of custom CNN designs, and the intricacies associated with disjointed classification paradigms. The pre-trained model MobileNet achieved the best test accuracy of 93.59% on the Dhad dataset. Additionally, as a conceptual proposal, we introduce the idea of a computer application designed specifically for children aged 7–12, aimed at improving Arabic handwriting skills. Our concluding reflections emphasize the need for nuanced dataset curation, advanced model architectures, and cohesive training strategies to navigate the multifaceted challenges of Arabic character recognition.
first_indexed 2024-04-24T18:35:39Z
format Article
id doaj.art-6ea88afd7c874dbd9ac7f4adcdd8b38e
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-04-24T18:35:39Z
publishDate 2024-03-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-6ea88afd7c874dbd9ac7f4adcdd8b38e2024-03-27T13:19:21ZengMDPI AGApplied Sciences2076-34172024-03-01146233210.3390/app14062332Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated RecognitionSarab AlMuhaideb0Najwa Altwaijry1Ahad D. AlGhamdy2Daad AlKhulaiwi3Raghad AlHassan4Haya AlOmran5Aliyah M. AlSalem6Computer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaThis study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical marks, the domain demands specialized attention. While prior research has largely concentrated on adult handwriting, the spotlight here is on children’s handwritten Arabic characters, an area marked by its distinct challenges, such as variations in writing quality and increased distortions. To this end, we introduce a novel dataset, “Dhad”, refined for enhanced quality and quantity. Our investigation employs a tri-fold experimental approach, encompassing the exploration of pre-trained deep learning models (i.e., MobileNet, ResNet50, and DenseNet121), custom-designed Convolutional Neural Network (CNN) architecture, and traditional classifiers (i.e., Support Vector Machine (SVM), Random Forest (RF), and Multilayer Perceptron (MLP)), leveraging deep visual features. The results illuminate the efficacy of fine-tuned pre-existing models, the potential of custom CNN designs, and the intricacies associated with disjointed classification paradigms. The pre-trained model MobileNet achieved the best test accuracy of 93.59% on the Dhad dataset. Additionally, as a conceptual proposal, we introduce the idea of a computer application designed specifically for children aged 7–12, aimed at improving Arabic handwriting skills. Our concluding reflections emphasize the need for nuanced dataset curation, advanced model architectures, and cohesive training strategies to navigate the multifaceted challenges of Arabic character recognition.https://www.mdpi.com/2076-3417/14/6/2332deep learningpre-trained modelschild handwriting recognitionDhadHijja
spellingShingle Sarab AlMuhaideb
Najwa Altwaijry
Ahad D. AlGhamdy
Daad AlKhulaiwi
Raghad AlHassan
Haya AlOmran
Aliyah M. AlSalem
Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
Applied Sciences
deep learning
pre-trained models
child handwriting recognition
Dhad
Hijja
title Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
title_full Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
title_fullStr Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
title_full_unstemmed Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
title_short Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
title_sort dhad a children s handwritten arabic characters dataset for automated recognition
topic deep learning
pre-trained models
child handwriting recognition
Dhad
Hijja
url https://www.mdpi.com/2076-3417/14/6/2332
work_keys_str_mv AT sarabalmuhaideb dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT najwaaltwaijry dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT ahaddalghamdy dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT daadalkhulaiwi dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT raghadalhassan dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT hayaalomran dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition
AT aliyahmalsalem dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition