Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition
This study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical mark...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-03-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/14/6/2332 |
_version_ | 1797242213394022400 |
---|---|
author | Sarab AlMuhaideb Najwa Altwaijry Ahad D. AlGhamdy Daad AlKhulaiwi Raghad AlHassan Haya AlOmran Aliyah M. AlSalem |
author_facet | Sarab AlMuhaideb Najwa Altwaijry Ahad D. AlGhamdy Daad AlKhulaiwi Raghad AlHassan Haya AlOmran Aliyah M. AlSalem |
author_sort | Sarab AlMuhaideb |
collection | DOAJ |
description | This study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical marks, the domain demands specialized attention. While prior research has largely concentrated on adult handwriting, the spotlight here is on children’s handwritten Arabic characters, an area marked by its distinct challenges, such as variations in writing quality and increased distortions. To this end, we introduce a novel dataset, “Dhad”, refined for enhanced quality and quantity. Our investigation employs a tri-fold experimental approach, encompassing the exploration of pre-trained deep learning models (i.e., MobileNet, ResNet50, and DenseNet121), custom-designed Convolutional Neural Network (CNN) architecture, and traditional classifiers (i.e., Support Vector Machine (SVM), Random Forest (RF), and Multilayer Perceptron (MLP)), leveraging deep visual features. The results illuminate the efficacy of fine-tuned pre-existing models, the potential of custom CNN designs, and the intricacies associated with disjointed classification paradigms. The pre-trained model MobileNet achieved the best test accuracy of 93.59% on the Dhad dataset. Additionally, as a conceptual proposal, we introduce the idea of a computer application designed specifically for children aged 7–12, aimed at improving Arabic handwriting skills. Our concluding reflections emphasize the need for nuanced dataset curation, advanced model architectures, and cohesive training strategies to navigate the multifaceted challenges of Arabic character recognition. |
first_indexed | 2024-04-24T18:35:39Z |
format | Article |
id | doaj.art-6ea88afd7c874dbd9ac7f4adcdd8b38e |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-04-24T18:35:39Z |
publishDate | 2024-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-6ea88afd7c874dbd9ac7f4adcdd8b38e2024-03-27T13:19:21ZengMDPI AGApplied Sciences2076-34172024-03-01146233210.3390/app14062332Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated RecognitionSarab AlMuhaideb0Najwa Altwaijry1Ahad D. AlGhamdy2Daad AlKhulaiwi3Raghad AlHassan4Haya AlOmran5Aliyah M. AlSalem6Computer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11362, Saudi ArabiaThis study delves into the intricate realm of recognizing handwritten Arabic characters, specifically targeting children’s script. Given the inherent complexities of the Arabic script, encompassing semi-cursive styles, distinct character forms based on position, and the inclusion of diacritical marks, the domain demands specialized attention. While prior research has largely concentrated on adult handwriting, the spotlight here is on children’s handwritten Arabic characters, an area marked by its distinct challenges, such as variations in writing quality and increased distortions. To this end, we introduce a novel dataset, “Dhad”, refined for enhanced quality and quantity. Our investigation employs a tri-fold experimental approach, encompassing the exploration of pre-trained deep learning models (i.e., MobileNet, ResNet50, and DenseNet121), custom-designed Convolutional Neural Network (CNN) architecture, and traditional classifiers (i.e., Support Vector Machine (SVM), Random Forest (RF), and Multilayer Perceptron (MLP)), leveraging deep visual features. The results illuminate the efficacy of fine-tuned pre-existing models, the potential of custom CNN designs, and the intricacies associated with disjointed classification paradigms. The pre-trained model MobileNet achieved the best test accuracy of 93.59% on the Dhad dataset. Additionally, as a conceptual proposal, we introduce the idea of a computer application designed specifically for children aged 7–12, aimed at improving Arabic handwriting skills. Our concluding reflections emphasize the need for nuanced dataset curation, advanced model architectures, and cohesive training strategies to navigate the multifaceted challenges of Arabic character recognition.https://www.mdpi.com/2076-3417/14/6/2332deep learningpre-trained modelschild handwriting recognitionDhadHijja |
spellingShingle | Sarab AlMuhaideb Najwa Altwaijry Ahad D. AlGhamdy Daad AlKhulaiwi Raghad AlHassan Haya AlOmran Aliyah M. AlSalem Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition Applied Sciences deep learning pre-trained models child handwriting recognition Dhad Hijja |
title | Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition |
title_full | Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition |
title_fullStr | Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition |
title_full_unstemmed | Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition |
title_short | Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition |
title_sort | dhad a children s handwritten arabic characters dataset for automated recognition |
topic | deep learning pre-trained models child handwriting recognition Dhad Hijja |
url | https://www.mdpi.com/2076-3417/14/6/2332 |
work_keys_str_mv | AT sarabalmuhaideb dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT najwaaltwaijry dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT ahaddalghamdy dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT daadalkhulaiwi dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT raghadalhassan dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT hayaalomran dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition AT aliyahmalsalem dhadachildrenshandwrittenarabiccharactersdatasetforautomatedrecognition |