Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
Accomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classifi...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8976177/ |
_version_ | 1819276419842179072 |
---|---|
author | Ahmed Talat Sahlol Mohamed Abd Elaziz Mohammed A. A. Al-Qaness Sunghwan Kim |
author_facet | Ahmed Talat Sahlol Mohamed Abd Elaziz Mohammed A. A. Al-Qaness Sunghwan Kim |
author_sort | Ahmed Talat Sahlol |
collection | DOAJ |
description | Accomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classification. Recent literature focused on the selection of appropriate features as a key point towards building a successful and sufficient character recognition system. In this paper, we propose a hybrid machine learning approach that utilizes neighborhood rough sets with a binary whale optimization algorithm to select the most appropriate features for the recognition of handwritten Arabic characters. To validate the proposed approach, we used the CENPARMI dataset, which is a well-known dataset for machine learning experiments involving handwritten Arabic characters. The results show clear advantages of the proposed approach in terms of recognition accuracy, memory footprint, and processor time than those without the features of the proposed method. When comparing the results of the proposed method with other recent state-of-the-art optimization algorithms, the proposed approach outperformed all others in all experiments. Moreover, the proposed approach shows the highest recognition rate with the smallest consumption time compared to deep neural networks such as VGGnet, Resnet, Nasnet, Mobilenet, Inception, and Xception. The proposed approach was also compared with recently published works using the same dataset, which further confirmed the outstanding classification accuracy and time consumption of this approach. The misclassified failure cases were studied and analyzed, which showed that they would likely be confusing for even Arabic natives because the correct interpretation of the characters required the context of their appearance. |
first_indexed | 2024-12-23T23:39:56Z |
format | Article |
id | doaj.art-bc66a1e7db6049fcb2d489308dff0bc0 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-23T23:39:56Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-bc66a1e7db6049fcb2d489308dff0bc02022-12-21T17:25:44ZengIEEEIEEE Access2169-35362020-01-018230112302110.1109/ACCESS.2020.29704388976177Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough SetAhmed Talat Sahlol0https://orcid.org/0000-0002-6221-6961Mohamed Abd Elaziz1https://orcid.org/0000-0002-7682-6269Mohammed A. A. Al-Qaness2https://orcid.org/0000-0002-6956-7641Sunghwan Kim3https://orcid.org/0000-0003-1762-5915Computer Teacher Preparation Department, Faculty of Specific Education, Damietta University, Damietta, EgyptDepartment of Mathematics, Faculty of Science, Zagazig University, Zagazig, EgyptSchool of Computer Science, Wuhan University, Wuhan, ChinaSchool of Electrical Engineering, University of Ulsan, Ulsan, South KoreaAccomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classification. Recent literature focused on the selection of appropriate features as a key point towards building a successful and sufficient character recognition system. In this paper, we propose a hybrid machine learning approach that utilizes neighborhood rough sets with a binary whale optimization algorithm to select the most appropriate features for the recognition of handwritten Arabic characters. To validate the proposed approach, we used the CENPARMI dataset, which is a well-known dataset for machine learning experiments involving handwritten Arabic characters. The results show clear advantages of the proposed approach in terms of recognition accuracy, memory footprint, and processor time than those without the features of the proposed method. When comparing the results of the proposed method with other recent state-of-the-art optimization algorithms, the proposed approach outperformed all others in all experiments. Moreover, the proposed approach shows the highest recognition rate with the smallest consumption time compared to deep neural networks such as VGGnet, Resnet, Nasnet, Mobilenet, Inception, and Xception. The proposed approach was also compared with recently published works using the same dataset, which further confirmed the outstanding classification accuracy and time consumption of this approach. The misclassified failure cases were studied and analyzed, which showed that they would likely be confusing for even Arabic natives because the correct interpretation of the characters required the context of their appearance.https://ieeexplore.ieee.org/document/8976177/Machine learning approachfeature selectionoptimizationArabic handwritten character recognitionwhale optimizationneighborhood rough set |
spellingShingle | Ahmed Talat Sahlol Mohamed Abd Elaziz Mohammed A. A. Al-Qaness Sunghwan Kim Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set IEEE Access Machine learning approach feature selection optimization Arabic handwritten character recognition whale optimization neighborhood rough set |
title | Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set |
title_full | Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set |
title_fullStr | Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set |
title_full_unstemmed | Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set |
title_short | Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set |
title_sort | handwritten arabic optical character recognition approach based on hybrid whale optimization algorithm with neighborhood rough set |
topic | Machine learning approach feature selection optimization Arabic handwritten character recognition whale optimization neighborhood rough set |
url | https://ieeexplore.ieee.org/document/8976177/ |
work_keys_str_mv | AT ahmedtalatsahlol handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset AT mohamedabdelaziz handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset AT mohammedaaalqaness handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset AT sunghwankim handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset |