Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set

Accomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classifi...

Full description

Bibliographic Details
Main Authors: Ahmed Talat Sahlol, Mohamed Abd Elaziz, Mohammed A. A. Al-Qaness, Sunghwan Kim
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8976177/
_version_ 1819276419842179072
author Ahmed Talat Sahlol
Mohamed Abd Elaziz
Mohammed A. A. Al-Qaness
Sunghwan Kim
author_facet Ahmed Talat Sahlol
Mohamed Abd Elaziz
Mohammed A. A. Al-Qaness
Sunghwan Kim
author_sort Ahmed Talat Sahlol
collection DOAJ
description Accomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classification. Recent literature focused on the selection of appropriate features as a key point towards building a successful and sufficient character recognition system. In this paper, we propose a hybrid machine learning approach that utilizes neighborhood rough sets with a binary whale optimization algorithm to select the most appropriate features for the recognition of handwritten Arabic characters. To validate the proposed approach, we used the CENPARMI dataset, which is a well-known dataset for machine learning experiments involving handwritten Arabic characters. The results show clear advantages of the proposed approach in terms of recognition accuracy, memory footprint, and processor time than those without the features of the proposed method. When comparing the results of the proposed method with other recent state-of-the-art optimization algorithms, the proposed approach outperformed all others in all experiments. Moreover, the proposed approach shows the highest recognition rate with the smallest consumption time compared to deep neural networks such as VGGnet, Resnet, Nasnet, Mobilenet, Inception, and Xception. The proposed approach was also compared with recently published works using the same dataset, which further confirmed the outstanding classification accuracy and time consumption of this approach. The misclassified failure cases were studied and analyzed, which showed that they would likely be confusing for even Arabic natives because the correct interpretation of the characters required the context of their appearance.
first_indexed 2024-12-23T23:39:56Z
format Article
id doaj.art-bc66a1e7db6049fcb2d489308dff0bc0
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-23T23:39:56Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-bc66a1e7db6049fcb2d489308dff0bc02022-12-21T17:25:44ZengIEEEIEEE Access2169-35362020-01-018230112302110.1109/ACCESS.2020.29704388976177Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough SetAhmed Talat Sahlol0https://orcid.org/0000-0002-6221-6961Mohamed Abd Elaziz1https://orcid.org/0000-0002-7682-6269Mohammed A. A. Al-Qaness2https://orcid.org/0000-0002-6956-7641Sunghwan Kim3https://orcid.org/0000-0003-1762-5915Computer Teacher Preparation Department, Faculty of Specific Education, Damietta University, Damietta, EgyptDepartment of Mathematics, Faculty of Science, Zagazig University, Zagazig, EgyptSchool of Computer Science, Wuhan University, Wuhan, ChinaSchool of Electrical Engineering, University of Ulsan, Ulsan, South KoreaAccomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classification. Recent literature focused on the selection of appropriate features as a key point towards building a successful and sufficient character recognition system. In this paper, we propose a hybrid machine learning approach that utilizes neighborhood rough sets with a binary whale optimization algorithm to select the most appropriate features for the recognition of handwritten Arabic characters. To validate the proposed approach, we used the CENPARMI dataset, which is a well-known dataset for machine learning experiments involving handwritten Arabic characters. The results show clear advantages of the proposed approach in terms of recognition accuracy, memory footprint, and processor time than those without the features of the proposed method. When comparing the results of the proposed method with other recent state-of-the-art optimization algorithms, the proposed approach outperformed all others in all experiments. Moreover, the proposed approach shows the highest recognition rate with the smallest consumption time compared to deep neural networks such as VGGnet, Resnet, Nasnet, Mobilenet, Inception, and Xception. The proposed approach was also compared with recently published works using the same dataset, which further confirmed the outstanding classification accuracy and time consumption of this approach. The misclassified failure cases were studied and analyzed, which showed that they would likely be confusing for even Arabic natives because the correct interpretation of the characters required the context of their appearance.https://ieeexplore.ieee.org/document/8976177/Machine learning approachfeature selectionoptimizationArabic handwritten character recognitionwhale optimizationneighborhood rough set
spellingShingle Ahmed Talat Sahlol
Mohamed Abd Elaziz
Mohammed A. A. Al-Qaness
Sunghwan Kim
Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
IEEE Access
Machine learning approach
feature selection
optimization
Arabic handwritten character recognition
whale optimization
neighborhood rough set
title Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
title_full Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
title_fullStr Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
title_full_unstemmed Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
title_short Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set
title_sort handwritten arabic optical character recognition approach based on hybrid whale optimization algorithm with neighborhood rough set
topic Machine learning approach
feature selection
optimization
Arabic handwritten character recognition
whale optimization
neighborhood rough set
url https://ieeexplore.ieee.org/document/8976177/
work_keys_str_mv AT ahmedtalatsahlol handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset
AT mohamedabdelaziz handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset
AT mohammedaaalqaness handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset
AT sunghwankim handwrittenarabicopticalcharacterrecognitionapproachbasedonhybridwhaleoptimizationalgorithmwithneighborhoodroughset