A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network

In the emerging age of technologies, machines are becoming more and more skilled and capable just like humans. Despite the fact that machines do not have their own intelligence, but still due to advancement in Artificial Intelligence (AI), machines are rapidly advancing. The area of Pattern Recognit...

Full description

Bibliographic Details
Main Authors: Khawaja Ubaid Ur Rehman, Yaser Daanial Khan
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8805395/
_version_ 1819174423985389568
author Khawaja Ubaid Ur Rehman
Yaser Daanial Khan
author_facet Khawaja Ubaid Ur Rehman
Yaser Daanial Khan
author_sort Khawaja Ubaid Ur Rehman
collection DOAJ
description In the emerging age of technologies, machines are becoming more and more skilled and capable just like humans. Despite the fact that machines do not have their own intelligence, but still due to advancement in Artificial Intelligence (AI), machines are rapidly advancing. The area of Pattern Recognition (PR) deals with bringing enhancements to identify obscure patterns corresponding to specific classes. Optical Character Recognition (OCR) is a subfield of PR which deals with the recognition of characters. A great work has been done for Japanese, Hindi, Arabic and Chinese scripts, but only a diminutive work has been done for Urdu script. The Urdu language is highly cursive and is written in different calligraphic styles like Naskh, Nastalique, Kofi, Devani and Riqa. The Nastalique font is very calligraphic with aesthetic beauty. The ligature segmentation of Urdu Nastalique is also more difficult as compared to other languages. Urdu Nastalique has some characteristics like stacking of ligatures and cursiveness which makes its ligature segmentation a difficult task. Cursiveness means ligatures are joined together to form a new shape. It contains connected ligatures which makes it more complicated as compared to other languages. The ligature recognition of Urdu text by an OCR is a strenuous task due to variants of scaling, rotation, orientation and font style. In this study, a scale and rotation invariant classifier for Urdu Nastalique OCR is proposed. A combination of scale and location invariant moments is used for feature extraction and the classification is performed using Cascade Forward Backpropagation Neural Network. The model is validated through independent dataset testing and 5-fold cross-validation which gave 96.474% and 96.922% accuracy. The results depict the adaptability of the proposed model due to its high accuracy for recognition of Urdu Nastalique Ligature.
first_indexed 2024-12-22T20:38:45Z
format Article
id doaj.art-3dbe48c74da24e95b915891e079b9a12
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T20:38:45Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-3dbe48c74da24e95b915891e079b9a122022-12-21T18:13:23ZengIEEEIEEE Access2169-35362019-01-01712064812066910.1109/ACCESS.2019.29363638805395A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural NetworkKhawaja Ubaid Ur Rehman0https://orcid.org/0000-0002-1377-0376Yaser Daanial Khan1Department of Computer Science, University of Management and Technology, Lahore, PakistanDepartment of Computer Science, University of Management and Technology, Lahore, PakistanIn the emerging age of technologies, machines are becoming more and more skilled and capable just like humans. Despite the fact that machines do not have their own intelligence, but still due to advancement in Artificial Intelligence (AI), machines are rapidly advancing. The area of Pattern Recognition (PR) deals with bringing enhancements to identify obscure patterns corresponding to specific classes. Optical Character Recognition (OCR) is a subfield of PR which deals with the recognition of characters. A great work has been done for Japanese, Hindi, Arabic and Chinese scripts, but only a diminutive work has been done for Urdu script. The Urdu language is highly cursive and is written in different calligraphic styles like Naskh, Nastalique, Kofi, Devani and Riqa. The Nastalique font is very calligraphic with aesthetic beauty. The ligature segmentation of Urdu Nastalique is also more difficult as compared to other languages. Urdu Nastalique has some characteristics like stacking of ligatures and cursiveness which makes its ligature segmentation a difficult task. Cursiveness means ligatures are joined together to form a new shape. It contains connected ligatures which makes it more complicated as compared to other languages. The ligature recognition of Urdu text by an OCR is a strenuous task due to variants of scaling, rotation, orientation and font style. In this study, a scale and rotation invariant classifier for Urdu Nastalique OCR is proposed. A combination of scale and location invariant moments is used for feature extraction and the classification is performed using Cascade Forward Backpropagation Neural Network. The model is validated through independent dataset testing and 5-fold cross-validation which gave 96.474% and 96.922% accuracy. The results depict the adaptability of the proposed model due to its high accuracy for recognition of Urdu Nastalique Ligature.https://ieeexplore.ieee.org/document/8805395/Deep neural network (DNN)optical character recognition (OCR)scale invariant classifierrotation invariant classifier
spellingShingle Khawaja Ubaid Ur Rehman
Yaser Daanial Khan
A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
IEEE Access
Deep neural network (DNN)
optical character recognition (OCR)
scale invariant classifier
rotation invariant classifier
title A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
title_full A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
title_fullStr A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
title_full_unstemmed A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
title_short A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network
title_sort scale and rotation invariant urdu nastalique ligature recognition using cascade forward backpropagation neural network
topic Deep neural network (DNN)
optical character recognition (OCR)
scale invariant classifier
rotation invariant classifier
url https://ieeexplore.ieee.org/document/8805395/
work_keys_str_mv AT khawajaubaidurrehman ascaleandrotationinvarianturdunastaliqueligaturerecognitionusingcascadeforwardbackpropagationneuralnetwork
AT yaserdaanialkhan ascaleandrotationinvarianturdunastaliqueligaturerecognitionusingcascadeforwardbackpropagationneuralnetwork
AT khawajaubaidurrehman scaleandrotationinvarianturdunastaliqueligaturerecognitionusingcascadeforwardbackpropagationneuralnetwork
AT yaserdaanialkhan scaleandrotationinvarianturdunastaliqueligaturerecognitionusingcascadeforwardbackpropagationneuralnetwork