Handwritten Digit Classification in Bangla and Hindi Using Deep Learning

Handwritten digit classification is a well-known and important problem in the field of optical character recognition (OCR). The primary challenge is correctly classifying digits which are highly varied in their visual characteristics primarily due to the writing styles of different individuals. In t...

Bibliographic Details
Main Authors: Jishnu Mukhoti, Sukanya Dutta, Ram Sarkar
Format: Article
Language: English
Published: Taylor & Francis Group 2020-12-01
Series: Applied Artificial Intelligence
Online Access: http://dx.doi.org/10.1080/08839514.2020.1804228
author Jishnu Mukhoti
Sukanya Dutta
Ram Sarkar
collection DOAJ
description Handwritten digit classification is a well-known and important problem in the field of optical character recognition (OCR). The primary challenge is correctly classifying digits whose visual characteristics vary widely, primarily because of the writing styles of different individuals. In this paper, we propose the use of Convolutional Neural Networks (CNNs) for classifying handwritten Bangla and Hindi numerals. The major advantage of a CNN-based classifier is that no hand-crafted features need to be extracted from the images beforehand for efficient and accurate classification. An added benefit of a CNN classifier is that it provides translational invariance and a degree of rotational invariance during recognition. This is useful in real-time OCR systems, where input images are often not perfectly aligned with the vertical axis. In this work, we use modified versions of the well-known LeNet CNN architecture. Extensive experiments have revealed a best-case classification accuracy of 98.2% for Bangla and 98.8% for Hindi numerals, outperforming competing models in the literature.
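The translational invariance mentioned in the description can be illustrated with a toy example. The sketch below is not the authors' model or a LeNet variant: the 8x8 image, the hand-picked 3x3 kernel, and the helper names (`conv2d_valid`, `global_max_pool`, `shift`) are all assumptions made for illustration. It only shows that a convolution followed by global max pooling gives the same response when a stroke is moved, provided the stroke stays inside the frame.

```python
def conv2d_valid(image, kernel):
    """Valid-mode 2D cross-correlation on list-of-lists inputs."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def global_max_pool(feature_map):
    """Reduce a feature map to its single strongest response."""
    return max(max(row) for row in feature_map)


def shift(image, dr, dc):
    """Move image content by (dr, dc) pixels, filling vacated pixels with 0."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w:
                out[nr][nc] = image[r][c]
    return out


# Hypothetical 8x8 binary image containing a short vertical stroke.
image = [[0] * 8 for _ in range(8)]
for row in range(2, 5):
    image[row][2] = 1

# Hand-picked kernel that responds to a vertical bar (an assumption,
# not a learned filter).
kernel = [[-1, 1, -1],
          [-1, 1, -1],
          [-1, 1, -1]]

original_score = global_max_pool(conv2d_valid(image, kernel))
shifted_score = global_max_pool(conv2d_valid(shift(image, 2, 3), kernel))

# The strongest filter response is unchanged by the shift.
print(original_score, shifted_score)
```

In practical CNNs the pooling is usually local rather than global, so the invariance is only approximate and holds for small shifts.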
first_indexed 2024-03-12T00:36:28Z
format Article
id doaj.art-b4c785b3c3ad4c189d02b40196bde4d9
institution Directory Open Access Journal
issn 0883-9514
1087-6545
language English
last_indexed 2024-03-12T00:36:28Z
publishDate 2020-12-01
publisher Taylor & Francis Group
record_format Article
series Applied Artificial Intelligence
spelling doaj.art-b4c785b3c3ad4c189d02b40196bde4d9; 2023-09-15T09:33:58Z; eng; Taylor & Francis Group; Applied Artificial Intelligence; ISSN 0883-9514, 1087-6545; 2020-12-01; vol. 34, no. 14, pp. 1074-1099; DOI 10.1080/08839514.2020.1804228; Handwritten Digit Classification in Bangla and Hindi Using Deep Learning; Jishnu Mukhoti (Jadavpur University), Sukanya Dutta (Jadavpur University), Ram Sarkar (Jadavpur University); http://dx.doi.org/10.1080/08839514.2020.1804228
title Handwritten Digit Classification in Bangla and Hindi Using Deep Learning
url http://dx.doi.org/10.1080/08839514.2020.1804228