M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification
The feature selection process is very important in the field of pattern recognition, which selects the informative features so as to reduce the curse of dimensionality, thus improving the overall classification accuracy. In this paper, a new feature selection approach named Memory-Based Histogram-Or...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
De Gruyter
2019-06-01
|
Series: | Journal of Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1515/jisys-2019-0064 |
_version_ | 1818720407962779648 |
---|---|
author | Guha Ritam Ghosh Manosij Singh Pawan Kumar Sarkar Ram Nasipuri Mita |
author_facet | Guha Ritam Ghosh Manosij Singh Pawan Kumar Sarkar Ram Nasipuri Mita |
author_sort | Guha Ritam |
collection | DOAJ |
description | The feature selection process is very important in the field of pattern recognition, which selects the informative features so as to reduce the curse of dimensionality, thus improving the overall classification accuracy. In this paper, a new feature selection approach named Memory-Based Histogram-Oriented Multi-objective Genetic Algorithm (M-HMOGA) is introduced to identify the informative feature subset to be used for a pattern classification problem. The proposed M-HMOGA approach is applied to two recently used feature sets, namely Mojette transform and Regional Weighted Run Length features. The experimentations are carried out on Bangla, Devanagari, and Roman numeral datasets, which are the three most popular scripts used in the Indian subcontinent. In-house Bangla and Devanagari script datasets and Competition on Handwritten Digit Recognition (HDRC) 2013 Roman numeral dataset are used for evaluating our model. Moreover, as proof of robustness, we have applied an innovative approach of using different datasets for training and testing. We have used in-house Bangla and Devanagari script datasets for training the model, and the trained model is then tested on Indian Statistical Institute numeral datasets. For Roman numerals, we have used the HDRC 2013 dataset for training and the Modified National Institute of Standards and Technology dataset for testing. Comparison of the results obtained by the proposed model with existing HMOGA and MOGA techniques clearly indicates the superiority of M-HMOGA over both of its ancestors. Moreover, use of K-nearest neighbor as well as multi-layer perceptron as classifiers speaks for the classifier-independent nature of M-HMOGA. The proposed M-HMOGA model uses only about 45–50% of the total feature set in order to achieve around 1% increase when the same datasets are partitioned for training-testing and a 2–3% increase in the classification ability while using only 35–45% features when different datasets are used for training-testing with respect to the situation when all the features are used for classification. |
first_indexed | 2024-12-17T20:22:21Z |
format | Article |
id | doaj.art-ae05ca35eeee410290983f9184a4502d |
institution | Directory Open Access Journal |
issn | 0334-1860 2191-026X |
language | English |
last_indexed | 2024-12-17T20:22:21Z |
publishDate | 2019-06-01 |
publisher | De Gruyter |
record_format | Article |
series | Journal of Intelligent Systems |
spelling | doaj.art-ae05ca35eeee410290983f9184a4502d2022-12-21T21:33:54ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2019-06-012911453146710.1515/jisys-2019-0064M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral ClassificationGuha Ritam0Ghosh Manosij1Singh Pawan Kumar2Sarkar Ram3Nasipuri Mita4Department of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188, Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaThe feature selection process is very important in the field of pattern recognition, which selects the informative features so as to reduce the curse of dimensionality, thus improving the overall classification accuracy. In this paper, a new feature selection approach named Memory-Based Histogram-Oriented Multi-objective Genetic Algorithm (M-HMOGA) is introduced to identify the informative feature subset to be used for a pattern classification problem. The proposed M-HMOGA approach is applied to two recently used feature sets, namely Mojette transform and Regional Weighted Run Length features. The experimentations are carried out on Bangla, Devanagari, and Roman numeral datasets, which are the three most popular scripts used in the Indian subcontinent. In-house Bangla and Devanagari script datasets and Competition on Handwritten Digit Recognition (HDRC) 2013 Roman numeral dataset are used for evaluating our model. Moreover, as proof of robustness, we have applied an innovative approach of using different datasets for training and testing. We have used in-house Bangla and Devanagari script datasets for training the model, and the trained model is then tested on Indian Statistical Institute numeral datasets. For Roman numerals, we have used the HDRC 2013 dataset for training and the Modified National Institute of Standards and Technology dataset for testing. Comparison of the results obtained by the proposed model with existing HMOGA and MOGA techniques clearly indicates the superiority of M-HMOGA over both of its ancestors. Moreover, use of K-nearest neighbor as well as multi-layer perceptron as classifiers speaks for the classifier-independent nature of M-HMOGA. The proposed M-HMOGA model uses only about 45–50% of the total feature set in order to achieve around 1% increase when the same datasets are partitioned for training-testing and a 2–3% increase in the classification ability while using only 35–45% features when different datasets are used for training-testing with respect to the situation when all the features are used for classification.https://doi.org/10.1515/jisys-2019-0064m-hmogahandwritten numeral classificationfeature selectiongenetic algorithmbangla numeralsdevanagari numeralsroman numerals |
spellingShingle | Guha Ritam Ghosh Manosij Singh Pawan Kumar Sarkar Ram Nasipuri Mita M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification Journal of Intelligent Systems m-hmoga handwritten numeral classification feature selection genetic algorithm bangla numerals devanagari numerals roman numerals |
title | M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification |
title_full | M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification |
title_fullStr | M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification |
title_full_unstemmed | M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification |
title_short | M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification |
title_sort | m hmoga a new multi objective feature selection algorithm for handwritten numeral classification |
topic | m-hmoga handwritten numeral classification feature selection genetic algorithm bangla numerals devanagari numerals roman numerals |
url | https://doi.org/10.1515/jisys-2019-0064 |
work_keys_str_mv | AT guharitam mhmogaanewmultiobjectivefeatureselectionalgorithmforhandwrittennumeralclassification AT ghoshmanosij mhmogaanewmultiobjectivefeatureselectionalgorithmforhandwrittennumeralclassification AT singhpawankumar mhmogaanewmultiobjectivefeatureselectionalgorithmforhandwrittennumeralclassification AT sarkarram mhmogaanewmultiobjectivefeatureselectionalgorithmforhandwrittennumeralclassification AT nasipurimita mhmogaanewmultiobjectivefeatureselectionalgorithmforhandwrittennumeralclassification |