Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application

Multi-instance Learning (MIL) is widely applied in image classification. In MIL, an image is presented as a bag. A bag consists of multi-instance which is known as patches. Irrelevant features of the image presented to the classifier affects the classification performance. Feature selection is one o...

Full description

Bibliographic Details
Main Authors: Mazniha Berahim, Mazniha Berahim, Samsudin, Noor Azah, Mustapha, Aida, Mohd Salleh, Rohayu, Mohd Nasi, Muhammad Jaffri
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:http://eprints.uthm.edu.my/11058/1/J17533_bdc74aa47f8929110c75df69a7d6f211.pdf
_version_ 1811134713418481664
author Mazniha Berahim, Mazniha Berahim
Samsudin, Noor Azah
Mustapha, Aida
Mohd Salleh, Rohayu
Mohd Nasi, Muhammad Jaffri
author_facet Mazniha Berahim, Mazniha Berahim
Samsudin, Noor Azah
Mustapha, Aida
Mohd Salleh, Rohayu
Mohd Nasi, Muhammad Jaffri
author_sort Mazniha Berahim, Mazniha Berahim
collection UTHM
description Multi-instance Learning (MIL) is widely applied in image classification. In MIL, an image is presented as a bag. A bag consists of multi-instance which is known as patches. Irrelevant features of the image presented to the classifier affects the classification performance. Feature selection is one of the essential phases to select relevant. However, limited studies discuss the feature selection phase in MIL. Correlation between feature-class (FC) relationship is one important criterion to analyse features’ relevance. However, it cannot be performed directly in MIL. To address this gap, this study proposed the MultiBag-FCCorr feature selection technique. It consists of three steps: transformation, evaluation and fusion. The bags of feature information are acquired from summarization from different statistical central tendency measures of trimmed mean, mode and median. In feature evaluation step, extended point biserial correlation has been used to measure FC correlation and then the FC score has been analysed. The selected features are validated via two prominent classifiers (Support Vector Machine (SVM) and K-Nearest Neighbour (KNN)) on benchmark MI image datasets: UCSB Breast Cancer, Tiger, Elephant and Fox datasets. The selected features of UCSB Breast Cancer dataset were reduced to 92% number of features from the proposed technique giving the best result of average accuracy with 86.8.% using SVM and 84.5% using KNN. The average accuracy improved 6.3% using SVM and 16.4% using KNN compared without implementing the proposed feature selection. The results proved that the selected feature set improved the performance of MI image classification.
first_indexed 2024-09-24T00:09:23Z
format Article
id uthm.eprints-11058
institution Universiti Tun Hussein Onn Malaysia
language English
last_indexed 2024-09-24T00:09:23Z
publishDate 2024
record_format dspace
spelling uthm.eprints-110582024-06-04T02:45:41Z http://eprints.uthm.edu.my/11058/ Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application Mazniha Berahim, Mazniha Berahim Samsudin, Noor Azah Mustapha, Aida Mohd Salleh, Rohayu Mohd Nasi, Muhammad Jaffri QA76 Computer software Multi-instance Learning (MIL) is widely applied in image classification. In MIL, an image is presented as a bag. A bag consists of multi-instance which is known as patches. Irrelevant features of the image presented to the classifier affects the classification performance. Feature selection is one of the essential phases to select relevant. However, limited studies discuss the feature selection phase in MIL. Correlation between feature-class (FC) relationship is one important criterion to analyse features’ relevance. However, it cannot be performed directly in MIL. To address this gap, this study proposed the MultiBag-FCCorr feature selection technique. It consists of three steps: transformation, evaluation and fusion. The bags of feature information are acquired from summarization from different statistical central tendency measures of trimmed mean, mode and median. In feature evaluation step, extended point biserial correlation has been used to measure FC correlation and then the FC score has been analysed. The selected features are validated via two prominent classifiers (Support Vector Machine (SVM) and K-Nearest Neighbour (KNN)) on benchmark MI image datasets: UCSB Breast Cancer, Tiger, Elephant and Fox datasets. The selected features of UCSB Breast Cancer dataset were reduced to 92% number of features from the proposed technique giving the best result of average accuracy with 86.8.% using SVM and 84.5% using KNN. The average accuracy improved 6.3% using SVM and 16.4% using KNN compared without implementing the proposed feature selection. The results proved that the selected feature set improved the performance of MI image classification. 2024 Article PeerReviewed text en http://eprints.uthm.edu.my/11058/1/J17533_bdc74aa47f8929110c75df69a7d6f211.pdf Mazniha Berahim, Mazniha Berahim and Samsudin, Noor Azah and Mustapha, Aida and Mohd Salleh, Rohayu and Mohd Nasi, Muhammad Jaffri (2024) Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application. COMPENDIUM by paperASIA. pp. 51-61.
spellingShingle QA76 Computer software
Mazniha Berahim, Mazniha Berahim
Samsudin, Noor Azah
Mustapha, Aida
Mohd Salleh, Rohayu
Mohd Nasi, Muhammad Jaffri
Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title_full Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title_fullStr Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title_full_unstemmed Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title_short Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application
title_sort bag based feature class correlation analysis for multi instance learning application
topic QA76 Computer software
url http://eprints.uthm.edu.my/11058/1/J17533_bdc74aa47f8929110c75df69a7d6f211.pdf
work_keys_str_mv AT maznihaberahimmaznihaberahim bagbasedfeatureclasscorrelationanalysisformultiinstancelearningapplication
AT samsudinnoorazah bagbasedfeatureclasscorrelationanalysisformultiinstancelearningapplication
AT mustaphaaida bagbasedfeatureclasscorrelationanalysisformultiinstancelearningapplication
AT mohdsallehrohayu bagbasedfeatureclasscorrelationanalysisformultiinstancelearningapplication
AT mohdnasimuhammadjaffri bagbasedfeatureclasscorrelationanalysisformultiinstancelearningapplication