Discriminative Adaptive Sets for Multi-Label Classification

Multi-label classification aims to associate multiple labels to a given data/object instance to better describe them. Multi-label data sets are common in a lot of emerging application areas like: Text/Multimedia classification, Bio-Informatics, Medical image annotations and Computer Vision to name a...

Full description

Bibliographic Details
Main Authors:	Muhammad Usman Ghani, Muhammad Rafi, Muhammad Atif Tahir
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Lazy learning multi-label classification instance based learning MAP estimation nearest neighbors
Online Access:	https://ieeexplore.ieee.org/document/9274293/

_version_	1828735550190780416
author	Muhammad Usman Ghani Muhammad Rafi Muhammad Atif Tahir
author_facet	Muhammad Usman Ghani Muhammad Rafi Muhammad Atif Tahir
author_sort	Muhammad Usman Ghani
collection	DOAJ
description	Multi-label classification aims to associate multiple labels to a given data/object instance to better describe them. Multi-label data sets are common in a lot of emerging application areas like: Text/Multimedia classification, Bio-Informatics, Medical image annotations and Computer Vision to name a few. There is a growing interest in efficient and accurate multi-label classification. There are two major approaches to perform multi-label classification (i) problem transformation methods and (ii) algorithm adaptation methods. In algorithm adaptation, the traditional classification algorithms are modified to handle multi-label data sets. One classification algorithm which is often modified to do multi-label classification is k- nearest neighbor (kNN). k-nearest neighbor is popular due to its simplicity, easy to implement and seamlessly adaptability. Despite its merits it has several drawbacks like: sensitivity of noisy data, missing values and outliers; feature scaling and often becoming inaccurate for large overlapping solution space. In this paper, a modification to kNN method is suggested for multi-label classification with three improvement strategies (i) selection of local example w.r.t. unknown example – the motivation for this comes from the fact that local and relevant space is vital for the improvement in multi-label classification; (ii) Splitting the input space into multiple sub-spaces for optimal label estimation – the motivation is to estimate label accurately in the presence of noisy labels; And (iii) selection of labels using Mean Average Precision (MAP) estimates – here our motivation is to utilize the training data effectively to maximize the hidden distribution and optimal parameters for the method. The proposed method is implemented and compared with state-of-the-art approaches based on kNN or similar approaches that effectively select and optimize relevant spaces for multi-label classification. Evaluation based on multiple metrics like Hamming loss, Precision/Recall and F-measure are used for evaluation. The suggested approach performed much better than the state-of-the-art on the datasets with strong label cardinalities.
first_indexed	2024-04-12T23:09:18Z
format	Article
id	doaj.art-4cb928db076347dcb266f444251f3e33
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-12T23:09:18Z
publishDate	2020-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-4cb928db076347dcb266f444251f3e332022-12-22T03:12:50ZengIEEEIEEE Access2169-35362020-01-01822757922759510.1109/ACCESS.2020.30417639274293Discriminative Adaptive Sets for Multi-Label ClassificationMuhammad Usman Ghani0https://orcid.org/0000-0002-0192-8850Muhammad Rafi1https://orcid.org/0000-0002-3673-5979Muhammad Atif Tahir2https://orcid.org/0000-0003-1366-8408School of Computer Science, National University of Computer and Emerging Sciences, Karachi Campus, Karachi, PakistanSchool of Computer Science, National University of Computer and Emerging Sciences, Karachi Campus, Karachi, PakistanSchool of Computer Science, National University of Computer and Emerging Sciences, Karachi Campus, Karachi, PakistanMulti-label classification aims to associate multiple labels to a given data/object instance to better describe them. Multi-label data sets are common in a lot of emerging application areas like: Text/Multimedia classification, Bio-Informatics, Medical image annotations and Computer Vision to name a few. There is a growing interest in efficient and accurate multi-label classification. There are two major approaches to perform multi-label classification (i) problem transformation methods and (ii) algorithm adaptation methods. In algorithm adaptation, the traditional classification algorithms are modified to handle multi-label data sets. One classification algorithm which is often modified to do multi-label classification is k- nearest neighbor (kNN). k-nearest neighbor is popular due to its simplicity, easy to implement and seamlessly adaptability. Despite its merits it has several drawbacks like: sensitivity of noisy data, missing values and outliers; feature scaling and often becoming inaccurate for large overlapping solution space. In this paper, a modification to kNN method is suggested for multi-label classification with three improvement strategies (i) selection of local example w.r.t. unknown example – the motivation for this comes from the fact that local and relevant space is vital for the improvement in multi-label classification; (ii) Splitting the input space into multiple sub-spaces for optimal label estimation – the motivation is to estimate label accurately in the presence of noisy labels; And (iii) selection of labels using Mean Average Precision (MAP) estimates – here our motivation is to utilize the training data effectively to maximize the hidden distribution and optimal parameters for the method. The proposed method is implemented and compared with state-of-the-art approaches based on kNN or similar approaches that effectively select and optimize relevant spaces for multi-label classification. Evaluation based on multiple metrics like Hamming loss, Precision/Recall and F-measure are used for evaluation. The suggested approach performed much better than the state-of-the-art on the datasets with strong label cardinalities.https://ieeexplore.ieee.org/document/9274293/Lazy learningmulti-label classificationinstance based learningMAP estimationnearest neighbors
spellingShingle	Muhammad Usman Ghani Muhammad Rafi Muhammad Atif Tahir Discriminative Adaptive Sets for Multi-Label Classification IEEE Access Lazy learning multi-label classification instance based learning MAP estimation nearest neighbors
title	Discriminative Adaptive Sets for Multi-Label Classification
title_full	Discriminative Adaptive Sets for Multi-Label Classification
title_fullStr	Discriminative Adaptive Sets for Multi-Label Classification
title_full_unstemmed	Discriminative Adaptive Sets for Multi-Label Classification
title_short	Discriminative Adaptive Sets for Multi-Label Classification
title_sort	discriminative adaptive sets for multi label classification
topic	Lazy learning multi-label classification instance based learning MAP estimation nearest neighbors
url	https://ieeexplore.ieee.org/document/9274293/
work_keys_str_mv	AT muhammadusmanghani discriminativeadaptivesetsformultilabelclassification AT muhammadrafi discriminativeadaptivesetsformultilabelclassification AT muhammadatiftahir discriminativeadaptivesetsformultilabelclassification

Discriminative Adaptive Sets for Multi-Label Classification

Similar Items