Information Theoretic Multi-Target Feature Selection via Output Space Quantization

A key challenge in information theoretic feature selection is to estimate mutual information expressions that capture three desirable terms—the relevancy of a feature with the output, the redundancy and the complementarity between groups of features. The challenge becomes more pronounced i...

Full description

Bibliographic Details
Main Authors: Konstantinos Sechidis, Eleftherios Spyromitros-Xioufis, Ioannis Vlahavas
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/21/9/855
_version_ 1818036898258485248
author Konstantinos Sechidis
Eleftherios Spyromitros-Xioufis
Ioannis Vlahavas
author_facet Konstantinos Sechidis
Eleftherios Spyromitros-Xioufis
Ioannis Vlahavas
author_sort Konstantinos Sechidis
collection DOAJ
description A key challenge in information theoretic feature selection is to estimate mutual information expressions that capture three desirable terms&#8212;the relevancy of a feature with the output, the redundancy and the complementarity between groups of features. The challenge becomes more pronounced in multi-target problems, where the output space is multi-dimensional. Our work presents an algorithm that captures these three desirable terms and is suitable for the well-known multi-target prediction settings of multi-label/dimensional classification and multivariate regression. We achieve this by combining two ideas&#8212;deriving low-order information theoretic approximations for the input space and using quantization algorithms for deriving low-dimensional approximations of the output space. Under the above framework we derive a novel criterion, <i>G</i>roup-JMI-Rand, which captures various high-order target interactions. In an extensive experimental study we showed that our suggested criterion achieves competing performance against various other information theoretic feature selection criteria suggested in the literature.
first_indexed 2024-12-10T07:18:16Z
format Article
id doaj.art-98f447350bee4e178ff79c6e4c64e9b8
institution Directory Open Access Journal
issn 1099-4300
language English
last_indexed 2024-12-10T07:18:16Z
publishDate 2019-08-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj.art-98f447350bee4e178ff79c6e4c64e9b82022-12-22T01:57:53ZengMDPI AGEntropy1099-43002019-08-0121985510.3390/e21090855e21090855Information Theoretic Multi-Target Feature Selection via Output Space QuantizationKonstantinos Sechidis0Eleftherios Spyromitros-Xioufis1Ioannis Vlahavas2Department of Computer Science, Aristotle University, 54124 Thessaloniki, GreeceDepartment of Computer Science, Aristotle University, 54124 Thessaloniki, GreeceDepartment of Computer Science, Aristotle University, 54124 Thessaloniki, GreeceA key challenge in information theoretic feature selection is to estimate mutual information expressions that capture three desirable terms&#8212;the relevancy of a feature with the output, the redundancy and the complementarity between groups of features. The challenge becomes more pronounced in multi-target problems, where the output space is multi-dimensional. Our work presents an algorithm that captures these three desirable terms and is suitable for the well-known multi-target prediction settings of multi-label/dimensional classification and multivariate regression. We achieve this by combining two ideas&#8212;deriving low-order information theoretic approximations for the input space and using quantization algorithms for deriving low-dimensional approximations of the output space. Under the above framework we derive a novel criterion, <i>G</i>roup-JMI-Rand, which captures various high-order target interactions. In an extensive experimental study we showed that our suggested criterion achieves competing performance against various other information theoretic feature selection criteria suggested in the literature.https://www.mdpi.com/1099-4300/21/9/855feature selectionmutual informationmulti-targetmulti-labelclustering
spellingShingle Konstantinos Sechidis
Eleftherios Spyromitros-Xioufis
Ioannis Vlahavas
Information Theoretic Multi-Target Feature Selection via Output Space Quantization
Entropy
feature selection
mutual information
multi-target
multi-label
clustering
title Information Theoretic Multi-Target Feature Selection via Output Space Quantization
title_full Information Theoretic Multi-Target Feature Selection via Output Space Quantization
title_fullStr Information Theoretic Multi-Target Feature Selection via Output Space Quantization
title_full_unstemmed Information Theoretic Multi-Target Feature Selection via Output Space Quantization
title_short Information Theoretic Multi-Target Feature Selection via Output Space Quantization
title_sort information theoretic multi target feature selection via output space quantization
topic feature selection
mutual information
multi-target
multi-label
clustering
url https://www.mdpi.com/1099-4300/21/9/855
work_keys_str_mv AT konstantinossechidis informationtheoreticmultitargetfeatureselectionviaoutputspacequantization
AT eleftheriosspyromitrosxioufis informationtheoreticmultitargetfeatureselectionviaoutputspacequantization
AT ioannisvlahavas informationtheoreticmultitargetfeatureselectionviaoutputspacequantization