A Novel Approach to Increase the Efficiency of Filter-Based Feature Selection Methods in High-Dimensional Datasets With Strong Correlation Structure

Nowadays, data dimensions have increased depending on the developments in information and measurement technologies. Due to the high dimensionality, it is necessary to use pre-analysis data reduction methods for many analyzes such as classification and regression analysis. In the solution of high-dim...

Full description

Bibliographic Details
Main Author: Serkan Akogul
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10286827/
Description
Summary:Nowadays, data dimensions have increased depending on the developments in information and measurement technologies. Due to the high dimensionality, it is necessary to use pre-analysis data reduction methods for many analyzes such as classification and regression analysis. In the solution of high-dimensionality, filter feature selection methods based on statistical criteria are widely used in terms of simplicity and efficiency. One of the important problems with filter feature selection methods is the selection of multiple features carrying the same information unnecessarily when strong correlations exist between features. In this study, a novel approach is proposed to solve this problem of filter feature selection methods. In addition, with the proposed new approach, the question of how many appropriate features will be included is also solved. The performance of the proposed approach is demonstrated on high-dimensional reflectance data with high correlations between features. The results obtained revealed that the proposed approach improves the classification performance of filter feature selection methods in mixture discriminant analysis in terms of classification accuracy and entropy criteria.
ISSN:2169-3536