An Instance- and Label-Based Feature Selection Method in Classification Tasks

Feature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each...

Full description

Bibliographic Details
Main Authors: Qingcheng Fan, Sicong Liu, Chunjiang Zhao, Shuqin Li
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/14/10/532
_version_ 1797573559896244224
author Qingcheng Fan
Sicong Liu
Chunjiang Zhao
Shuqin Li
author_facet Qingcheng Fan
Sicong Liu
Chunjiang Zhao
Shuqin Li
author_sort Qingcheng Fan
collection DOAJ
description Feature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each feature and the target variable, resulting in correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. Firstly, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Secondly, our method considers label correlation between features, enhancing the accuracy of selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier, lightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.
first_indexed 2024-03-10T21:10:48Z
format Article
id doaj.art-6a2f707dde3f4f09a367fd0c9c0e0757
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-10T21:10:48Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-6a2f707dde3f4f09a367fd0c9c0e07572023-11-19T16:47:48ZengMDPI AGInformation2078-24892023-09-01141053210.3390/info14100532An Instance- and Label-Based Feature Selection Method in Classification TasksQingcheng Fan0Sicong Liu1Chunjiang Zhao2Shuqin Li3College of Information Engineering, Northwest A&F University, 3 Taicheng Road, Yangling, Xianyang 712100, ChinaCollege of Information Engineering, Northwest A&F University, 3 Taicheng Road, Yangling, Xianyang 712100, ChinaCollege of Information Engineering, Northwest A&F University, 3 Taicheng Road, Yangling, Xianyang 712100, ChinaCollege of Information Engineering, Northwest A&F University, 3 Taicheng Road, Yangling, Xianyang 712100, ChinaFeature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each feature and the target variable, resulting in correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. Firstly, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Secondly, our method considers label correlation between features, enhancing the accuracy of selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier, lightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.https://www.mdpi.com/2078-2489/14/10/532feature selectionmanifold learningclassification
spellingShingle Qingcheng Fan
Sicong Liu
Chunjiang Zhao
Shuqin Li
An Instance- and Label-Based Feature Selection Method in Classification Tasks
Information
feature selection
manifold learning
classification
title An Instance- and Label-Based Feature Selection Method in Classification Tasks
title_full An Instance- and Label-Based Feature Selection Method in Classification Tasks
title_fullStr An Instance- and Label-Based Feature Selection Method in Classification Tasks
title_full_unstemmed An Instance- and Label-Based Feature Selection Method in Classification Tasks
title_short An Instance- and Label-Based Feature Selection Method in Classification Tasks
title_sort instance and label based feature selection method in classification tasks
topic feature selection
manifold learning
classification
url https://www.mdpi.com/2078-2489/14/10/532
work_keys_str_mv AT qingchengfan aninstanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT sicongliu aninstanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT chunjiangzhao aninstanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT shuqinli aninstanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT qingchengfan instanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT sicongliu instanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT chunjiangzhao instanceandlabelbasedfeatureselectionmethodinclassificationtasks
AT shuqinli instanceandlabelbasedfeatureselectionmethodinclassificationtasks