Designing a supervised feature selection technique for mixed attribute data analysis

Identifying optimal features is critical for increasing the overall performance of data classification. This paper introduces a supervised feature selection technique for analyzing mixed attribute data. It measures data classification performances of features with a user-defined performance criterio...

Full description

Bibliographic Details
Main Authors: Dong Hyun Jeong, Bong Keun Jeong, Nandi Leslie, Charles Kamhoua, Soo-Yeon Ji
Format: Article
Language:English
Published: Elsevier 2022-12-01
Series:Machine Learning with Applications
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666827022001062
_version_ 1811199161929826304
author Dong Hyun Jeong
Bong Keun Jeong
Nandi Leslie
Charles Kamhoua
Soo-Yeon Ji
author_facet Dong Hyun Jeong
Bong Keun Jeong
Nandi Leslie
Charles Kamhoua
Soo-Yeon Ji
author_sort Dong Hyun Jeong
collection DOAJ
description Identifying optimal features is critical for increasing the overall performance of data classification. This paper introduces a supervised feature selection technique for analyzing mixed attribute data. It measures data classification performances of features with a user-defined performance criterion and determines optimal features to boost the overall data analysis performance. A performance evaluation is managed to highlight the usefulness of the technique with existing feature selection techniques such as analysis of variance test, chi-square test, principal component analysis, and mutual information. Visualization is also utilized to understand the differences in classifying instances with different features. From a comparative performance testing and evaluation, we found 5 ∼ 10% performance improvements with the proposed technique. Overall, evaluation results showed the usefulness of our proposed feature selection technique in mixed attribute data analysis.
first_indexed 2024-04-12T01:43:21Z
format Article
id doaj.art-e0e0fbdc96384a0088a223dd50338957
institution Directory Open Access Journal
issn 2666-8270
language English
last_indexed 2024-04-12T01:43:21Z
publishDate 2022-12-01
publisher Elsevier
record_format Article
series Machine Learning with Applications
spelling doaj.art-e0e0fbdc96384a0088a223dd503389572022-12-22T03:53:08ZengElsevierMachine Learning with Applications2666-82702022-12-0110100431Designing a supervised feature selection technique for mixed attribute data analysisDong Hyun Jeong0Bong Keun Jeong1Nandi Leslie2Charles Kamhoua3Soo-Yeon Ji4Department of Computer Science and Information Technology, University of the District of Columbia, DC, USA; Corresponding authors.Department of Management and Decision Sciences, Coastal Carolina University, SC, USARaytheon Technologies, MD, USAU.S. Army Research Laboratory (ARL), Adelphi, MD, USADepartment of Computer Science, Bowie State University, MD, USA; Corresponding authors.Identifying optimal features is critical for increasing the overall performance of data classification. This paper introduces a supervised feature selection technique for analyzing mixed attribute data. It measures data classification performances of features with a user-defined performance criterion and determines optimal features to boost the overall data analysis performance. A performance evaluation is managed to highlight the usefulness of the technique with existing feature selection techniques such as analysis of variance test, chi-square test, principal component analysis, and mutual information. Visualization is also utilized to understand the differences in classifying instances with different features. From a comparative performance testing and evaluation, we found 5 ∼ 10% performance improvements with the proposed technique. Overall, evaluation results showed the usefulness of our proposed feature selection technique in mixed attribute data analysis.http://www.sciencedirect.com/science/article/pii/S2666827022001062Mixed-attribute data analysisSupervised feature selectionMachine learningVisual analysis
spellingShingle Dong Hyun Jeong
Bong Keun Jeong
Nandi Leslie
Charles Kamhoua
Soo-Yeon Ji
Designing a supervised feature selection technique for mixed attribute data analysis
Machine Learning with Applications
Mixed-attribute data analysis
Supervised feature selection
Machine learning
Visual analysis
title Designing a supervised feature selection technique for mixed attribute data analysis
title_full Designing a supervised feature selection technique for mixed attribute data analysis
title_fullStr Designing a supervised feature selection technique for mixed attribute data analysis
title_full_unstemmed Designing a supervised feature selection technique for mixed attribute data analysis
title_short Designing a supervised feature selection technique for mixed attribute data analysis
title_sort designing a supervised feature selection technique for mixed attribute data analysis
topic Mixed-attribute data analysis
Supervised feature selection
Machine learning
Visual analysis
url http://www.sciencedirect.com/science/article/pii/S2666827022001062
work_keys_str_mv AT donghyunjeong designingasupervisedfeatureselectiontechniqueformixedattributedataanalysis
AT bongkeunjeong designingasupervisedfeatureselectiontechniqueformixedattributedataanalysis
AT nandileslie designingasupervisedfeatureselectiontechniqueformixedattributedataanalysis
AT charleskamhoua designingasupervisedfeatureselectiontechniqueformixedattributedataanalysis
AT sooyeonji designingasupervisedfeatureselectiontechniqueformixedattributedataanalysis