A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox

Gearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact tha...

Full description

Bibliographic Details
Main Authors: Di Liu, Xiangfeng Zhang, Zhiyu Zhang, Hong Jiang
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/10/4792
_version_ 1797598389445066752
author Di Liu
Xiangfeng Zhang
Zhiyu Zhang
Hong Jiang
author_facet Di Liu
Xiangfeng Zhang
Zhiyu Zhang
Hong Jiang
author_sort Di Liu
collection DOAJ
description Gearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact that numerous data-driven intelligent diagnosis approaches have been suggested and employed for compound fault diagnosis in recent years with successful outcomes. In order to achieve the best diagnostic performance as the ultimate objective, a feature selection and fault decoupling framework is proposed in this paper. That is based on multi-label K-nearest neighbors (ML-kNN) as classifiers and can automatically determine the optimal subset from the original high-dimensional feature set. The proposed feature selection method is a hybrid framework that can be divided into three stages. The Fisher score, information gain, and Pearson’s correlation coefficient are three filter models that are used in the first stage to pre-rank candidate features. In the second stage, a weighting scheme based on the weighted average method is proposed to fuse the pre-ranking results obtained in the first stage and optimize the weights using a genetic algorithm to re-rank the features. The optimal subset is automatically and iteratively found in the third stage using three heuristic strategies, including binary search, sequential forward search, and sequential backward search. The method takes into account the consideration of feature irrelevance, redundancy and inter-feature interaction in the selection process, and the selected optimal subsets have better diagnostic performance. In two gearbox compound fault datasets, ML-kNN performs exceptionally well using the optimal subset with subset accuracy of 96.22% and 100%. The experimental findings demonstrate the effectiveness of the proposed method in predicting various labels for compound fault samples to identify and decouple compound faults. The proposed method performs better in terms of classification accuracy and optimal subset dimensionality when compared to other existing methods.
first_indexed 2024-03-11T03:20:29Z
format Article
id doaj.art-82e4cd898b1f462e9e419b5f30764bc5
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-11T03:20:29Z
publishDate 2023-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-82e4cd898b1f462e9e419b5f30764bc52023-11-18T03:12:39ZengMDPI AGSensors1424-82202023-05-012310479210.3390/s23104792A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for GearboxDi Liu0Xiangfeng Zhang1Zhiyu Zhang2Hong Jiang3College of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaGearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact that numerous data-driven intelligent diagnosis approaches have been suggested and employed for compound fault diagnosis in recent years with successful outcomes. In order to achieve the best diagnostic performance as the ultimate objective, a feature selection and fault decoupling framework is proposed in this paper. That is based on multi-label K-nearest neighbors (ML-kNN) as classifiers and can automatically determine the optimal subset from the original high-dimensional feature set. The proposed feature selection method is a hybrid framework that can be divided into three stages. The Fisher score, information gain, and Pearson’s correlation coefficient are three filter models that are used in the first stage to pre-rank candidate features. In the second stage, a weighting scheme based on the weighted average method is proposed to fuse the pre-ranking results obtained in the first stage and optimize the weights using a genetic algorithm to re-rank the features. The optimal subset is automatically and iteratively found in the third stage using three heuristic strategies, including binary search, sequential forward search, and sequential backward search. The method takes into account the consideration of feature irrelevance, redundancy and inter-feature interaction in the selection process, and the selected optimal subsets have better diagnostic performance. In two gearbox compound fault datasets, ML-kNN performs exceptionally well using the optimal subset with subset accuracy of 96.22% and 100%. The experimental findings demonstrate the effectiveness of the proposed method in predicting various labels for compound fault samples to identify and decouple compound faults. The proposed method performs better in terms of classification accuracy and optimal subset dimensionality when compared to other existing methods.https://www.mdpi.com/1424-8220/23/10/4792fault diagnosiscompound fault decouplinggearboxfeature selectionmulti-label learning
spellingShingle Di Liu
Xiangfeng Zhang
Zhiyu Zhang
Hong Jiang
A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
Sensors
fault diagnosis
compound fault decoupling
gearbox
feature selection
multi-label learning
title A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
title_full A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
title_fullStr A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
title_full_unstemmed A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
title_short A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
title_sort hybrid feature selection and multi label driven intelligent fault diagnosis method for gearbox
topic fault diagnosis
compound fault decoupling
gearbox
feature selection
multi-label learning
url https://www.mdpi.com/1424-8220/23/10/4792
work_keys_str_mv AT diliu ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT xiangfengzhang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT zhiyuzhang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT hongjiang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT diliu hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT xiangfengzhang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT zhiyuzhang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox
AT hongjiang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox