A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox
Gearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact tha...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-05-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/23/10/4792 |
_version_ | 1797598389445066752 |
---|---|
author | Di Liu Xiangfeng Zhang Zhiyu Zhang Hong Jiang |
author_facet | Di Liu Xiangfeng Zhang Zhiyu Zhang Hong Jiang |
author_sort | Di Liu |
collection | DOAJ |
description | Gearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact that numerous data-driven intelligent diagnosis approaches have been suggested and employed for compound fault diagnosis in recent years with successful outcomes. In order to achieve the best diagnostic performance as the ultimate objective, a feature selection and fault decoupling framework is proposed in this paper. That is based on multi-label K-nearest neighbors (ML-kNN) as classifiers and can automatically determine the optimal subset from the original high-dimensional feature set. The proposed feature selection method is a hybrid framework that can be divided into three stages. The Fisher score, information gain, and Pearson’s correlation coefficient are three filter models that are used in the first stage to pre-rank candidate features. In the second stage, a weighting scheme based on the weighted average method is proposed to fuse the pre-ranking results obtained in the first stage and optimize the weights using a genetic algorithm to re-rank the features. The optimal subset is automatically and iteratively found in the third stage using three heuristic strategies, including binary search, sequential forward search, and sequential backward search. The method takes into account the consideration of feature irrelevance, redundancy and inter-feature interaction in the selection process, and the selected optimal subsets have better diagnostic performance. In two gearbox compound fault datasets, ML-kNN performs exceptionally well using the optimal subset with subset accuracy of 96.22% and 100%. The experimental findings demonstrate the effectiveness of the proposed method in predicting various labels for compound fault samples to identify and decouple compound faults. The proposed method performs better in terms of classification accuracy and optimal subset dimensionality when compared to other existing methods. |
first_indexed | 2024-03-11T03:20:29Z |
format | Article |
id | doaj.art-82e4cd898b1f462e9e419b5f30764bc5 |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-11T03:20:29Z |
publishDate | 2023-05-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-82e4cd898b1f462e9e419b5f30764bc52023-11-18T03:12:39ZengMDPI AGSensors1424-82202023-05-012310479210.3390/s23104792A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for GearboxDi Liu0Xiangfeng Zhang1Zhiyu Zhang2Hong Jiang3College of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaCollege of Intelligent Manufacturing and Industrial Modernization, Xinjiang University, Urumchi 830017, ChinaGearboxes are utilized in practically all complicated machinery equipment because they have great transmission accuracy and load capacities, so their failure frequently results in significant financial losses. The classification of high-dimensional data remains a difficult topic despite the fact that numerous data-driven intelligent diagnosis approaches have been suggested and employed for compound fault diagnosis in recent years with successful outcomes. In order to achieve the best diagnostic performance as the ultimate objective, a feature selection and fault decoupling framework is proposed in this paper. That is based on multi-label K-nearest neighbors (ML-kNN) as classifiers and can automatically determine the optimal subset from the original high-dimensional feature set. The proposed feature selection method is a hybrid framework that can be divided into three stages. The Fisher score, information gain, and Pearson’s correlation coefficient are three filter models that are used in the first stage to pre-rank candidate features. In the second stage, a weighting scheme based on the weighted average method is proposed to fuse the pre-ranking results obtained in the first stage and optimize the weights using a genetic algorithm to re-rank the features. The optimal subset is automatically and iteratively found in the third stage using three heuristic strategies, including binary search, sequential forward search, and sequential backward search. The method takes into account the consideration of feature irrelevance, redundancy and inter-feature interaction in the selection process, and the selected optimal subsets have better diagnostic performance. In two gearbox compound fault datasets, ML-kNN performs exceptionally well using the optimal subset with subset accuracy of 96.22% and 100%. The experimental findings demonstrate the effectiveness of the proposed method in predicting various labels for compound fault samples to identify and decouple compound faults. The proposed method performs better in terms of classification accuracy and optimal subset dimensionality when compared to other existing methods.https://www.mdpi.com/1424-8220/23/10/4792fault diagnosiscompound fault decouplinggearboxfeature selectionmulti-label learning |
spellingShingle | Di Liu Xiangfeng Zhang Zhiyu Zhang Hong Jiang A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox Sensors fault diagnosis compound fault decoupling gearbox feature selection multi-label learning |
title | A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox |
title_full | A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox |
title_fullStr | A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox |
title_full_unstemmed | A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox |
title_short | A Hybrid Feature Selection and Multi-Label Driven Intelligent Fault Diagnosis Method for Gearbox |
title_sort | hybrid feature selection and multi label driven intelligent fault diagnosis method for gearbox |
topic | fault diagnosis compound fault decoupling gearbox feature selection multi-label learning |
url | https://www.mdpi.com/1424-8220/23/10/4792 |
work_keys_str_mv | AT diliu ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT xiangfengzhang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT zhiyuzhang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT hongjiang ahybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT diliu hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT xiangfengzhang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT zhiyuzhang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox AT hongjiang hybridfeatureselectionandmultilabeldrivenintelligentfaultdiagnosismethodforgearbox |