Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators

The number of normal samples of wind turbine generators is much larger than the number of fault samples. To solve the problem of imbalanced classification in wind turbine generator fault detection, a cost-sensitive extremely randomized trees (CS-ERT) algorithm is proposed in this paper, in which the...

Full description

Bibliographic Details
Main Authors: Mingzhu Tang, Yutao Chen, Huawei Wu, Qi Zhao, Wen Long, Victor S. Sheng, Jiabiao Yi
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-05-01
Series:Frontiers in Energy Research
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fenrg.2021.686616/full
_version_ 1818606828307611648
author Mingzhu Tang
Yutao Chen
Huawei Wu
Qi Zhao
Wen Long
Victor S. Sheng
Jiabiao Yi
author_facet Mingzhu Tang
Yutao Chen
Huawei Wu
Qi Zhao
Wen Long
Victor S. Sheng
Jiabiao Yi
author_sort Mingzhu Tang
collection DOAJ
description The number of normal samples of wind turbine generators is much larger than the number of fault samples. To solve the problem of imbalanced classification in wind turbine generator fault detection, a cost-sensitive extremely randomized trees (CS-ERT) algorithm is proposed in this paper, in which the cost-sensitive learning method is introduced into an extremely randomized trees (ERT) algorithm. Based on the classification misclassification cost and class distribution, the misclassification cost gain (MCG) is proposed as the score measure of the CS-ERT model growth process to improve the classification accuracy of minority classes. The Hilbert-Schmidt independence criterion lasso (HSICLasso) feature selection method is used to select strongly correlated non-redundant features of doubly-fed wind turbine generators. The effectiveness of the method was verified by experiments on four different failure datasets of wind turbine generators. The experiment results show that average missing detection rate, average misclassification cost and gMean of the improved algorithm better than those of the ERT algorithm. In addition, compared with the CSForest, AdaCost and MetaCost methods, the proposed method has better real-time fault detection performance.
first_indexed 2024-12-16T14:17:03Z
format Article
id doaj.art-d49bc9dfa9ee41c4bdfa206013d1d186
institution Directory Open Access Journal
issn 2296-598X
language English
last_indexed 2024-12-16T14:17:03Z
publishDate 2021-05-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Energy Research
spelling doaj.art-d49bc9dfa9ee41c4bdfa206013d1d1862022-12-21T22:28:34ZengFrontiers Media S.A.Frontiers in Energy Research2296-598X2021-05-01910.3389/fenrg.2021.686616686616Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine GeneratorsMingzhu Tang0Yutao Chen1Huawei Wu2Qi Zhao3Wen Long4Victor S. Sheng5Jiabiao Yi6School of Energy and Power Engineering, Changsha University of Science and Technology, Changsha, ChinaSchool of Energy and Power Engineering, Changsha University of Science and Technology, Changsha, ChinaHubei Key Laboratory of Power System Design and Test for Electrical Vehicle, Hubei University of Arts and Science, Xiangyang, ChinaSchool of Energy and Power Engineering, Changsha University of Science and Technology, Changsha, ChinaGuizhou Key Laboratory of Economics System Simulation, Guizhou University of Finance and Economics, Guiyang, ChinaComputer Science Department, Texas Tech University, Lubbock, TX, United StatesSchool of Energy and Power Engineering, Changsha University of Science and Technology, Changsha, ChinaThe number of normal samples of wind turbine generators is much larger than the number of fault samples. To solve the problem of imbalanced classification in wind turbine generator fault detection, a cost-sensitive extremely randomized trees (CS-ERT) algorithm is proposed in this paper, in which the cost-sensitive learning method is introduced into an extremely randomized trees (ERT) algorithm. Based on the classification misclassification cost and class distribution, the misclassification cost gain (MCG) is proposed as the score measure of the CS-ERT model growth process to improve the classification accuracy of minority classes. The Hilbert-Schmidt independence criterion lasso (HSICLasso) feature selection method is used to select strongly correlated non-redundant features of doubly-fed wind turbine generators. The effectiveness of the method was verified by experiments on four different failure datasets of wind turbine generators. The experiment results show that average missing detection rate, average misclassification cost and gMean of the improved algorithm better than those of the ERT algorithm. In addition, compared with the CSForest, AdaCost and MetaCost methods, the proposed method has better real-time fault detection performance.https://www.frontiersin.org/articles/10.3389/fenrg.2021.686616/fullfault detectionfault diagnosiscost-sensitive learningextremely randomized treesclass imbalancewind turbine generator
spellingShingle Mingzhu Tang
Yutao Chen
Huawei Wu
Qi Zhao
Wen Long
Victor S. Sheng
Jiabiao Yi
Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
Frontiers in Energy Research
fault detection
fault diagnosis
cost-sensitive learning
extremely randomized trees
class imbalance
wind turbine generator
title Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
title_full Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
title_fullStr Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
title_full_unstemmed Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
title_short Cost-Sensitive Extremely Randomized Trees Algorithm for Online Fault Detection of Wind Turbine Generators
title_sort cost sensitive extremely randomized trees algorithm for online fault detection of wind turbine generators
topic fault detection
fault diagnosis
cost-sensitive learning
extremely randomized trees
class imbalance
wind turbine generator
url https://www.frontiersin.org/articles/10.3389/fenrg.2021.686616/full
work_keys_str_mv AT mingzhutang costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT yutaochen costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT huaweiwu costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT qizhao costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT wenlong costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT victorssheng costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators
AT jiabiaoyi costsensitiveextremelyrandomizedtreesalgorithmforonlinefaultdetectionofwindturbinegenerators