SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection

In industrial production processes, the data usually have high-dimensional characteristics. When a support vector machine (SVM) is used for fault detection, it takes a long time to run. For high-dimensional data, principal component analysis (PCA) and independent component analysis (ICA) are very ef...

Full description

Bibliographic Details
Main Authors: Jinyu Guo, Tao Li, Yuan Li
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9411859/
_version_ 1811274325048688640
author Jinyu Guo
Tao Li
Yuan Li
author_facet Jinyu Guo
Tao Li
Yuan Li
author_sort Jinyu Guo
collection DOAJ
description In industrial production processes, the data usually have high-dimensional characteristics. When a support vector machine (SVM) is used for fault detection, it takes a long time to run. For high-dimensional data, principal component analysis (PCA) and independent component analysis (ICA) are very effective dimensionality reduction algorithms. Since the data follow different distributions in industrial processes, PCA is commonly used to process Gaussian-distributed data, and ICA is used to process non-Gaussian distributed data. Addressing these limitations, a novel fault detection method of SVM based on Gaussian and non-Gaussian double subspace (DSSVM) is proposed. The Kolmogorov-Smirnov (KS) test is used to determine the normal distribution characteristics of the process variables in the original data. The process variables are divided into a Gaussian subspace and a non-Gaussian subspace. Fault detection models of the Gaussian subspace are established based on PCA, and models of the non-Gaussian subspace are based on ICA. To reduce the effect of autocorrelation among process variables on the SVM, the delay and time difference input characteristics are introduced into the principal components obtained in the Gaussian subspace and independent components obtained in the non-Gaussian subspace. Finally, the time delay matrix and time difference matrix are combined, and the SVM model is used for fault detection and monitoring. The DSSVM method reduces the data dimensions and eliminates the effect of autocorrelation among process variables on the detection results. The proposed method is applied to a multivariable numerical simulation and the Tennessee-Eastman industrial process. Comparisons with simulation results for the kernel principal component analysis (KPCA), kernel independent component analysis (KICA), SVM and the statistical process monitoring method based on variable distribution characteristics (VDSPM) further verify the effectiveness of the algorithm.
first_indexed 2024-04-12T23:16:59Z
format Article
id doaj.art-2e93ac46c038485f955d6c5443fd7dbf
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T23:16:59Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-2e93ac46c038485f955d6c5443fd7dbf2022-12-22T03:12:38ZengIEEEIEEE Access2169-35362021-01-019665196653010.1109/ACCESS.2021.30752739411859SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault DetectionJinyu Guo0Tao Li1https://orcid.org/0000-0001-9187-2282Yuan Li2College of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaCollege of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaCollege of Information Engineering, Shenyang University of Chemical Technology, Shenyang, ChinaIn industrial production processes, the data usually have high-dimensional characteristics. When a support vector machine (SVM) is used for fault detection, it takes a long time to run. For high-dimensional data, principal component analysis (PCA) and independent component analysis (ICA) are very effective dimensionality reduction algorithms. Since the data follow different distributions in industrial processes, PCA is commonly used to process Gaussian-distributed data, and ICA is used to process non-Gaussian distributed data. Addressing these limitations, a novel fault detection method of SVM based on Gaussian and non-Gaussian double subspace (DSSVM) is proposed. The Kolmogorov-Smirnov (KS) test is used to determine the normal distribution characteristics of the process variables in the original data. The process variables are divided into a Gaussian subspace and a non-Gaussian subspace. Fault detection models of the Gaussian subspace are established based on PCA, and models of the non-Gaussian subspace are based on ICA. To reduce the effect of autocorrelation among process variables on the SVM, the delay and time difference input characteristics are introduced into the principal components obtained in the Gaussian subspace and independent components obtained in the non-Gaussian subspace. Finally, the time delay matrix and time difference matrix are combined, and the SVM model is used for fault detection and monitoring. The DSSVM method reduces the data dimensions and eliminates the effect of autocorrelation among process variables on the detection results. The proposed method is applied to a multivariable numerical simulation and the Tennessee-Eastman industrial process. Comparisons with simulation results for the kernel principal component analysis (KPCA), kernel independent component analysis (KICA), SVM and the statistical process monitoring method based on variable distribution characteristics (VDSPM) further verify the effectiveness of the algorithm.https://ieeexplore.ieee.org/document/9411859/Fault detectionKolmogorov-Smirnov testprincipal component analysisindependent component analysissupport vector machine
spellingShingle Jinyu Guo
Tao Li
Yuan Li
SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
IEEE Access
Fault detection
Kolmogorov-Smirnov test
principal component analysis
independent component analysis
support vector machine
title SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
title_full SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
title_fullStr SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
title_full_unstemmed SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
title_short SVM Based on Gaussian and Non-Gaussian Double Subspace for Fault Detection
title_sort svm based on gaussian and non gaussian double subspace for fault detection
topic Fault detection
Kolmogorov-Smirnov test
principal component analysis
independent component analysis
support vector machine
url https://ieeexplore.ieee.org/document/9411859/
work_keys_str_mv AT jinyuguo svmbasedongaussianandnongaussiandoublesubspaceforfaultdetection
AT taoli svmbasedongaussianandnongaussiandoublesubspaceforfaultdetection
AT yuanli svmbasedongaussianandnongaussiandoublesubspaceforfaultdetection