Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
Kernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9241766/ |
_version_ | 1818617216670629888 |
---|---|
author | Ruomu Tan James R. Ottewill Nina F. Thornhill |
author_facet | Ruomu Tan James R. Ottewill Nina F. Thornhill |
author_sort | Ruomu Tan |
collection | DOAJ |
description | Kernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width when using RBF kernels in KPCA, showing that inappropriate kernel widths result in RBF-KPCA being unable to capture nonlinearity present in data. The paper also considers the choice of monitoring statistics when RBF-KPCA is applied to anomaly detection. Linear PCA requires two monitoring statistics. The Hotelling's T<sup>2</sup> monitoring statistic detects when a sample exceeds the healthy operating range, while the Squared Prediction Error (SPE) monitoring statistic detects the case when the sample does not follow the model of the training data. The analysis in this article shows that SPE for RBF-KPCA can detect both cases. Moreover, unlike the case of linear PCA, the T<sup>2</sup> monitoring statistic for RBF-KPCA is non-monotonic with respect to the magnitude of the anomaly, making it not optimal as a monitoring statistic. The paper presents examples to illustrate these points. The paper also provides a detailed mathematical analysis which explains the observations from a theoretical perspective. Tuning strategies are proposed for setting the kernel width and the detection threshold of the monitoring statistic. The performance of optimally tuned RBF-KPCA for anomaly detection is demonstrated via numerical simulation and a benchmark dataset from an industrialscale facility. |
first_indexed | 2024-12-16T17:02:10Z |
format | Article |
id | doaj.art-921d2e98a92c460b8381146275fc67b0 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-16T17:02:10Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-921d2e98a92c460b8381146275fc67b02022-12-21T22:23:42ZengIEEEIEEE Access2169-35362020-01-01819832819834210.1109/ACCESS.2020.30345509241766Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function KernelsRuomu Tan0https://orcid.org/0000-0002-9151-7245James R. Ottewill1Nina F. Thornhill2https://orcid.org/0000-0001-5604-8335Centre for Process Systems Engineering, Imperial College London, London, U.KHitachi ABB Power Grids Research, Kraków, PolandCentre for Process Systems Engineering, Imperial College London, London, U.KKernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width when using RBF kernels in KPCA, showing that inappropriate kernel widths result in RBF-KPCA being unable to capture nonlinearity present in data. The paper also considers the choice of monitoring statistics when RBF-KPCA is applied to anomaly detection. Linear PCA requires two monitoring statistics. The Hotelling's T<sup>2</sup> monitoring statistic detects when a sample exceeds the healthy operating range, while the Squared Prediction Error (SPE) monitoring statistic detects the case when the sample does not follow the model of the training data. The analysis in this article shows that SPE for RBF-KPCA can detect both cases. Moreover, unlike the case of linear PCA, the T<sup>2</sup> monitoring statistic for RBF-KPCA is non-monotonic with respect to the magnitude of the anomaly, making it not optimal as a monitoring statistic. The paper presents examples to illustrate these points. The paper also provides a detailed mathematical analysis which explains the observations from a theoretical perspective. Tuning strategies are proposed for setting the kernel width and the detection threshold of the monitoring statistic. The performance of optimally tuned RBF-KPCA for anomaly detection is demonstrated via numerical simulation and a benchmark dataset from an industrialscale facility.https://ieeexplore.ieee.org/document/9241766/Anomaly detectionasymptotic analysisfault detectionkernel principal component analysismonitoring statisticmultivariate statistics |
spellingShingle | Ruomu Tan James R. Ottewill Nina F. Thornhill Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels IEEE Access Anomaly detection asymptotic analysis fault detection kernel principal component analysis monitoring statistic multivariate statistics |
title | Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels |
title_full | Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels |
title_fullStr | Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels |
title_full_unstemmed | Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels |
title_short | Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels |
title_sort | monitoring statistics and tuning of kernel principal component analysis with radial basis function kernels |
topic | Anomaly detection asymptotic analysis fault detection kernel principal component analysis monitoring statistic multivariate statistics |
url | https://ieeexplore.ieee.org/document/9241766/ |
work_keys_str_mv | AT ruomutan monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels AT jamesrottewill monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels AT ninafthornhill monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels |