Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels

Kernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width...

Full description

Bibliographic Details
Main Authors: Ruomu Tan, James R. Ottewill, Nina F. Thornhill
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9241766/
_version_ 1818617216670629888
author Ruomu Tan
James R. Ottewill
Nina F. Thornhill
author_facet Ruomu Tan
James R. Ottewill
Nina F. Thornhill
author_sort Ruomu Tan
collection DOAJ
description Kernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width when using RBF kernels in KPCA, showing that inappropriate kernel widths result in RBF-KPCA being unable to capture nonlinearity present in data. The paper also considers the choice of monitoring statistics when RBF-KPCA is applied to anomaly detection. Linear PCA requires two monitoring statistics. The Hotelling's T<sup>2</sup> monitoring statistic detects when a sample exceeds the healthy operating range, while the Squared Prediction Error (SPE) monitoring statistic detects the case when the sample does not follow the model of the training data. The analysis in this article shows that SPE for RBF-KPCA can detect both cases. Moreover, unlike the case of linear PCA, the T<sup>2</sup> monitoring statistic for RBF-KPCA is non-monotonic with respect to the magnitude of the anomaly, making it not optimal as a monitoring statistic. The paper presents examples to illustrate these points. The paper also provides a detailed mathematical analysis which explains the observations from a theoretical perspective. Tuning strategies are proposed for setting the kernel width and the detection threshold of the monitoring statistic. The performance of optimally tuned RBF-KPCA for anomaly detection is demonstrated via numerical simulation and a benchmark dataset from an industrialscale facility.
first_indexed 2024-12-16T17:02:10Z
format Article
id doaj.art-921d2e98a92c460b8381146275fc67b0
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-16T17:02:10Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-921d2e98a92c460b8381146275fc67b02022-12-21T22:23:42ZengIEEEIEEE Access2169-35362020-01-01819832819834210.1109/ACCESS.2020.30345509241766Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function KernelsRuomu Tan0https://orcid.org/0000-0002-9151-7245James R. Ottewill1Nina F. Thornhill2https://orcid.org/0000-0001-5604-8335Centre for Process Systems Engineering, Imperial College London, London, U.KHitachi ABB Power Grids Research, Krak&#x00F3;w, PolandCentre for Process Systems Engineering, Imperial College London, London, U.KKernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width when using RBF kernels in KPCA, showing that inappropriate kernel widths result in RBF-KPCA being unable to capture nonlinearity present in data. The paper also considers the choice of monitoring statistics when RBF-KPCA is applied to anomaly detection. Linear PCA requires two monitoring statistics. The Hotelling's T<sup>2</sup> monitoring statistic detects when a sample exceeds the healthy operating range, while the Squared Prediction Error (SPE) monitoring statistic detects the case when the sample does not follow the model of the training data. The analysis in this article shows that SPE for RBF-KPCA can detect both cases. Moreover, unlike the case of linear PCA, the T<sup>2</sup> monitoring statistic for RBF-KPCA is non-monotonic with respect to the magnitude of the anomaly, making it not optimal as a monitoring statistic. The paper presents examples to illustrate these points. The paper also provides a detailed mathematical analysis which explains the observations from a theoretical perspective. Tuning strategies are proposed for setting the kernel width and the detection threshold of the monitoring statistic. The performance of optimally tuned RBF-KPCA for anomaly detection is demonstrated via numerical simulation and a benchmark dataset from an industrialscale facility.https://ieeexplore.ieee.org/document/9241766/Anomaly detectionasymptotic analysisfault detectionkernel principal component analysismonitoring statisticmultivariate statistics
spellingShingle Ruomu Tan
James R. Ottewill
Nina F. Thornhill
Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
IEEE Access
Anomaly detection
asymptotic analysis
fault detection
kernel principal component analysis
monitoring statistic
multivariate statistics
title Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
title_full Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
title_fullStr Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
title_full_unstemmed Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
title_short Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels
title_sort monitoring statistics and tuning of kernel principal component analysis with radial basis function kernels
topic Anomaly detection
asymptotic analysis
fault detection
kernel principal component analysis
monitoring statistic
multivariate statistics
url https://ieeexplore.ieee.org/document/9241766/
work_keys_str_mv AT ruomutan monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels
AT jamesrottewill monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels
AT ninafthornhill monitoringstatisticsandtuningofkernelprincipalcomponentanalysiswithradialbasisfunctionkernels