Summary: | Anomaly detection of time series has been widely used in various fields. Most detection methods depend either on assumptions about data distribution or manual threshold setting. If the assumption is incorrect, the effectiveness of detection technology will be greatly reduced. To deal with this problem, we propose a maximum likelihood estimation method based on particle swarm optimization for generalized Pareto model to detect outliers of time series, which can be called Generalized Pareto Model Based on Particle Swarm Optimization (GPMPSO). Because the generalized Pareto model is multidimensional, we introduce a comprehensive learning strategy to improve search ability of particle swarm algorithm. Due to the multiple peaks of the log-likelihood function of generalized Pareto model, we apply dynamic neighbors to reduce the possibility of particle swarm optimization falling into local optimum. Moreover, we propose a new processing model Big Drift Streaming Peak Over Threshold (BDSPOT) to enhance the capability of the data stream processor. Our algorithm is tested on various real-world datasets which demonstrate its very competitive performance.
|