An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm
Local Outlier Factor (LOF) outlier detecting algorithm has good accuracy in detecting global and local outliers. However, the algorithm needs to traverse the entire dataset when calculating the local outlier factor of each data point, which adds extra time overhead and makes the algorithm execution...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8735734/ |
_version_ | 1819276597077737472 |
---|---|
author | Ping Yang Dan Wang Zhuojun Wei Xiaolin Du Tong Li |
author_facet | Ping Yang Dan Wang Zhuojun Wei Xiaolin Du Tong Li |
author_sort | Ping Yang |
collection | DOAJ |
description | Local Outlier Factor (LOF) outlier detecting algorithm has good accuracy in detecting global and local outliers. However, the algorithm needs to traverse the entire dataset when calculating the local outlier factor of each data point, which adds extra time overhead and makes the algorithm execution inefficient. In addition, if the K-distance neighborhood of an outlier point P contains some outliers that are incorrectly judged by the algorithm as normal points, then P may be misidentified as normal point. To solve the above problems, this paper proposes a Neighbor Entropy Local Outlier Factor (NELOF) outlier detecting algorithm. Firstly, we improve the Self-Organizing Feature Map (SOFM) algorithm and use the optimized SOFM clustering algorithm to cluster the dataset. Therefore, the calculation of each data point's local outlier factor only needs to be performed inside the small cluster. Secondly, this paper replaces the K-distance neighborhood with relative K-distance neighborhood and utilizes the entropy of relative K neighborhood to redefine the local outlier factor, which improves the accuracy of outlier detection. Experiments results confirm that our optimized SOFM algorithm can avoid the random selection of neurons, and improve clustering effect of traditional SOFM algorithm. In addition, the proposed NELOF algorithm outperforms LOF algorithm in both accuracy and execution time of outlier detection. |
first_indexed | 2024-12-23T23:42:45Z |
format | Article |
id | doaj.art-255b25808fb34fb9be2c0df158d33047 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-23T23:42:45Z |
publishDate | 2019-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-255b25808fb34fb9be2c0df158d330472022-12-21T17:25:36ZengIEEEIEEE Access2169-35362019-01-01711591411592510.1109/ACCESS.2019.29220048735734An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering AlgorithmPing Yang0https://orcid.org/0000-0002-7091-2289Dan Wang1Zhuojun Wei2Xiaolin Du3Tong Li4Faculty of Information Technology, Beijing University of Technology, Beijing, ChinaFaculty of Information Technology, Beijing University of Technology, Beijing, ChinaHuawei Software Technology Co. Ltd., Nanjing, ChinaFaculty of Information Technology, Beijing University of Technology, Beijing, ChinaFaculty of Information Technology, Beijing University of Technology, Beijing, ChinaLocal Outlier Factor (LOF) outlier detecting algorithm has good accuracy in detecting global and local outliers. However, the algorithm needs to traverse the entire dataset when calculating the local outlier factor of each data point, which adds extra time overhead and makes the algorithm execution inefficient. In addition, if the K-distance neighborhood of an outlier point P contains some outliers that are incorrectly judged by the algorithm as normal points, then P may be misidentified as normal point. To solve the above problems, this paper proposes a Neighbor Entropy Local Outlier Factor (NELOF) outlier detecting algorithm. Firstly, we improve the Self-Organizing Feature Map (SOFM) algorithm and use the optimized SOFM clustering algorithm to cluster the dataset. Therefore, the calculation of each data point's local outlier factor only needs to be performed inside the small cluster. Secondly, this paper replaces the K-distance neighborhood with relative K-distance neighborhood and utilizes the entropy of relative K neighborhood to redefine the local outlier factor, which improves the accuracy of outlier detection. Experiments results confirm that our optimized SOFM algorithm can avoid the random selection of neurons, and improve clustering effect of traditional SOFM algorithm. In addition, the proposed NELOF algorithm outperforms LOF algorithm in both accuracy and execution time of outlier detection.https://ieeexplore.ieee.org/document/8735734/Canopyclusteroutlier detectionLOFSOFM |
spellingShingle | Ping Yang Dan Wang Zhuojun Wei Xiaolin Du Tong Li An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm IEEE Access Canopy cluster outlier detection LOF SOFM |
title | An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm |
title_full | An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm |
title_fullStr | An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm |
title_full_unstemmed | An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm |
title_short | An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm |
title_sort | outlier detection approach based on improved self organizing feature map clustering algorithm |
topic | Canopy cluster outlier detection LOF SOFM |
url | https://ieeexplore.ieee.org/document/8735734/ |
work_keys_str_mv | AT pingyang anoutlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT danwang anoutlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT zhuojunwei anoutlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT xiaolindu anoutlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT tongli anoutlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT pingyang outlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT danwang outlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT zhuojunwei outlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT xiaolindu outlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm AT tongli outlierdetectionapproachbasedonimprovedselforganizingfeaturemapclusteringalgorithm |