A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory

The density peaks clustering (DPC) algorithm is not sensitive to the recognition of halo nodes. The halo nodes at the edge of the density peaks clustering algorithm has a lower local density. The outliers are distributed in halo nodes. The novel halo identification method based on density peaks clus...

Full description

Bibliographic Details
Main Authors: Limin Wang, Wei Zhou, Honghuan Wang, Milan Parmar, Xuming Han
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8918459/
_version_ 1830130554949861376
author Limin Wang
Wei Zhou
Honghuan Wang
Milan Parmar
Xuming Han
author_facet Limin Wang
Wei Zhou
Honghuan Wang
Milan Parmar
Xuming Han
author_sort Limin Wang
collection DOAJ
description The density peaks clustering (DPC) algorithm is not sensitive to the recognition of halo nodes. The halo nodes at the edge of the density peaks clustering algorithm has a lower local density. The outliers are distributed in halo nodes. The novel halo identification method based on density peaks clustering algorithm utilize the advantage of DBSCAN algorithm to quickly identify outliers, which improved the sensitivity to halo nodes. However, the identified halo nodes cannot be effectively assigned to adjacent clusters. Therefore, this paper will use K-nearest neighbor (KNN) algorithm to classify the identified halo nodes. K-nearest neighbor is the simplest and most efficient classification method. The KNN algorithm has the advantages of high accuracy, insensitivity to outliers and no input hypothesis data. Hence, we proposed a novel density peaks clustering halo node assignment algorithm based on K-nearest neighbor theory (KNN-HDPC). KNN-HDPC can grasp the internal relations between outliers and cluster nodes more deeply, so as to dig out the deeper relations between nodes. Experimental results demonstrate that the proposed algorithm can effectively cluster and reclassify a large number of complex data. We can quickly dig out the potential relationship between noise points and cluster points. The improved algorithm has higher clustering accuracy than the original DPC algorithm, and essentially has more robust clustering results.
first_indexed 2024-12-17T05:57:57Z
format Article
id doaj.art-df7959a685904d23a14db762cf230093
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-17T05:57:57Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-df7959a685904d23a14db762cf2300932022-12-21T22:00:57ZengIEEEIEEE Access2169-35362019-01-01717438017439010.1109/ACCESS.2019.29572428918459A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor TheoryLimin Wang0https://orcid.org/0000-0002-0576-3950Wei Zhou1https://orcid.org/0000-0003-2227-9084Honghuan Wang2https://orcid.org/0000-0003-0276-9537Milan Parmar3https://orcid.org/0000-0002-7596-407XXuming Han4https://orcid.org/0000-0002-6213-5600School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun, ChinaSchool of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun, ChinaSchool of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun, ChinaSchool of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun, ChinaSchool of Computer Science and Engineering, Changchun University of Technology, Changchun, ChinaThe density peaks clustering (DPC) algorithm is not sensitive to the recognition of halo nodes. The halo nodes at the edge of the density peaks clustering algorithm has a lower local density. The outliers are distributed in halo nodes. The novel halo identification method based on density peaks clustering algorithm utilize the advantage of DBSCAN algorithm to quickly identify outliers, which improved the sensitivity to halo nodes. However, the identified halo nodes cannot be effectively assigned to adjacent clusters. Therefore, this paper will use K-nearest neighbor (KNN) algorithm to classify the identified halo nodes. K-nearest neighbor is the simplest and most efficient classification method. The KNN algorithm has the advantages of high accuracy, insensitivity to outliers and no input hypothesis data. Hence, we proposed a novel density peaks clustering halo node assignment algorithm based on K-nearest neighbor theory (KNN-HDPC). KNN-HDPC can grasp the internal relations between outliers and cluster nodes more deeply, so as to dig out the deeper relations between nodes. Experimental results demonstrate that the proposed algorithm can effectively cluster and reclassify a large number of complex data. We can quickly dig out the potential relationship between noise points and cluster points. The improved algorithm has higher clustering accuracy than the original DPC algorithm, and essentially has more robust clustering results.https://ieeexplore.ieee.org/document/8918459/Density peaks clusteringhalo nodeK-nearest neighborlow-density nodes
spellingShingle Limin Wang
Wei Zhou
Honghuan Wang
Milan Parmar
Xuming Han
A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
IEEE Access
Density peaks clustering
halo node
K-nearest neighbor
low-density nodes
title A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
title_full A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
title_fullStr A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
title_full_unstemmed A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
title_short A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
title_sort novel density peaks clustering halo node assignment method based on k nearest neighbor theory
topic Density peaks clustering
halo node
K-nearest neighbor
low-density nodes
url https://ieeexplore.ieee.org/document/8918459/
work_keys_str_mv AT liminwang anoveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT weizhou anoveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT honghuanwang anoveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT milanparmar anoveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT xuminghan anoveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT liminwang noveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT weizhou noveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT honghuanwang noveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT milanparmar noveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory
AT xuminghan noveldensitypeaksclusteringhalonodeassignmentmethodbasedonknearestneighbortheory