A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy

When the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above...

Full description

Bibliographic Details
Main Authors: Fuxiang Li, Ming Zhou, Shu Li, Tianhao Yang
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9885191/
_version_ 1798001669320998912
author Fuxiang Li
Ming Zhou
Shu Li
Tianhao Yang
author_facet Fuxiang Li
Ming Zhou
Shu Li
Tianhao Yang
author_sort Fuxiang Li
collection DOAJ
description When the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above problems, we propose a new density peak clustering algorithm based on cluster fusion strategy. First, the algorithm screens out the candidate cluster centers by setting two new thresholds to avoid the influence of noise points and outliers. Second, the remaining data points are allocated according to the density peak clustering algorithm to obtain the initial clusters. Third, considering the structural characteristics and spatial distribution of datasets, the new definitions of boundary points, inter-cluster intersection density and inter-cluster boundary density are provided. To correctly classify the clustering problems with multiple density peaks in the same cluster, a new cluster fusion strategy is proposed, which not only corrects the joint and several errors in the allocation of data points, but also correctly selects the cluster centers. Finally, to test the effectiveness of the proposed clustering algorithm, which is compared with DPC-KNN, DPC, K-means and DBSCAN on nine synthetic datasets and six real datasets. The experimental results demonstrate that the clustering performance of the proposed algorithm outperforms that of other algorithms.
first_indexed 2024-04-11T11:40:00Z
format Article
id doaj.art-56d7722652cb48d5b1bc87f135efe543
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T11:40:00Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-56d7722652cb48d5b1bc87f135efe5432022-12-22T04:25:51ZengIEEEIEEE Access2169-35362022-01-0110980349804710.1109/ACCESS.2022.32057429885191A New Density Peak Clustering Algorithm Based on Cluster Fusion StrategyFuxiang Li0https://orcid.org/0000-0003-0569-7655Ming Zhou1https://orcid.org/0000-0002-3002-4746Shu Li2https://orcid.org/0000-0003-1742-2480Tianhao Yang3https://orcid.org/0000-0002-9916-8238School of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaWhen the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above problems, we propose a new density peak clustering algorithm based on cluster fusion strategy. First, the algorithm screens out the candidate cluster centers by setting two new thresholds to avoid the influence of noise points and outliers. Second, the remaining data points are allocated according to the density peak clustering algorithm to obtain the initial clusters. Third, considering the structural characteristics and spatial distribution of datasets, the new definitions of boundary points, inter-cluster intersection density and inter-cluster boundary density are provided. To correctly classify the clustering problems with multiple density peaks in the same cluster, a new cluster fusion strategy is proposed, which not only corrects the joint and several errors in the allocation of data points, but also correctly selects the cluster centers. Finally, to test the effectiveness of the proposed clustering algorithm, which is compared with DPC-KNN, DPC, K-means and DBSCAN on nine synthetic datasets and six real datasets. The experimental results demonstrate that the clustering performance of the proposed algorithm outperforms that of other algorithms.https://ieeexplore.ieee.org/document/9885191/Clusteringdensity peakscandidate cluster centercluster fusion strategy
spellingShingle Fuxiang Li
Ming Zhou
Shu Li
Tianhao Yang
A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
IEEE Access
Clustering
density peaks
candidate cluster center
cluster fusion strategy
title A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
title_full A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
title_fullStr A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
title_full_unstemmed A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
title_short A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
title_sort new density peak clustering algorithm based on cluster fusion strategy
topic Clustering
density peaks
candidate cluster center
cluster fusion strategy
url https://ieeexplore.ieee.org/document/9885191/
work_keys_str_mv AT fuxiangli anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT mingzhou anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT shuli anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT tianhaoyang anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT fuxiangli newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT mingzhou newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT shuli newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy
AT tianhaoyang newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy