A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy
When the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2022-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9885191/ |
_version_ | 1798001669320998912 |
---|---|
author | Fuxiang Li Ming Zhou Shu Li Tianhao Yang |
author_facet | Fuxiang Li Ming Zhou Shu Li Tianhao Yang |
author_sort | Fuxiang Li |
collection | DOAJ |
description | When the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above problems, we propose a new density peak clustering algorithm based on cluster fusion strategy. First, the algorithm screens out the candidate cluster centers by setting two new thresholds to avoid the influence of noise points and outliers. Second, the remaining data points are allocated according to the density peak clustering algorithm to obtain the initial clusters. Third, considering the structural characteristics and spatial distribution of datasets, the new definitions of boundary points, inter-cluster intersection density and inter-cluster boundary density are provided. To correctly classify the clustering problems with multiple density peaks in the same cluster, a new cluster fusion strategy is proposed, which not only corrects the joint and several errors in the allocation of data points, but also correctly selects the cluster centers. Finally, to test the effectiveness of the proposed clustering algorithm, which is compared with DPC-KNN, DPC, K-means and DBSCAN on nine synthetic datasets and six real datasets. The experimental results demonstrate that the clustering performance of the proposed algorithm outperforms that of other algorithms. |
first_indexed | 2024-04-11T11:40:00Z |
format | Article |
id | doaj.art-56d7722652cb48d5b1bc87f135efe543 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-11T11:40:00Z |
publishDate | 2022-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-56d7722652cb48d5b1bc87f135efe5432022-12-22T04:25:51ZengIEEEIEEE Access2169-35362022-01-0110980349804710.1109/ACCESS.2022.32057429885191A New Density Peak Clustering Algorithm Based on Cluster Fusion StrategyFuxiang Li0https://orcid.org/0000-0003-0569-7655Ming Zhou1https://orcid.org/0000-0002-3002-4746Shu Li2https://orcid.org/0000-0003-1742-2480Tianhao Yang3https://orcid.org/0000-0002-9916-8238School of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaSchool of Science, Harbin University of Science and Technology, Harbin, ChinaWhen the density peak clustering algorithm deals with complex datasets and the problem of multiple density peaks in the same cluster, the subjectively selected cluster centers are not accurate enough, and the allocation of non-cluster centers is prone to joint and several errors. To solve the above problems, we propose a new density peak clustering algorithm based on cluster fusion strategy. First, the algorithm screens out the candidate cluster centers by setting two new thresholds to avoid the influence of noise points and outliers. Second, the remaining data points are allocated according to the density peak clustering algorithm to obtain the initial clusters. Third, considering the structural characteristics and spatial distribution of datasets, the new definitions of boundary points, inter-cluster intersection density and inter-cluster boundary density are provided. To correctly classify the clustering problems with multiple density peaks in the same cluster, a new cluster fusion strategy is proposed, which not only corrects the joint and several errors in the allocation of data points, but also correctly selects the cluster centers. Finally, to test the effectiveness of the proposed clustering algorithm, which is compared with DPC-KNN, DPC, K-means and DBSCAN on nine synthetic datasets and six real datasets. The experimental results demonstrate that the clustering performance of the proposed algorithm outperforms that of other algorithms.https://ieeexplore.ieee.org/document/9885191/Clusteringdensity peakscandidate cluster centercluster fusion strategy |
spellingShingle | Fuxiang Li Ming Zhou Shu Li Tianhao Yang A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy IEEE Access Clustering density peaks candidate cluster center cluster fusion strategy |
title | A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy |
title_full | A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy |
title_fullStr | A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy |
title_full_unstemmed | A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy |
title_short | A New Density Peak Clustering Algorithm Based on Cluster Fusion Strategy |
title_sort | new density peak clustering algorithm based on cluster fusion strategy |
topic | Clustering density peaks candidate cluster center cluster fusion strategy |
url | https://ieeexplore.ieee.org/document/9885191/ |
work_keys_str_mv | AT fuxiangli anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT mingzhou anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT shuli anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT tianhaoyang anewdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT fuxiangli newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT mingzhou newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT shuli newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy AT tianhaoyang newdensitypeakclusteringalgorithmbasedonclusterfusionstrategy |