Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm

Accurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locall...

Full description

Bibliographic Details
Main Authors: Jiaxuan Xu, Jiang Wu, Taiyong Li, Yang Nan
Format: Article
Language:English
Published: MDPI AG 2022-09-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/24/10/1324
_version_ 1827650373856264192
author Jiaxuan Xu
Jiang Wu
Taiyong Li
Yang Nan
author_facet Jiaxuan Xu
Jiang Wu
Taiyong Li
Yang Nan
author_sort Jiaxuan Xu
collection DOAJ
description Accurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locally weighted ensemble clustering (ELWEC) are two typical methods for ensemble clustering. However, DREC treats each microcluster equally and hence, ignores the differences between each microcluster, while ELWEC conducts clustering on clusters rather than microclusters and ignores the sample–cluster relationship. To address these issues, a divergence-based locally weighted ensemble clustering with dictionary learning (DLWECDL) is proposed in this paper. Specifically, the DLWECDL consists of four phases. First, the clusters from the base clustering are used to generate microclusters. Second, a Kullback–Leibler divergence-based ensemble-driven cluster index is used to measure the weight of each microcluster. With these weights, an ensemble clustering algorithm with dictionary learning and the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mrow><mn>2</mn><mo>,</mo><mn>1</mn></mrow></msub></semantics></math></inline-formula>-norm is employed in the third phase. Meanwhile, the objective function is resolved by optimizing four subproblems and a similarity matrix is learned. Finally, a normalized cut (Ncut) is used to partition the similarity matrix and the ensemble clustering results are obtained. In this study, the proposed DLWECDL was validated on 20 widely used datasets and compared to some other state-of-the-art ensemble clustering methods. The experimental results demonstrated that the proposed DLWECDL is a very promising method for ensemble clustering.
first_indexed 2024-03-09T20:15:21Z
format Article
id doaj.art-4438d050224a4790947b5bcc9b529cde
institution Directory Open Access Journal
issn 1099-4300
language English
last_indexed 2024-03-09T20:15:21Z
publishDate 2022-09-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj.art-4438d050224a4790947b5bcc9b529cde2023-11-24T00:01:43ZengMDPI AGEntropy1099-43002022-09-012410132410.3390/e24101324Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-NormJiaxuan Xu0Jiang Wu1Taiyong Li2Yang Nan3School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu 611130, ChinaSchool of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu 611130, ChinaSchool of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu 611130, ChinaDepartment of Computer Science, Harbin Finance University, Harbin 150030, ChinaAccurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locally weighted ensemble clustering (ELWEC) are two typical methods for ensemble clustering. However, DREC treats each microcluster equally and hence, ignores the differences between each microcluster, while ELWEC conducts clustering on clusters rather than microclusters and ignores the sample–cluster relationship. To address these issues, a divergence-based locally weighted ensemble clustering with dictionary learning (DLWECDL) is proposed in this paper. Specifically, the DLWECDL consists of four phases. First, the clusters from the base clustering are used to generate microclusters. Second, a Kullback–Leibler divergence-based ensemble-driven cluster index is used to measure the weight of each microcluster. With these weights, an ensemble clustering algorithm with dictionary learning and the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>L</mi><mrow><mn>2</mn><mo>,</mo><mn>1</mn></mrow></msub></semantics></math></inline-formula>-norm is employed in the third phase. Meanwhile, the objective function is resolved by optimizing four subproblems and a similarity matrix is learned. Finally, a normalized cut (Ncut) is used to partition the similarity matrix and the ensemble clustering results are obtained. In this study, the proposed DLWECDL was validated on 20 widely used datasets and compared to some other state-of-the-art ensemble clustering methods. The experimental results demonstrated that the proposed DLWECDL is a very promising method for ensemble clustering.https://www.mdpi.com/1099-4300/24/10/1324clusteringensemble clustering<i>L</i><sub>2,1</sub>-normsimilaritysubspace clusteringdictionary learning
spellingShingle Jiaxuan Xu
Jiang Wu
Taiyong Li
Yang Nan
Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
Entropy
clustering
ensemble clustering
<i>L</i><sub>2,1</sub>-norm
similarity
subspace clustering
dictionary learning
title Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
title_full Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
title_fullStr Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
title_full_unstemmed Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
title_short Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and <i>L</i><sub>2,1</sub>-Norm
title_sort divergence based locally weighted ensemble clustering with dictionary learning and i l i sub 2 1 sub norm
topic clustering
ensemble clustering
<i>L</i><sub>2,1</sub>-norm
similarity
subspace clustering
dictionary learning
url https://www.mdpi.com/1099-4300/24/10/1324
work_keys_str_mv AT jiaxuanxu divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandilisub21subnorm
AT jiangwu divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandilisub21subnorm
AT taiyongli divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandilisub21subnorm
AT yangnan divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandilisub21subnorm