An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering

Although within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize t...

Full description

Bibliographic Details
Main Authors: Liyan Xiong, Cheng Wang, Xiaohui Huang, Hui Zeng
Format: Article
Language:English
Published: MDPI AG 2019-07-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/21/7/683
_version_ 1828283084281217024
author Liyan Xiong
Cheng Wang
Xiaohui Huang
Hui Zeng
author_facet Liyan Xiong
Cheng Wang
Xiaohui Huang
Hui Zeng
author_sort Liyan Xiong
collection DOAJ
description Although within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize the distance between the center of a cluster and the points that do not belong to this cluster. Based on this idea, we firstly design an optimization objective function integrating the between-cluster distance and entropy regularization in this paper. Then, updating rules are given by theoretical analysis. In the following, the properties of our proposed algorithm are investigated, and the performance is evaluated experimentally using two synthetic and seven real-life datasets. Finally, the experimental studies demonstrate that the results of the proposed algorithm (ERKM) outperform most existing state-of-the-art <i>k</i>-means-type clustering algorithms in most cases.
first_indexed 2024-04-13T08:39:39Z
format Article
id doaj.art-1fa85fd20c7d4faa80cf093fd0139dd5
institution Directory Open Access Journal
issn 1099-4300
language English
last_indexed 2024-04-13T08:39:39Z
publishDate 2019-07-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj.art-1fa85fd20c7d4faa80cf093fd0139dd52022-12-22T02:53:56ZengMDPI AGEntropy1099-43002019-07-0121768310.3390/e21070683e21070683An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace ClusteringLiyan Xiong0Cheng Wang1Xiaohui Huang2Hui Zeng3School of Information Engineering Department, East China Jiaotong University, R.d 808, East Shuanggang Avenue, Nanchang 330013, ChinaSchool of Information Engineering Department, East China Jiaotong University, R.d 808, East Shuanggang Avenue, Nanchang 330013, ChinaSchool of Information Engineering Department, East China Jiaotong University, R.d 808, East Shuanggang Avenue, Nanchang 330013, ChinaSchool of Information Engineering Department, East China Jiaotong University, R.d 808, East Shuanggang Avenue, Nanchang 330013, ChinaAlthough within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize the distance between the center of a cluster and the points that do not belong to this cluster. Based on this idea, we firstly design an optimization objective function integrating the between-cluster distance and entropy regularization in this paper. Then, updating rules are given by theoretical analysis. In the following, the properties of our proposed algorithm are investigated, and the performance is evaluated experimentally using two synthetic and seven real-life datasets. Finally, the experimental studies demonstrate that the results of the proposed algorithm (ERKM) outperform most existing state-of-the-art <i>k</i>-means-type clustering algorithms in most cases.https://www.mdpi.com/1099-4300/21/7/683k-meansbetween-cluster informationentropy regularizationdata mining
spellingShingle Liyan Xiong
Cheng Wang
Xiaohui Huang
Hui Zeng
An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
Entropy
k-means
between-cluster information
entropy regularization
data mining
title An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_full An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_fullStr An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_full_unstemmed An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_short An Entropy Regularization <em>k</em>-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_sort entropy regularization em k em means algorithm with a new measure of between cluster distance in subspace clustering
topic k-means
between-cluster information
entropy regularization
data mining
url https://www.mdpi.com/1099-4300/21/7/683
work_keys_str_mv AT liyanxiong anentropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT chengwang anentropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT xiaohuihuang anentropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT huizeng anentropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT liyanxiong entropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT chengwang entropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT xiaohuihuang entropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT huizeng entropyregularizationemkemmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering