A unified approach for cluster-wise and general noise rejection approaches for k-means clustering

Hard C-means (HCM; k-means) is one of the most widely used partitive clustering techniques. However, HCM is strongly affected by noise objects and cannot represent cluster overlap. To reduce the influence of noise objects, objects distant from cluster centers are rejected in some noise rejection app...

Full description

Bibliographic Details
Main Author: Seiki Ubukata
Format: Article
Language:English
Published: PeerJ Inc. 2019-11-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-238.pdf
_version_ 1818358180552376320
author Seiki Ubukata
author_facet Seiki Ubukata
author_sort Seiki Ubukata
collection DOAJ
description Hard C-means (HCM; k-means) is one of the most widely used partitive clustering techniques. However, HCM is strongly affected by noise objects and cannot represent cluster overlap. To reduce the influence of noise objects, objects distant from cluster centers are rejected in some noise rejection approaches including general noise rejection (GNR) and cluster-wise noise rejection (CNR). Generalized rough C-means (GRCM) can deal with positive, negative, and boundary belonging of object to clusters by reference to rough set theory. GRCM realizes cluster overlap by the linear function threshold-based object-cluster assignment. In this study, as a unified approach for GNR and CNR in HCM, we propose linear function threshold-based C-means (LiFTCM) by relaxing GRCM. We show that the linear function threshold-based assignment in LiFTCM includes GNR, CNR, and their combinations as well as rough assignment of GRCM. The classification boundary is visualized so that the characteristics of LiFTCM in various parameter settings are clarified. Numerical experiments demonstrate that the combinations of rough clustering or the combinations of GNR and CNR realized by LiFTCM yield satisfactory results.
first_indexed 2024-12-13T20:24:54Z
format Article
id doaj.art-264076fce2fa451bbf86bdd814dfe145
institution Directory Open Access Journal
issn 2376-5992
language English
last_indexed 2024-12-13T20:24:54Z
publishDate 2019-11-01
publisher PeerJ Inc.
record_format Article
series PeerJ Computer Science
spelling doaj.art-264076fce2fa451bbf86bdd814dfe1452022-12-21T23:32:35ZengPeerJ Inc.PeerJ Computer Science2376-59922019-11-015e23810.7717/peerj-cs.238A unified approach for cluster-wise and general noise rejection approaches for k-means clusteringSeiki Ubukata0Graduate School of Engineering, Osaka Prefecture University, Sakai, Osaka, JapanHard C-means (HCM; k-means) is one of the most widely used partitive clustering techniques. However, HCM is strongly affected by noise objects and cannot represent cluster overlap. To reduce the influence of noise objects, objects distant from cluster centers are rejected in some noise rejection approaches including general noise rejection (GNR) and cluster-wise noise rejection (CNR). Generalized rough C-means (GRCM) can deal with positive, negative, and boundary belonging of object to clusters by reference to rough set theory. GRCM realizes cluster overlap by the linear function threshold-based object-cluster assignment. In this study, as a unified approach for GNR and CNR in HCM, we propose linear function threshold-based C-means (LiFTCM) by relaxing GRCM. We show that the linear function threshold-based assignment in LiFTCM includes GNR, CNR, and their combinations as well as rough assignment of GRCM. The classification boundary is visualized so that the characteristics of LiFTCM in various parameter settings are clarified. Numerical experiments demonstrate that the combinations of rough clustering or the combinations of GNR and CNR realized by LiFTCM yield satisfactory results.https://peerj.com/articles/cs-238.pdfClusteringk-meansNoise rejectionRough set theory
spellingShingle Seiki Ubukata
A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
PeerJ Computer Science
Clustering
k-means
Noise rejection
Rough set theory
title A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
title_full A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
title_fullStr A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
title_full_unstemmed A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
title_short A unified approach for cluster-wise and general noise rejection approaches for k-means clustering
title_sort unified approach for cluster wise and general noise rejection approaches for k means clustering
topic Clustering
k-means
Noise rejection
Rough set theory
url https://peerj.com/articles/cs-238.pdf
work_keys_str_mv AT seikiubukata aunifiedapproachforclusterwiseandgeneralnoiserejectionapproachesforkmeansclustering
AT seikiubukata unifiedapproachforclusterwiseandgeneralnoiserejectionapproachesforkmeansclustering