Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.

Kmeans clustering algorithm is an iterative unsupervised learning algorithm that tries to partition the given dataset into k pre-defined distinct non-overlapping clusters where each data point belongs to only one group. However, its performance is affected by its sensitivity to the initial cluster c...

Full description

Bibliographic Details
Main Authors: Abiodun M Ikotun, Absalom E Ezugwu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2022-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0272861
_version_ 1811273902678081536
author Abiodun M Ikotun
Absalom E Ezugwu
author_facet Abiodun M Ikotun
Absalom E Ezugwu
author_sort Abiodun M Ikotun
collection DOAJ
description Kmeans clustering algorithm is an iterative unsupervised learning algorithm that tries to partition the given dataset into k pre-defined distinct non-overlapping clusters where each data point belongs to only one group. However, its performance is affected by its sensitivity to the initial cluster centroids with the possibility of convergence into local optimum and specification of cluster number as the input parameter. Recently, the hybridization of metaheuristics algorithms with the K-Means algorithm has been explored to address these problems and effectively improve the algorithm's performance. Nonetheless, most metaheuristics algorithms require rigorous parameter tunning to achieve an optimum result. This paper proposes a hybrid clustering method that combines the well-known symbiotic organisms search algorithm with K-Means using the SOS as a global search metaheuristic for generating the optimum initial cluster centroids for the K-Means. The SOS algorithm is more of a parameter-free metaheuristic with excellent search quality that only requires initialising a single control parameter. The performance of the proposed algorithm is investigated by comparing it with the classical SOS, classical K-means and other existing hybrids clustering algorithms on eleven (11) UCI Machine Learning Repository datasets and one artificial dataset. The results from the extensive computational experimentation show improved performance of the hybrid SOSK-Means for solving automatic clustering compared to the standard K-Means, symbiotic organisms search clustering methods and other hybrid clustering approaches.
first_indexed 2024-04-12T23:07:58Z
format Article
id doaj.art-53cfb631ba274d248c69c4ed98bf9e42
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-12T23:07:58Z
publishDate 2022-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-53cfb631ba274d248c69c4ed98bf9e422022-12-22T03:12:52ZengPublic Library of Science (PLoS)PLoS ONE1932-62032022-01-01178e027286110.1371/journal.pone.0272861Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.Abiodun M IkotunAbsalom E EzugwuKmeans clustering algorithm is an iterative unsupervised learning algorithm that tries to partition the given dataset into k pre-defined distinct non-overlapping clusters where each data point belongs to only one group. However, its performance is affected by its sensitivity to the initial cluster centroids with the possibility of convergence into local optimum and specification of cluster number as the input parameter. Recently, the hybridization of metaheuristics algorithms with the K-Means algorithm has been explored to address these problems and effectively improve the algorithm's performance. Nonetheless, most metaheuristics algorithms require rigorous parameter tunning to achieve an optimum result. This paper proposes a hybrid clustering method that combines the well-known symbiotic organisms search algorithm with K-Means using the SOS as a global search metaheuristic for generating the optimum initial cluster centroids for the K-Means. The SOS algorithm is more of a parameter-free metaheuristic with excellent search quality that only requires initialising a single control parameter. The performance of the proposed algorithm is investigated by comparing it with the classical SOS, classical K-means and other existing hybrids clustering algorithms on eleven (11) UCI Machine Learning Repository datasets and one artificial dataset. The results from the extensive computational experimentation show improved performance of the hybrid SOSK-Means for solving automatic clustering compared to the standard K-Means, symbiotic organisms search clustering methods and other hybrid clustering approaches.https://doi.org/10.1371/journal.pone.0272861
spellingShingle Abiodun M Ikotun
Absalom E Ezugwu
Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
PLoS ONE
title Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
title_full Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
title_fullStr Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
title_full_unstemmed Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
title_short Boosting k-means clustering with symbiotic organisms search for automatic clustering problems.
title_sort boosting k means clustering with symbiotic organisms search for automatic clustering problems
url https://doi.org/10.1371/journal.pone.0272861
work_keys_str_mv AT abiodunmikotun boostingkmeansclusteringwithsymbioticorganismssearchforautomaticclusteringproblems
AT absalomeezugwu boostingkmeansclusteringwithsymbioticorganismssearchforautomaticclusteringproblems