Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets

This article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*) compared to five traditional and modern clustering algorithms. Two experimental methods are employed to examine the performance of ECA* against genetic algorithm for clustering++ (GENCLUS...

Full description

Bibliographic Details
Main Authors: Bryar A. Hassan, Tarik A. Rashid, Seyedali Mirjalili
Format: Article
Language:English
Published: Elsevier 2021-06-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340921003280
_version_ 1818462045734961152
author Bryar A. Hassan
Tarik A. Rashid
Seyedali Mirjalili
author_facet Bryar A. Hassan
Tarik A. Rashid
Seyedali Mirjalili
author_sort Bryar A. Hassan
collection DOAJ
description This article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*) compared to five traditional and modern clustering algorithms. Two experimental methods are employed to examine the performance of ECA* against genetic algorithm for clustering++ (GENCLUST++), learning vector quantisation (LVQ), expectation maximisation (EM), K-means++ (KM++) and K-means (KM). These algorithms are applied to 32 heterogenous and multi-featured datasets to determine which one performs well on the three tests. For one, ther paper examines the efficiency of ECA* in contradiction of its corresponding algorithms using clustering evaluation measures. These validation criteria are objective function and cluster quality measures. For another, it suggests a performance rating framework to measurethe the performance sensitivity of these algorithms on varos dataset features (cluster dimensionality, number of clusters, cluster overlap, cluster shape and cluster structure). The contributions of these experiments are two-folds: (i) ECA* exceeds its counterpart aloriths in ability to find out the right cluster number; (ii) ECA* is less sensitive towards dataset features compared to its competitive techniques. Nonetheless, the results of the experiments performed demonstrate some limitations in the ECA*: (i) ECA* is not fully applied based on the premise that no prior knowledge exists; (ii) Adapting and utilising ECA* on several real applications has not been achieved yet.
first_indexed 2024-12-14T23:55:48Z
format Article
id doaj.art-2f21e8b58fa54f0589c2a584a3c6509b
institution Directory Open Access Journal
issn 2352-3409
language English
last_indexed 2024-12-14T23:55:48Z
publishDate 2021-06-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj.art-2f21e8b58fa54f0589c2a584a3c6509b2022-12-21T22:43:08ZengElsevierData in Brief2352-34092021-06-0136107044Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasetsBryar A. Hassan0Tarik A. Rashid1Seyedali Mirjalili2Kurdistan Institution for Strategic Studies and Scientific Research, Sulaimani, Iraq; Department of Computer Networks, Technical College of Informatics, Sulaimani Polytechnic University, Sulaimani, Iraq; Corresponding author at: Kurdistan Institution for Strategic Studies and Scientific Research, Sulaimani, Iraq.Computer Science and Engineering Department, University of Kurdistan Hewler, Erbil, IraqCentre for Artificial Intelligence Research and Optimisation, Torrens University, Australia; Yonsei Frontier Lab, Yonsei University, Seoul, KoreaThis article presents the data used to evaluate the performance of evolutionary clustering algorithm star (ECA*) compared to five traditional and modern clustering algorithms. Two experimental methods are employed to examine the performance of ECA* against genetic algorithm for clustering++ (GENCLUST++), learning vector quantisation (LVQ), expectation maximisation (EM), K-means++ (KM++) and K-means (KM). These algorithms are applied to 32 heterogenous and multi-featured datasets to determine which one performs well on the three tests. For one, ther paper examines the efficiency of ECA* in contradiction of its corresponding algorithms using clustering evaluation measures. These validation criteria are objective function and cluster quality measures. For another, it suggests a performance rating framework to measurethe the performance sensitivity of these algorithms on varos dataset features (cluster dimensionality, number of clusters, cluster overlap, cluster shape and cluster structure). The contributions of these experiments are two-folds: (i) ECA* exceeds its counterpart aloriths in ability to find out the right cluster number; (ii) ECA* is less sensitive towards dataset features compared to its competitive techniques. Nonetheless, the results of the experiments performed demonstrate some limitations in the ECA*: (i) ECA* is not fully applied based on the premise that no prior knowledge exists; (ii) Adapting and utilising ECA* on several real applications has not been achieved yet.http://www.sciencedirect.com/science/article/pii/S2352340921003280Evolutionary clustering algorithm starECA* performance evaluationECA* statistical performance evaluationECA* performance ranking framework
spellingShingle Bryar A. Hassan
Tarik A. Rashid
Seyedali Mirjalili
Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
Data in Brief
Evolutionary clustering algorithm star
ECA* performance evaluation
ECA* statistical performance evaluation
ECA* performance ranking framework
title Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
title_full Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
title_fullStr Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
title_full_unstemmed Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
title_short Performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
title_sort performance evaluation results of evolutionary clustering algorithm star for clustering heterogeneous datasets
topic Evolutionary clustering algorithm star
ECA* performance evaluation
ECA* statistical performance evaluation
ECA* performance ranking framework
url http://www.sciencedirect.com/science/article/pii/S2352340921003280
work_keys_str_mv AT bryarahassan performanceevaluationresultsofevolutionaryclusteringalgorithmstarforclusteringheterogeneousdatasets
AT tarikarashid performanceevaluationresultsofevolutionaryclusteringalgorithmstarforclusteringheterogeneousdatasets
AT seyedalimirjalili performanceevaluationresultsofevolutionaryclusteringalgorithmstarforclusteringheterogeneousdatasets