K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data

In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algo...

Full description

Bibliographic Details
Main Authors:	Rofiq Muhammad Syauqi, Puspita Nurul Sabrina, Irma Santikarama
Format:	Article
Language:	English
Published:	Politeknik Negeri Batam 2023-12-01
Series:	Journal of Applied Informatics and Computing
Subjects:	clustering imputasi knn imputasi mean k-means silhouette coefficient
Online Access:	https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491

_version_	1827589915878096896
author	Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama
author_facet	Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama
author_sort	Rofiq Muhammad Syauqi
collection	DOAJ
description	In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data.
first_indexed	2024-03-09T01:08:13Z
format	Article
id	doaj.art-01563569742f4848a66e445266ceba30
institution	Directory Open Access Journal
issn	2548-6861
language	English
last_indexed	2024-03-09T01:08:13Z
publishDate	2023-12-01
publisher	Politeknik Negeri Batam
record_format	Article
series	Journal of Applied Informatics and Computing
spelling	doaj.art-01563569742f4848a66e445266ceba302023-12-11T08:06:22ZengPoliteknik Negeri BatamJournal of Applied Informatics and Computing2548-68612023-12-017223123910.30871/jaic.v7i2.64916491K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation DataRofiq Muhammad SyauqiPuspita Nurul SabrinaIrma SantikaramaIn the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data.https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491clusteringimputasi knnimputasi meank-meanssilhouette coefficient
spellingShingle	Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data Journal of Applied Informatics and Computing clustering imputasi knn imputasi mean k-means silhouette coefficient
title	K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_full	K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_fullStr	K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_full_unstemmed	K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_short	K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_sort	k means clustering with knn and mean imputation on cpu benchmark compilation data
topic	clustering imputasi knn imputasi mean k-means silhouette coefficient
url	https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491
work_keys_str_mv	AT rofiqmuhammadsyauqi kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata AT puspitanurulsabrina kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata AT irmasantikarama kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata

K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data

Similar Items