K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data

In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algo...

Full description

Bibliographic Details
Main Authors: Rofiq Muhammad Syauqi, Puspita Nurul Sabrina, Irma Santikarama
Format: Article
Language:English
Published: Politeknik Negeri Batam 2023-12-01
Series:Journal of Applied Informatics and Computing
Subjects:
Online Access:https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491
_version_ 1827589915878096896
author Rofiq Muhammad Syauqi
Puspita Nurul Sabrina
Irma Santikarama
author_facet Rofiq Muhammad Syauqi
Puspita Nurul Sabrina
Irma Santikarama
author_sort Rofiq Muhammad Syauqi
collection DOAJ
description In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data.
first_indexed 2024-03-09T01:08:13Z
format Article
id doaj.art-01563569742f4848a66e445266ceba30
institution Directory Open Access Journal
issn 2548-6861
language English
last_indexed 2024-03-09T01:08:13Z
publishDate 2023-12-01
publisher Politeknik Negeri Batam
record_format Article
series Journal of Applied Informatics and Computing
spelling doaj.art-01563569742f4848a66e445266ceba302023-12-11T08:06:22ZengPoliteknik Negeri BatamJournal of Applied Informatics and Computing2548-68612023-12-017223123910.30871/jaic.v7i2.64916491K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation DataRofiq Muhammad SyauqiPuspita Nurul SabrinaIrma SantikaramaIn the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data.https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491clusteringimputasi knnimputasi meank-meanssilhouette coefficient
spellingShingle Rofiq Muhammad Syauqi
Puspita Nurul Sabrina
Irma Santikarama
K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
Journal of Applied Informatics and Computing
clustering
imputasi knn
imputasi mean
k-means
silhouette coefficient
title K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_full K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_fullStr K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_full_unstemmed K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_short K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
title_sort k means clustering with knn and mean imputation on cpu benchmark compilation data
topic clustering
imputasi knn
imputasi mean
k-means
silhouette coefficient
url https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491
work_keys_str_mv AT rofiqmuhammadsyauqi kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata
AT puspitanurulsabrina kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata
AT irmasantikarama kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata