K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Politeknik Negeri Batam
2023-12-01
|
Series: | Journal of Applied Informatics and Computing |
Subjects: | |
Online Access: | https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491 |
_version_ | 1827589915878096896 |
---|---|
author | Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama |
author_facet | Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama |
author_sort | Rofiq Muhammad Syauqi |
collection | DOAJ |
description | In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data. |
first_indexed | 2024-03-09T01:08:13Z |
format | Article |
id | doaj.art-01563569742f4848a66e445266ceba30 |
institution | Directory Open Access Journal |
issn | 2548-6861 |
language | English |
last_indexed | 2024-03-09T01:08:13Z |
publishDate | 2023-12-01 |
publisher | Politeknik Negeri Batam |
record_format | Article |
series | Journal of Applied Informatics and Computing |
spelling | doaj.art-01563569742f4848a66e445266ceba302023-12-11T08:06:22ZengPoliteknik Negeri BatamJournal of Applied Informatics and Computing2548-68612023-12-017223123910.30871/jaic.v7i2.64916491K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation DataRofiq Muhammad SyauqiPuspita Nurul SabrinaIrma SantikaramaIn the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets. One of the effective clustering algorithms is k-means. However, this algorithm is prone to the problem of missing values, which can significantly affect the quality of the resulting clusters. To overcome this challenge, imputation methods are used, including mean imputation and K-Nearest Neighbor (KNN) imputation. This study aims to analyze the impact of imputation methods on CPU Benchmark Compilation clustering results. Evaluation of the clustering results using the silhouette coefficient showed that clustering with mean imputation achieved a score of 0.782, while with KNN imputation it achieved a score of 0.777. In addition, the cluster interpretation results show that the KNN method produces more information that is easier for users to understand. This research provides valuable insights into the effectiveness of imputation methods in improving the quality of data clustering results in assisting CPU selection decisions on CPU Benchmark Compilation data.https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491clusteringimputasi knnimputasi meank-meanssilhouette coefficient |
spellingShingle | Rofiq Muhammad Syauqi Puspita Nurul Sabrina Irma Santikarama K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data Journal of Applied Informatics and Computing clustering imputasi knn imputasi mean k-means silhouette coefficient |
title | K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data |
title_full | K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data |
title_fullStr | K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data |
title_full_unstemmed | K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data |
title_short | K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data |
title_sort | k means clustering with knn and mean imputation on cpu benchmark compilation data |
topic | clustering imputasi knn imputasi mean k-means silhouette coefficient |
url | https://jurnal.polibatam.ac.id/index.php/JAIC/article/view/6491 |
work_keys_str_mv | AT rofiqmuhammadsyauqi kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata AT puspitanurulsabrina kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata AT irmasantikarama kmeansclusteringwithknnandmeanimputationoncpubenchmarkcompilationdata |