K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction

A missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistica...

Full description

Bibliographic Details
Main Authors:	Abdul Fadlil, Herman, Dikky Praseptian M
Format:	Article
Language:	English
Published:	Ikatan Ahli Informatika Indonesia 2022-08-01
Series:	Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:	knn imputation missing value satisfaction garduate user
Online Access:	http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173

_version_	1797330204056616960
author	Abdul Fadlil Herman Dikky Praseptian M
author_facet	Abdul Fadlil Herman Dikky Praseptian M
author_sort	Abdul Fadlil
collection	DOAJ
description	A missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistical estimates using one calculated value such as, mean, median, min, max, and most frequent methods. Maximum likelihood and expectancy maximization, and machine learning methods such as K Nearest Neighbor (KNN). This research uses KNN Imputation to predict the missing value. The data used is data from a questionnaire survey of graduate user satisfaction levels with seven assessment criteria, namely ethics, expertise in the field of science (main competence), foreign language skills, foreign language skills, use of information technology, communication skills, cooperation, and self-development. The results of testing imputation predictions using KNNI on user satisfaction level data for STMIK PPKIA Tarakanita Rahmawati graduates from 2018 to 2021. Where using the five k closest neighbors, namely 1, 5, 10, 15, and 20, the error value of the k nearest neighbors is 5 in RMSE is 0, 316 while the error value using MAPE is 3,33 %, both values are smaller than the value of k other nearest neighbors. K nearest neighbor 5 is the best imputation prediction result, both calculated by RMSE and MAPE, even in MAPE the error value is below 10%, which means it is very good.
first_indexed	2024-03-08T07:16:40Z
format	Article
id	doaj.art-9a804542ce85414796d2583fdade99d0
institution	Directory Open Access Journal
issn	2580-0760
language	English
last_indexed	2024-03-08T07:16:40Z
publishDate	2022-08-01
publisher	Ikatan Ahli Informatika Indonesia
record_format	Article
series	Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling	doaj.art-9a804542ce85414796d2583fdade99d02024-02-03T00:50:13ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602022-08-016457057610.29207/resti.v6i4.41734173K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User SatisfactionAbdul Fadlil0Herman1Dikky Praseptian M2Universitas Ahmad DahlanUniversitas Ahmad DahlanSTMIK PPKIA Tarakanita RahmawatiA missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistical estimates using one calculated value such as, mean, median, min, max, and most frequent methods. Maximum likelihood and expectancy maximization, and machine learning methods such as K Nearest Neighbor (KNN). This research uses KNN Imputation to predict the missing value. The data used is data from a questionnaire survey of graduate user satisfaction levels with seven assessment criteria, namely ethics, expertise in the field of science (main competence), foreign language skills, foreign language skills, use of information technology, communication skills, cooperation, and self-development. The results of testing imputation predictions using KNNI on user satisfaction level data for STMIK PPKIA Tarakanita Rahmawati graduates from 2018 to 2021. Where using the five k closest neighbors, namely 1, 5, 10, 15, and 20, the error value of the k nearest neighbors is 5 in RMSE is 0, 316 while the error value using MAPE is 3,33 %, both values are smaller than the value of k other nearest neighbors. K nearest neighbor 5 is the best imputation prediction result, both calculated by RMSE and MAPE, even in MAPE the error value is below 10%, which means it is very good.http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173knnimputationmissing valuesatisfactiongarduate user
spellingShingle	Abdul Fadlil Herman Dikky Praseptian M K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) knn imputation missing value satisfaction garduate user
title	K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_full	K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_fullStr	K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_full_unstemmed	K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_short	K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_sort	k nearest neighbor imputation performance on missing value data graduate user satisfaction
topic	knn imputation missing value satisfaction garduate user
url	http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173
work_keys_str_mv	AT abdulfadlil knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction AT herman knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction AT dikkypraseptianm knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction

K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction

Similar Items