K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction

A missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistica...

Full description

Bibliographic Details
Main Authors: Abdul Fadlil, Herman, Dikky Praseptian M
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2022-08-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173
_version_ 1797330204056616960
author Abdul Fadlil
Herman
Dikky Praseptian M
author_facet Abdul Fadlil
Herman
Dikky Praseptian M
author_sort Abdul Fadlil
collection DOAJ
description A missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistical estimates using one calculated value such as, mean, median, min, max, and most frequent methods. Maximum likelihood and expectancy maximization, and machine learning methods such as K Nearest Neighbor (KNN). This research uses KNN Imputation to predict the missing value. The data used is data from a questionnaire survey of graduate user satisfaction levels with seven assessment criteria, namely ethics, expertise in the field of science (main competence), foreign language skills, foreign language skills, use of information technology, communication skills, cooperation, and self-development. The results of testing imputation predictions using KNNI on user satisfaction level data for STMIK PPKIA Tarakanita Rahmawati graduates from 2018 to 2021. Where using the five k closest neighbors, namely 1, 5, 10, 15, and 20, the error value of the k nearest neighbors is 5 in RMSE is 0, 316 while the error value using MAPE is 3,33 %, both values are smaller than the value of k other nearest neighbors. K nearest neighbor 5 is the best imputation prediction result, both calculated by RMSE and MAPE, even in MAPE the error value is below 10%, which means it is very good.
first_indexed 2024-03-08T07:16:40Z
format Article
id doaj.art-9a804542ce85414796d2583fdade99d0
institution Directory Open Access Journal
issn 2580-0760
language English
last_indexed 2024-03-08T07:16:40Z
publishDate 2022-08-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj.art-9a804542ce85414796d2583fdade99d02024-02-03T00:50:13ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602022-08-016457057610.29207/resti.v6i4.41734173K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User SatisfactionAbdul Fadlil0Herman1Dikky Praseptian M2Universitas Ahmad DahlanUniversitas Ahmad DahlanSTMIK PPKIA Tarakanita RahmawatiA missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing value, or replacing missing values with statistical estimates using one calculated value such as, mean, median, min, max, and most frequent methods. Maximum likelihood and expectancy maximization, and machine learning methods such as K Nearest Neighbor (KNN). This research uses KNN Imputation to predict the missing value. The data used is data from a questionnaire survey of graduate user satisfaction levels with seven assessment criteria, namely ethics, expertise in the field of science (main competence), foreign language skills, foreign language skills, use of information technology, communication skills, cooperation, and self-development. The results of testing imputation predictions using KNNI on user satisfaction level data for STMIK PPKIA Tarakanita Rahmawati graduates from 2018 to 2021. Where using the five k closest neighbors, namely 1, 5, 10, 15, and 20, the error value of the k nearest neighbors is 5 in RMSE is 0, 316 while the error value using MAPE is 3,33 %, both values are smaller than the value of k other nearest neighbors. K nearest neighbor 5 is the best imputation prediction result, both calculated by RMSE and MAPE, even in MAPE the error value is below 10%, which means it is very good.http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173knnimputationmissing valuesatisfactiongarduate user
spellingShingle Abdul Fadlil
Herman
Dikky Praseptian M
K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
knn
imputation
missing value
satisfaction
garduate user
title K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_full K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_fullStr K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_full_unstemmed K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_short K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
title_sort k nearest neighbor imputation performance on missing value data graduate user satisfaction
topic knn
imputation
missing value
satisfaction
garduate user
url http://jurnal.iaii.or.id/index.php/RESTI/article/view/4173
work_keys_str_mv AT abdulfadlil knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction
AT herman knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction
AT dikkypraseptianm knearestneighborimputationperformanceonmissingvaluedatagraduateusersatisfaction