Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification

The accuracy of most classification methods is significantly affected by missing values. Therefore, this study aimed to propose a data imputation method to handle missing values through the application of nearest neighbor data and fuzzy membership function as well as to compare the results with stan...

Full description

Bibliographic Details
Main Authors: Kurnia Muludi, Revita Setianingsih, Ridho Sholehurrohman, Akmal Junaidi
Format: Article
Language:English
Published: PeerJ Inc. 2024-03-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1968.pdf
Description
Summary:The accuracy of most classification methods is significantly affected by missing values. Therefore, this study aimed to propose a data imputation method to handle missing values through the application of nearest neighbor data and fuzzy membership function as well as to compare the results with standard methods. A total of five datasets related to classification problems obtained from the UCI Machine Learning Repository were used. The results showed that the proposed method had higher accuracy than standard imputation methods. Moreover, triangular method performed better than Gaussian fuzzy membership function. This showed that the combination of nearest neighbor data and fuzzy membership function was more effective in handling missing values and improving classification accuracy.
ISSN:2376-5992