Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor

There was an increase in the number of late payments of tuition fees by 3,018 from a total of 5,535 students at the end of 2020. This study uses the Python library which requires data to be of numeric type, so it requires data transformation according to the type of data in the study, data that has...

Full description

Bibliographic Details
Main Authors:	Muhammad Norhalimi, Taghfirul Azhima Yoga Siswa
Format:	Article
Language:	English
Published:	Universitas Islam Negeri Sunan Kalijaga Yogyakarta 2022-09-01
Series:	JISKA (Jurnal Informatika Sunan Kalijaga)
Subjects:	Prediction Naïve Bayes K-Nearest Neighbor Information Gain Confusion Matrix
Online Access:	https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/3533

_version_	1797758861073973248
author	Muhammad Norhalimi Taghfirul Azhima Yoga Siswa
author_facet	Muhammad Norhalimi Taghfirul Azhima Yoga Siswa
author_sort	Muhammad Norhalimi
collection	DOAJ
description	There was an increase in the number of late payments of tuition fees by 3,018 from a total of 5,535 students at the end of 2020. This study uses the Python library which requires data to be of numeric type, so it requires data transformation according to the type of data in the study, data that has a scale is transformed using an ordinal encoder, and data that does not have a scale is transformed using one-hot encoding. The purpose of this study was to evaluate the performance of the Naïve Bayes algorithm and K-Nearest Neighbor with a confusion matrix in predicting late payment of tuition fees at UMKT. The dataset used in this study was sourced from the financial administration bureau as many as 12,408 data with a distribution of 90:10. Based on the results of the calculation of the selection of information gain features, the best 4 attributes that influence the research are obtained, namely faculty, study program, class, and gender. The results of the evaluation of the confusion matrix that have the best performance using the Naïve Bayes with information gain algorithm obtain an accuracy of 55.19%, while the K-Nearest Neighbor with information gain only obtains an accuracy of 50.76%. Based on the accuracy results obtained in the prediction of late payment of tuition fees by using attributes derived from information gain, it influences increasing the accuracy of Naïve Bayes, but the use of the information gain attribute on the K-Nearest Neighbor algorithm makes the accuracy obtained decrease.
first_indexed	2024-03-12T18:36:06Z
format	Article
id	doaj.art-a2cadad6da5641df94d14b3f7185536b
institution	Directory Open Access Journal
issn	2527-5836 2528-0074
language	English
last_indexed	2024-03-12T18:36:06Z
publishDate	2022-09-01
publisher	Universitas Islam Negeri Sunan Kalijaga Yogyakarta
record_format	Article
series	JISKA (Jurnal Informatika Sunan Kalijaga)
spelling	doaj.art-a2cadad6da5641df94d14b3f7185536b2023-08-02T08:02:35ZengUniversitas Islam Negeri Sunan Kalijaga YogyakartaJISKA (Jurnal Informatika Sunan Kalijaga)2527-58362528-00742022-09-0173Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest NeighborMuhammad Norhalimi0Taghfirul Azhima Yoga Siswa1Universitas Muhammadiyah Kalimantan TimurUniversitas Muhammadiyah Kalimantan Timur There was an increase in the number of late payments of tuition fees by 3,018 from a total of 5,535 students at the end of 2020. This study uses the Python library which requires data to be of numeric type, so it requires data transformation according to the type of data in the study, data that has a scale is transformed using an ordinal encoder, and data that does not have a scale is transformed using one-hot encoding. The purpose of this study was to evaluate the performance of the Naïve Bayes algorithm and K-Nearest Neighbor with a confusion matrix in predicting late payment of tuition fees at UMKT. The dataset used in this study was sourced from the financial administration bureau as many as 12,408 data with a distribution of 90:10. Based on the results of the calculation of the selection of information gain features, the best 4 attributes that influence the research are obtained, namely faculty, study program, class, and gender. The results of the evaluation of the confusion matrix that have the best performance using the Naïve Bayes with information gain algorithm obtain an accuracy of 55.19%, while the K-Nearest Neighbor with information gain only obtains an accuracy of 50.76%. Based on the accuracy results obtained in the prediction of late payment of tuition fees by using attributes derived from information gain, it influences increasing the accuracy of Naïve Bayes, but the use of the information gain attribute on the K-Nearest Neighbor algorithm makes the accuracy obtained decrease. https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/3533PredictionNaïve BayesK-Nearest NeighborInformation GainConfusion Matrix
spellingShingle	Muhammad Norhalimi Taghfirul Azhima Yoga Siswa Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor JISKA (Jurnal Informatika Sunan Kalijaga) Prediction Naïve Bayes K-Nearest Neighbor Information Gain Confusion Matrix
title	Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor
title_full	Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor
title_fullStr	Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor
title_full_unstemmed	Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor
title_short	Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor
title_sort	optimasi seleksi fitur information gain pada algoritma naive bayes dan k nearest neighbor
topic	Prediction Naïve Bayes K-Nearest Neighbor Information Gain Confusion Matrix
url	https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/3533
work_keys_str_mv	AT muhammadnorhalimi optimasiseleksifiturinformationgainpadaalgoritmanaivebayesdanknearestneighbor AT taghfirulazhimayogasiswa optimasiseleksifiturinformationgainpadaalgoritmanaivebayesdanknearestneighbor

Optimasi Seleksi Fitur Information Gain pada Algoritma Naïve Bayes dan K-Nearest Neighbor

Similar Items