Initial Seed Value Effectiveness on Performances of Data Mining Algorithms

After 2000s, Computer capacities and features are increased and access to data made easy. However, the produced and recorded data should be meaningful. Transformation of unprocessed data into meaningful information can be done with the help of data mining. In this study, classification methods fr...

Full description

Bibliographic Details
Main Authors:	İrem Duzdar Argun, Tunahan Timuçin
Format:	Article
Language:	English
Published:	Düzce University 2021-04-01
Series:	Düzce Üniversitesi Bilim ve Teknoloji Dergisi
Subjects:	data mining classification credit approval seed value veri madenciliği sınıflandırma kredi onayı tohum değeri
Online Access:	https://dergipark.org.tr/tr/download/article-file/1353472

_version_	1797300824255234048
author	İrem Duzdar Argun Tunahan Timuçin
author_facet	İrem Duzdar Argun Tunahan Timuçin
author_sort	İrem Duzdar Argun
collection	DOAJ
description	After 2000s, Computer capacities and features are increased and access to data made easy. However, the produced and recorded data should be meaningful. Transformation of unprocessed data into meaningful information can be done with the help of data mining. In this study, classification methods from data mining applications are studied. First, the parameters that make the results of the same data set different were investigated on 4 different data mining tools (Weka, Rapid Miner, Knime, Orange), It has been tested with 3 different algorithms (K nearest neighborhood, Naive Bayes, Random Forest). In order to evaluate the performance of the data set while creating the classification models, the data set was divided into training data and test data as 80% -20%, 70% -30% and 60-40%. The accuracy, roc and precision values was used to test the performance of the classifying data. While classifying, the effect of algorithm parameters on the results is observed. The most important of these parameters is the initial seed value. The initial seed is a value using especially in classification algorithms that determines the initial placement of the data and directly affects the result. In this respect, it is very important to determine the initial seed value correctly. In this study, initial seed values between 0 and 100 were evaluated and it was shown that the classification could change the accuracy value approximately by 5%.
first_indexed	2024-03-07T23:12:40Z
format	Article
id	doaj.art-16724ce153754ae0832ac63881feacaa
institution	Directory Open Access Journal
issn	2148-2446
language	English
last_indexed	2024-03-07T23:12:40Z
publishDate	2021-04-01
publisher	Düzce University
record_format	Article
series	Düzce Üniversitesi Bilim ve Teknoloji Dergisi
spelling	doaj.art-16724ce153754ae0832ac63881feacaa2024-02-21T14:07:29ZengDüzce UniversityDüzce Üniversitesi Bilim ve Teknoloji Dergisi2148-24462021-04-019255556710.29130/dubited.81310197Initial Seed Value Effectiveness on Performances of Data Mining Algorithmsİrem Duzdar Argun0Tunahan Timuçin1DUZCE UNIVERSITYDÜZCE ÜNİVERSİTESİAfter 2000s, Computer capacities and features are increased and access to data made easy. However, the produced and recorded data should be meaningful. Transformation of unprocessed data into meaningful information can be done with the help of data mining. In this study, classification methods from data mining applications are studied. First, the parameters that make the results of the same data set different were investigated on 4 different data mining tools (Weka, Rapid Miner, Knime, Orange), It has been tested with 3 different algorithms (K nearest neighborhood, Naive Bayes, Random Forest). In order to evaluate the performance of the data set while creating the classification models, the data set was divided into training data and test data as 80% -20%, 70% -30% and 60-40%. The accuracy, roc and precision values was used to test the performance of the classifying data. While classifying, the effect of algorithm parameters on the results is observed. The most important of these parameters is the initial seed value. The initial seed is a value using especially in classification algorithms that determines the initial placement of the data and directly affects the result. In this respect, it is very important to determine the initial seed value correctly. In this study, initial seed values between 0 and 100 were evaluated and it was shown that the classification could change the accuracy value approximately by 5%.https://dergipark.org.tr/tr/download/article-file/1353472data miningclassificationcredit approvalseed valueveri madenciliğisınıflandırmakredi onayıtohum değeri
spellingShingle	İrem Duzdar Argun Tunahan Timuçin Initial Seed Value Effectiveness on Performances of Data Mining Algorithms Düzce Üniversitesi Bilim ve Teknoloji Dergisi data mining classification credit approval seed value veri madenciliği sınıflandırma kredi onayı tohum değeri
title	Initial Seed Value Effectiveness on Performances of Data Mining Algorithms
title_full	Initial Seed Value Effectiveness on Performances of Data Mining Algorithms
title_fullStr	Initial Seed Value Effectiveness on Performances of Data Mining Algorithms
title_full_unstemmed	Initial Seed Value Effectiveness on Performances of Data Mining Algorithms
title_short	Initial Seed Value Effectiveness on Performances of Data Mining Algorithms
title_sort	initial seed value effectiveness on performances of data mining algorithms
topic	data mining classification credit approval seed value veri madenciliği sınıflandırma kredi onayı tohum değeri
url	https://dergipark.org.tr/tr/download/article-file/1353472
work_keys_str_mv	AT iremduzdarargun initialseedvalueeffectivenessonperformancesofdataminingalgorithms AT tunahantimucin initialseedvalueeffectivenessonperformancesofdataminingalgorithms

Initial Seed Value Effectiveness on Performances of Data Mining Algorithms

Similar Items