Data mining reduction methods and performances of rules

In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attr...

Full description

Bibliographic Details
Main Authors:	Ahmad, Faudziah, Basir, Mohammad Aizat
Format:	Conference or Workshop Item
Language:	English
Published:	2009
Subjects:	QA76 Computer software
Online Access:	https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf

_version_	1825803215160999936
author	Ahmad, Faudziah Basir, Mohammad Aizat
author_facet	Ahmad, Faudziah Basir, Mohammad Aizat
author_sort	Ahmad, Faudziah
collection	UUM
description	In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attributes in terms of performance based on percentage of accuracy. The research adopts the Knowledge Discovery in Databases “KDD” methodology for analysis and applies various data mining techniques in the experiments.Data of 50 hardware dataset companies which, contains 31 attributes and 400 records have been used. In summary, results show that in terms of performance of rules, Genetic Algorithm has produced the highest number of rules followed by Johnson’s Algorithm and Holte’s 1R.The best classifier for extracting rules in this study is VOT (Voting of Object Tracking).In terms of performance of rules, best results comes from rules with 30 attributes, followed by rules with 1 intersection attribute and lastly rules with 3 intersection attributes. Among the three sets of attributes, the 3 intersection attributes are considered as the attributes that can be used as predictor attributes.
first_indexed	2024-07-04T05:53:12Z
format	Conference or Workshop Item
id	uum-13596
institution	Universiti Utara Malaysia
language	English
last_indexed	2024-07-04T05:53:12Z
publishDate	2009
record_format	eprints
spelling	uum-135962015-04-07T03:17:59Z https://repo.uum.edu.my/id/eprint/13596/ Data mining reduction methods and performances of rules Ahmad, Faudziah Basir, Mohammad Aizat QA76 Computer software In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attributes in terms of performance based on percentage of accuracy. The research adopts the Knowledge Discovery in Databases “KDD” methodology for analysis and applies various data mining techniques in the experiments.Data of 50 hardware dataset companies which, contains 31 attributes and 400 records have been used. In summary, results show that in terms of performance of rules, Genetic Algorithm has produced the highest number of rules followed by Johnson’s Algorithm and Holte’s 1R.The best classifier for extracting rules in this study is VOT (Voting of Object Tracking).In terms of performance of rules, best results comes from rules with 30 attributes, followed by rules with 1 intersection attribute and lastly rules with 3 intersection attributes. Among the three sets of attributes, the 3 intersection attributes are considered as the attributes that can be used as predictor attributes. 2009-06-24 Conference or Workshop Item PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf Ahmad, Faudziah and Basir, Mohammad Aizat (2009) Data mining reduction methods and performances of rules. In: International Conference on Computing and Informatics 2009 (ICOCI09), 24-25 June 2009, Legend Hotel, Kuala Lumpur. http://www.icoci.cms.net.my
spellingShingle	QA76 Computer software Ahmad, Faudziah Basir, Mohammad Aizat Data mining reduction methods and performances of rules
title	Data mining reduction methods and performances of rules
title_full	Data mining reduction methods and performances of rules
title_fullStr	Data mining reduction methods and performances of rules
title_full_unstemmed	Data mining reduction methods and performances of rules
title_short	Data mining reduction methods and performances of rules
title_sort	data mining reduction methods and performances of rules
topic	QA76 Computer software
url	https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf
work_keys_str_mv	AT ahmadfaudziah dataminingreductionmethodsandperformancesofrules AT basirmohammadaizat dataminingreductionmethodsandperformancesofrules

Data mining reduction methods and performances of rules

Similar Items