Data mining reduction methods and performances of rules

In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attr...

Full description

Bibliographic Details
Main Authors: Ahmad, Faudziah, Basir, Mohammad Aizat
Format: Conference or Workshop Item
Language:English
Published: 2009
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf
_version_ 1825803215160999936
author Ahmad, Faudziah
Basir, Mohammad Aizat
author_facet Ahmad, Faudziah
Basir, Mohammad Aizat
author_sort Ahmad, Faudziah
collection UUM
description In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attributes in terms of performance based on percentage of accuracy. The research adopts the Knowledge Discovery in Databases “KDD” methodology for analysis and applies various data mining techniques in the experiments.Data of 50 hardware dataset companies which, contains 31 attributes and 400 records have been used. In summary, results show that in terms of performance of rules, Genetic Algorithm has produced the highest number of rules followed by Johnson’s Algorithm and Holte’s 1R.The best classifier for extracting rules in this study is VOT (Voting of Object Tracking).In terms of performance of rules, best results comes from rules with 30 attributes, followed by rules with 1 intersection attribute and lastly rules with 3 intersection attributes. Among the three sets of attributes, the 3 intersection attributes are considered as the attributes that can be used as predictor attributes.
first_indexed 2024-07-04T05:53:12Z
format Conference or Workshop Item
id uum-13596
institution Universiti Utara Malaysia
language English
last_indexed 2024-07-04T05:53:12Z
publishDate 2009
record_format eprints
spelling uum-135962015-04-07T03:17:59Z https://repo.uum.edu.my/id/eprint/13596/ Data mining reduction methods and performances of rules Ahmad, Faudziah Basir, Mohammad Aizat QA76 Computer software In data mining the accuracy of models are associated with the strength of the rules.However, most machine learning techniques produce a large number of rules.The consequence is with large number of rules generated,processing time is much longer. This study examines rules of different lengths of attributes in terms of performance based on percentage of accuracy. The research adopts the Knowledge Discovery in Databases “KDD” methodology for analysis and applies various data mining techniques in the experiments.Data of 50 hardware dataset companies which, contains 31 attributes and 400 records have been used. In summary, results show that in terms of performance of rules, Genetic Algorithm has produced the highest number of rules followed by Johnson’s Algorithm and Holte’s 1R.The best classifier for extracting rules in this study is VOT (Voting of Object Tracking).In terms of performance of rules, best results comes from rules with 30 attributes, followed by rules with 1 intersection attribute and lastly rules with 3 intersection attributes. Among the three sets of attributes, the 3 intersection attributes are considered as the attributes that can be used as predictor attributes. 2009-06-24 Conference or Workshop Item PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf Ahmad, Faudziah and Basir, Mohammad Aizat (2009) Data mining reduction methods and performances of rules. In: International Conference on Computing and Informatics 2009 (ICOCI09), 24-25 June 2009, Legend Hotel, Kuala Lumpur. http://www.icoci.cms.net.my
spellingShingle QA76 Computer software
Ahmad, Faudziah
Basir, Mohammad Aizat
Data mining reduction methods and performances of rules
title Data mining reduction methods and performances of rules
title_full Data mining reduction methods and performances of rules
title_fullStr Data mining reduction methods and performances of rules
title_full_unstemmed Data mining reduction methods and performances of rules
title_short Data mining reduction methods and performances of rules
title_sort data mining reduction methods and performances of rules
topic QA76 Computer software
url https://repo.uum.edu.my/id/eprint/13596/1/PID263.pdf
work_keys_str_mv AT ahmadfaudziah dataminingreductionmethodsandperformancesofrules
AT basirmohammadaizat dataminingreductionmethodsandperformancesofrules