Data Reduction Techniques: A Comparative Study
Data preprocessing in general and data reduction in specific represent the main steps in data mining techniques and algorithms since data in real world due to its vastness, the analysis will take a long time to complete .Almost all mining techniques including classification, clustering, association...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Faculty of Computer Science and Mathematics, University of Kufa
2022-08-01
|
Series: | Journal of Kufa for Mathematics and Computer |
Subjects: | |
Online Access: | https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387 |
_version_ | 1797756481438744576 |
---|---|
author | Ahmed AlKarawi Kadhim AlJanabi |
author_facet | Ahmed AlKarawi Kadhim AlJanabi |
author_sort | Ahmed AlKarawi |
collection | DOAJ |
description |
Data preprocessing in general and data reduction in specific represent the main steps in data mining techniques and algorithms since data in real world due to its vastness, the analysis will take a long time to complete .Almost all mining techniques including classification, clustering, association and others have high time and space complexities due to the huge amount of data and the algorithm behavior itself. That is the reason why data reduction represent an important phase in Knowledge Discovery in Databases (KDD) process. Many researchers introduced important solutions in this field. The study in this paper represents a comparative study for about 22 research papers in data reduction fields that covers different data reduction techniques such as dimensionality reduction, numerisoty reduction, sampling, clustering data cube aggregation and other techniques. From the conducted study, it can be concluded that the appropriate technique that can be used in data reduction is highly dependent on the data type, the dataset size, the application goal, the availability of noise and outliers and the compromise between the reduced data and the knowledge required from the analysis
|
first_indexed | 2024-03-12T18:02:04Z |
format | Article |
id | doaj.art-0e727c779106407294c0dbeceb5fe5aa |
institution | Directory Open Access Journal |
issn | 2076-1171 2518-0010 |
language | English |
last_indexed | 2024-03-12T18:02:04Z |
publishDate | 2022-08-01 |
publisher | Faculty of Computer Science and Mathematics, University of Kufa |
record_format | Article |
series | Journal of Kufa for Mathematics and Computer |
spelling | doaj.art-0e727c779106407294c0dbeceb5fe5aa2023-08-02T09:35:49ZengFaculty of Computer Science and Mathematics, University of KufaJournal of Kufa for Mathematics and Computer2076-11712518-00102022-08-019210.31642/JoKMC/2018/090201Data Reduction Techniques: A Comparative StudyAhmed AlKarawi0Kadhim AlJanabi1Department of Computer Science, Faculty of CS and Mathematics, University of Kufa, ALNajaf, IraqDepartment of Computer Science, Faculty of CS and Mathematics, University of Kufa, ALNajaf, Iraq Data preprocessing in general and data reduction in specific represent the main steps in data mining techniques and algorithms since data in real world due to its vastness, the analysis will take a long time to complete .Almost all mining techniques including classification, clustering, association and others have high time and space complexities due to the huge amount of data and the algorithm behavior itself. That is the reason why data reduction represent an important phase in Knowledge Discovery in Databases (KDD) process. Many researchers introduced important solutions in this field. The study in this paper represents a comparative study for about 22 research papers in data reduction fields that covers different data reduction techniques such as dimensionality reduction, numerisoty reduction, sampling, clustering data cube aggregation and other techniques. From the conducted study, it can be concluded that the appropriate technique that can be used in data reduction is highly dependent on the data type, the dataset size, the application goal, the availability of noise and outliers and the compromise between the reduced data and the knowledge required from the analysis https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387Data MiningData PreprocessingData ReductionDimensionality Reduction |
spellingShingle | Ahmed AlKarawi Kadhim AlJanabi Data Reduction Techniques: A Comparative Study Journal of Kufa for Mathematics and Computer Data Mining Data Preprocessing Data Reduction Dimensionality Reduction |
title | Data Reduction Techniques: A Comparative Study |
title_full | Data Reduction Techniques: A Comparative Study |
title_fullStr | Data Reduction Techniques: A Comparative Study |
title_full_unstemmed | Data Reduction Techniques: A Comparative Study |
title_short | Data Reduction Techniques: A Comparative Study |
title_sort | data reduction techniques a comparative study |
topic | Data Mining Data Preprocessing Data Reduction Dimensionality Reduction |
url | https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387 |
work_keys_str_mv | AT ahmedalkarawi datareductiontechniquesacomparativestudy AT kadhimaljanabi datareductiontechniquesacomparativestudy |