Data Reduction Techniques: A Comparative Study

Data preprocessing in general, and data reduction in particular, are key steps in data mining, because real-world data is so vast that analysis takes a long time to complete. Almost all mining techniques, including classification, clustering, association...

Bibliographic Details
Main Authors: Ahmed AlKarawi, Kadhim AlJanabi
Format: Article
Language: English
Published: Faculty of Computer Science and Mathematics, University of Kufa 2022-08-01
Series: Journal of Kufa for Mathematics and Computer
Subjects: Data Mining, Data Preprocessing, Data Reduction, Dimensionality Reduction
Online Access: https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387
author Ahmed AlKarawi
Kadhim AlJanabi
collection DOAJ
description Data preprocessing in general, and data reduction in particular, are key steps in data mining, because real-world data is so vast that analysis takes a long time to complete. Almost all mining techniques, including classification, clustering, association, and others, have high time and space complexity owing to the huge amount of data and to the behavior of the algorithms themselves. This is why data reduction is an important phase in the Knowledge Discovery in Databases (KDD) process. Many researchers have introduced important solutions in this field. This paper presents a comparative study of about 22 research papers in the data reduction field, covering techniques such as dimensionality reduction, numerosity reduction, sampling, clustering, data cube aggregation, and others. The study concludes that the appropriate data reduction technique depends heavily on the data type, the dataset size, the application goal, the presence of noise and outliers, and the trade-off between the reduced data and the knowledge required from the analysis.
first_indexed 2024-03-12T18:02:04Z
format Article
id doaj.art-0e727c779106407294c0dbeceb5fe5aa
institution Directory Open Access Journal
issn 2076-1171
2518-0010
language English
last_indexed 2024-03-12T18:02:04Z
publishDate 2022-08-01
publisher Faculty of Computer Science and Mathematics, University of Kufa
record_format Article
series Journal of Kufa for Mathematics and Computer
spelling doaj.art-0e727c779106407294c0dbeceb5fe5aa
2023-08-02T09:35:49Z
eng
Faculty of Computer Science and Mathematics, University of Kufa
Journal of Kufa for Mathematics and Computer
2076-1171
2518-0010
2022-08-01
10.31642/JoKMC/2018/090201
Data Reduction Techniques: A Comparative Study
Ahmed AlKarawi (Department of Computer Science, Faculty of CS and Mathematics, University of Kufa, ALNajaf, Iraq)
Kadhim AlJanabi (Department of Computer Science, Faculty of CS and Mathematics, University of Kufa, ALNajaf, Iraq)
https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387
Data Mining
Data Preprocessing
Data Reduction
Dimensionality Reduction
title Data Reduction Techniques: A Comparative Study
topic Data Mining
Data Preprocessing
Data Reduction
Dimensionality Reduction
url https://journal.uokufa.edu.iq/index.php/jkmc/article/view/10387