Data Reduction Techniques: A Comparative Study

Ahmed Alkarawi, Kadhim Aljanabi

doi:10.31642/jokmc/2018/090201

Abstract

Data preprocessing in general, and data reduction in particular, are main steps in data mining techniques and algorithms, because real-world data is so vast that analysis takes a long time to complete. Almost all mining techniques, including classification, clustering, association, and others, have high time and space complexities because of the huge amount of data and the behavior of the algorithms themselves. For this reason, data reduction is an important phase in the Knowledge Discovery in Databases (KDD) process. Many researchers have introduced important solutions in this field. This paper presents a comparative study of about 22 research papers in the data reduction field, covering different data reduction techniques such as dimensionality reduction, numerosity reduction, sampling, clustering, data cube aggregation, and other techniques. From the study, it can be concluded that the appropriate data reduction technique depends heavily on the data type, the dataset size, the application goal, the presence of noise and outliers, and the trade-off between the reduced data and the knowledge required from the analysis.
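As a rough illustration of two of the techniques named in the abstract, the sketch below shows simple random sampling (a form of numerosity reduction) and a roll-up from quarterly to yearly totals (a form of data cube aggregation). It is not taken from the paper or from any of the surveyed works; the function names, the seed, and the sales records are hypothetical and chosen only for the example.

```python
import random
from collections import defaultdict


def simple_random_sample(rows, n, seed=0):
    """Numerosity reduction: keep n rows drawn without replacement."""
    rng = random.Random(seed)  # fixed seed (an assumption) for repeatability
    return rng.sample(rows, n)


def aggregate_by_year(rows):
    """Data cube aggregation: roll quarterly sales up to yearly totals."""
    totals = defaultdict(float)
    for year, _quarter, amount in rows:
        totals[year] += amount
    return dict(totals)


# Hypothetical quarterly sales records: (year, quarter, sales amount)
sales = [
    (2020, 1, 100.0), (2020, 2, 150.0), (2020, 3, 120.0), (2020, 4, 130.0),
    (2021, 1, 110.0), (2021, 2, 160.0), (2021, 3, 140.0), (2021, 4, 190.0),
]

sample = simple_random_sample(sales, 4)   # 8 rows reduced to 4
yearly = aggregate_by_year(sales)         # 8 rows reduced to 2 yearly totals

print(len(sample), "sampled rows")
print(yearly)
```

Sampling keeps individual records but loses some of them, while aggregation keeps every record's contribution but loses the quarterly detail. This mirrors the abstract's point that the suitable technique depends on what knowledge the analysis needs.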


Journal: Journal of Kufa for Mathematics and Computer
Publication Date: Aug 30, 2022
License type: CC BY 4.0

Similar Papers

Data mining and knowledge discovery for process monitoring and control, by X.Z. Wang, Advances in Industrial Control, Springer, London, 1999, pp. 1–251, ISBN 1‐85233‐137‐2
Matthew J Wade
International Journal of Adaptive Control and Signal Processing | VOL. 20
08 Aug 2006

Knowledge Discovery in Spatial Databases
Martin Ester ... Hans-Peter Kriegel
01 Jan 1998

Data Clutter Reduction in Sampling Technique
Nur Nina Manarina Jamalludin ... Ahmad Afif Ahmarofi
International Journal of Advanced Computer Science and Applications | VOL. 13
01 Jan 2021

KDD-Based Decision Making: A Conceptual Framework Model for Maternal Health and Child Immunization Databases
Sourabh Shastri ... Vibhakar Mansotra
01 Jan 2019
