Big Data Reduction Methods: A Survey

Muhammad Habib Ur Rehman,Assad Abbas,Samee U Khan,Chee Sun Liew,Teh Ying Wah,Prem Prakash Jayaraman

doi:10.1007/s41019-016-0022-0

Muhammad Habib Ur Rehman, Assad Abbas + Show 4 more

Open Access

https://doi.org/10.1007/s41019-016-0022-0

Copy DOI

Abstract

Research on big data analytics is entering in the new phase called fast data where multiple gigabytes of data arrive in the big data systems every second. Modern big data systems collect inherently complex data streams due to the volume, velocity, value, variety, variability, and veracity in the acquired data and consequently give rise to the 6Vs of big data. The reduced and relevant data streams are perceived to be more useful than collecting raw, redundant, inconsistent, and noisy data. Another perspective for big data reduction is that the million variables big datasets cause the curse of dimensionality which requires unbounded computational resources to uncover actionable knowledge patterns. This article presents a review of methods that are used for big data reduction. It also presents a detailed taxonomic discussion of big data reduction methods including the network theory, big data compression, dimension reduction, redundancy elimination, data mining, and machine learning methods. In addition, the open research issues pertinent to the big data reduction are also highlighted.

Highlights

Big data is the aggregation of large-scale, voluminous, and multi-format data streams originated from heterogeneous and autonomous data sources [1]
The presented literature review reveals that there is no existing method that can handle the issue of big data complexity single-handedly by considering the all 6Vs of big data
The studies discussed in this article mainly focused on data reduction in terms of volume and variety

Summary

Introduction

Big data is the aggregation of large-scale, voluminous, and multi-format data streams originated from heterogeneous and autonomous data sources [1]. Dimension reduction techniques are useful to handle the heterogeneity and massiveness of big data by reducing million variable data into manageable size [8,9,10,11]. These techniques usually work at post-data collection phases. This article presents a thorough literature review of methods for big data reduction. We aim to present a detailed literature review that is articulated to highlight the existing methods relevant to big data reduction.

Big Data Complexity and the Need for Data Reduction

Network Theory

Compression

Methods

Data Preprocessing

Dimension Reduction

Open Research Issues

Findings

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data Science and Engineering	Publication Date: Dec 1, 2016
Citations: 131	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Big Data Reduction Methods: A Survey

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering

Lead the way for us

Similar Papers

A Secure High-Order Lanczos-Based Orthogonal Tensor SVD for Big Data Reduction in Cloud Environment
Guohui Dai ... Deqing Zou
IEEE Transactions on Big Data | VOL. 5
Guohui Dai, et. al.Guohui Dai ... Deqing Zou
01 Sep 2019
IEEE Transactions on Big Data | VOL. 5

Research on the detection of privacy information sharing behaviour of e-commerce users based on big data
Yingji Li ... Xuan Fu
International Journal of Autonomous and Adaptive Communications Systems | VOL. 15
Yingji Li, et. al.Yingji Li ... Xuan Fu
01 Jan 2021
International Journal of Autonomous and Adaptive Communications Systems | VOL. 15

Tensor-based Big Biometric Data Reduction in Cloud
Ronghao Zhang ... Laurence T Yang
IEEE Cloud Computing | VOL. 5
Ronghao Zhang, et. al.Ronghao Zhang ... Laurence T Yang
01 Jul 2018
IEEE Cloud Computing | VOL. 5

Big data reduction framework for value creation in sustainable enterprises
Victor Chang ... Aisha Batool
International Journal of Information Management | VOL. 36
Victor Chang, et. al.Victor Chang ... Aisha Batool
18 Jun 2016
International Journal of Information Management | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Big Data Reduction Methods: A Survey

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering