Trends in big data analytics

Karthik Kambatla,Giorgos Kollias,Vipin Kumar,Ananth Grama

doi:10.1016/j.jpdc.2014.01.003

Abstract

One of the major applications of future generation parallel and distributed systems is in big-data analytics. Data repositories for such applications currently exceed exabytes and are rapidly increasing in size. Beyond their sheer magnitude, these datasets and associated applications’ considerations pose significant challenges for method and software development. Datasets are often distributed and their size and privacy considerations warrant distributed techniques. Data often resides on platforms with widely varying computational and network capabilities. Considerations of fault-tolerance, security, and access control are critical in many applications (Dean and Ghemawat, 2004; Apache hadoop). Analysis tasks often have hard deadlines, and data quality is a major concern in yet other applications. For most emerging applications, data-driven models and methods, capable of operating at scale, are as-yet unknown. Even when known methods can be scaled, validation of results is a major issue. Characteristics of hardware platforms and the software stack fundamentally impact data analytics. In this article, we provide an overview of the state-of-the-art and focus on emerging trends to highlight the hardware, software, and application landscape of big-data analytics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Trends in big data analytics

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing

Lead the way for us

Journal: Journal of Parallel and Distributed Computing	Publication Date: Feb 2, 2014
Citations: 698

Similar Papers

Big Data and the SP Theory of Intelligence
...
-
, et. al. ...
27 Apr 2016
27 Apr 2016

Data mining with big data
Xindong Wu ... Xingquan Zhu
IEEE Transactions on Knowledge and Data Engineering | VOL. 26
Xindong Wu, et. al. Xindong Wu ... Xingquan Zhu
01 Jan 2014
IEEE Transactions on Knowledge and Data Engineering | VOL. 26

Pushing the limits of solubility prediction via quality-oriented data selection.
Murat Cihan Sorkun ... Süleyman Er
iScience | VOL. 24
Murat Cihan Sorkun, et. al.Murat Cihan Sorkun ... Süleyman Er
17 Dec 2020
iScience | VOL. 24

Data Mining with Big Data Revolution Hybrid
R Elankavi ... R Udayakumar
International Journal on Smart Sensing and Intelligent Systems | VOL. 10
R Elankavi, et. al.R Elankavi ... R Udayakumar
01 Jan 2017
International Journal on Smart Sensing and Intelligent Systems | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Trends in big data analytics

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing