Abstract

According to a survey conducted in 2021, users share about 4 petabytes of data on Facebook daily. This exponential growth of data (known as big data) plays a vital role in machine learning, internet of things (IoT), and business intelligence applications. Owing to this rapid growth, research on big data programming models has gained considerable interest over the past decade. Today, many programming paradigms exist for handling big data, and selecting an appropriate model for a project is critical to its success. This study provides an in-depth analysis of big data programming models such as MapReduce, Directed Acyclic Graph (DAG), Bulk Synchronous Parallel (BSP), and SQL. We conduct a comparative study of distributed and parallel big data programming models and categorize them into three classes: traditional data processing, graph-based processing, and query-based processing models. Furthermore, we evaluate these programming models against parameters such as performance, data processing, storage, fault tolerance, supported languages, and machine learning support. Finally, we highlight the benchmark datasets used for big data programming models and discuss the challenges of these models along with future directions for the research community.
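To make the MapReduce paradigm named above concrete, the following is a minimal, single-process Python sketch of the map-shuffle-reduce pattern applied to word counting. It is purely illustrative: the function names and sample input are assumptions for this sketch and are not taken from the paper or tied to any specific framework.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Map phase: emit (word, 1) pairs for every word in every input line.
def map_phase(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    for line in lines:
        for word in line.split():
            yield word.lower(), 1

# Shuffle phase: group the intermediate pairs by key.
def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

# Reduce phase: aggregate the grouped values for each key.
def reduce_phase(groups: Dict[str, List[int]]) -> Dict[str, int]:
    return {key: sum(values) for key, values in groups.items()}

if __name__ == "__main__":
    # Hypothetical sample input standing in for a distributed dataset.
    sample = ["big data needs big models", "data models scale"]
    counts = reduce_phase(shuffle_phase(map_phase(sample)))
    print(counts)  # {'big': 2, 'data': 2, 'needs': 1, 'models': 2, 'scale': 1}
```

In a real MapReduce system the map and reduce phases run in parallel across many nodes and the shuffle moves data over the network; this sketch only shows the programming contract the model exposes to the developer.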
