Parallel Shortest Path Graph Computations of United States Road Network Data on Apache Spark

Yasir Arfat,Rashid Mehmood,Aiiad Albeshri

doi:10.1007/978-3-319-94180-6_30

Abstract

Big data is being generated from various sources such as Internet of Things (IoT) and social media. Big data cannot be processed by traditional tools and technologies due to their properties, volume, velocity, veracity, and variety. Graphs are becoming increasingly popular to model real-world problems; the problems are typically large and, hence, give rise to large graphs, which could be analysed and solved using big data technologies. This paper explores the performance of single source shortest path graph computations using the Apache Spark big data platform. We use the United States road network data, modelled as graphs, and calculate shortest paths between vertices. The experiments are performed on the Aziz supercomputer (a Top500 machine). We solve problems of varying graph sizes, i.e. various states of the US, and analyse Spark’s parallelization behavior. As expected, the speedup is dependent on both the size of the data and the number of parallel nodes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallel Shortest Path Graph Computations of United States Road Network Data on Apache Spark

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Parallel Shortest Path Big Data Graph Computations of US Road Network Using Apache Spark: Survey, Architecture, and Evaluation
Yasir Arfat ... Rashid Mehmood
-
Yasir Arfat, et. al.Yasir Arfat ... Rashid Mehmood
21 Jun 2019
21 Jun 2019

Cloud computing and big data: Technologies and applications
Mostapha Zbakh ... Mohamed Essaaidi
Concurrency and Computation: Practice and Experience | VOL. 29
Mostapha Zbakh, et. al.Mostapha Zbakh ... Mohamed Essaaidi
29 Mar 2017
Concurrency and Computation: Practice and Experience | VOL. 29

A Research Roadmap of Big Data Clustering Algorithms for Future Internet of Things
Hind Bangui ... Barbora Buhnova
International Journal of Organizational and Collective Intelligence | VOL. 9
Hind Bangui, et. al.Hind Bangui ... Barbora Buhnova
01 Apr 2019
International Journal of Organizational and Collective Intelligence | VOL. 9

A comparative analysis of big data processing paradigms: Mapreduce vs. apache spark
Sifat Ibtisum ... S M Saokat Hossain
World Journal of Advanced Research and Reviews | VOL. 20
Sifat Ibtisum, et. al. Sifat Ibtisum ... S M Saokat Hossain
30 Oct 2023
World Journal of Advanced Research and Reviews | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel Shortest Path Graph Computations of United States Road Network Data on Apache Spark

Abstract

Talk to us

Similar Papers