Graph Reachability on Parallel Many-Core Architectures

Stefano Quer,Andrea Calabrese

doi:10.3390/computation8040103

Abstract

Many modern applications are modeled using graphs of some kind. Given a graph, reachability, that is, discovering whether there is a path between two given nodes, is a fundamental problem as well as one of the most important steps of many other algorithms. The rapid accumulation of very large graphs (up to tens of millions of vertices and edges) from a diversity of disciplines demand efficient and scalable solutions to the reachability problem. General-purpose computing has been successfully used on Graphics Processing Units (GPUs) to parallelize algorithms that present a high degree of regularity. In this paper, we extend the applicability of GPU processing to graph-based manipulation, by re-designing a simple but efficient state-of-the-art graph-labeling method, namely the GRAIL (Graph Reachability Indexing via RAndomized Interval) algorithm, to many-core CUDA-based GPUs. This algorithm firstly generates a label for each vertex of the graph, then it exploits these labels to answer reachability queries. Unfortunately, the original algorithm executes a sequence of depth-first visits which are intrinsically recursive and cannot be efficiently implemented on parallel systems. For that reason, we design an alternative approach in which a sequence of breadth-first visits substitute the original depth-first traversal to generate the labeling, and in which a high number of concurrent visits is exploited during query evaluation. The paper describes our strategy to re-design these steps, the difficulties we encountered to implement them, and the solutions adopted to overcome the main inefficiencies. To prove the validity of our approach, we compare (in terms of time and memory requirements) our GPU-based approach with the original sequential CPU-based tool. Finally, we report some hints on how to conduct further research in the area.

Highlights

The rapid accumulation of huge graphs from a diversity of disciplines, such as geographical navigation, social and biological networks, XML indexing, Internet routing, and databases, among others, requires fast and scalable graph algorithms
This difference, partially due to the different hardware architectures used in the two works, forced us to consider only large graphs and to avoid any further analysis of the small sparse and the small dense subsets
This paper focuses on designing a data-parallel version of a graph reachability algorithm

Summary

Introduction

The rapid accumulation of huge graphs from a diversity of disciplines, such as geographical navigation, social and biological networks, XML indexing, Internet routing, and databases, among others, requires fast and scalable graph algorithms. Traversal algorithms, such as breadth-first (BFS) and depth-first (DFS) searches, have linear time complexity (in the graph size) for each query. The algorithm presents linear time complexity and linear memory requirements, and, after a pre-processing phase, it is able to solve reachability queries with a cost varying from constant to linear in the graph size in terms of computation time. As our algorithm will perform four breadth-first traversals of the graph instead of one single depth-first visit, it will be essential to efficiently implement BFSs on GPUs. As a consequence, following other recent researchers in the area [13,19,20,21], we will dedicate part of our work to describe how to adapt state-of-the-art BFSs to general purpose GPU computing.

Related Works

Notation

GPUs and CUDA

The GRAIL Approach

Parallel Graph Exploration

Our GPU-Based Labeling Strategy

Step 1

Step 2

Step 3

Step 4

Low-Level Optimizations

Complexity Analysis and Memory Requirements

Our GPU-Based Searching Strategy

Experimental Results

GPU-Based Labeling

GPU-Based Search

Performance Analysis

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computation	Publication Date: Dec 2, 2020
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Graph Reachability on Parallel Many-Core Architectures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computation

Lead the way for us

Similar Papers

Accelerating aerial image simulation using improved CPU/GPU collaborative computing
Hongbo Zhang ... Chen Hu
Computers and Electrical Engineering | VOL. 46
Hongbo Zhang, et. al.Hongbo Zhang ... Chen Hu
19 Jun 2015
Computers and Electrical Engineering | VOL. 46

Context-free path querying by matrix multiplication
Semyon Grigorev ... Rustam Azimov
-
Semyon Grigorev, et. al.Semyon Grigorev ... Rustam Azimov
10 Jun 2018
10 Jun 2018

Accelerated rescaling of single Monte Carlo simulation runs with the Graphics Processing Unit (GPU)
Bernard Choi ... Owen Yang
Biomedical Optics Express | VOL. 4
Bernard Choi, et. al.Bernard Choi ... Owen Yang
29 Oct 2013
Biomedical Optics Express | VOL. 4

Accelerate the Execution of Graph Processing Using GPU
Sandip M Walunj ... Shweta Nitin Aher
-
Sandip M Walunj, et. al.Sandip M Walunj ... Shweta Nitin Aher
30 Dec 2019
30 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Graph Reachability on Parallel Many-Core Architectures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computation