ThunderRW

Shixuan Sun, Yuchen Li, Bingsheng He, Yuhang Chen, Shengliang Lu

doi:10.14778/3476249.3476257

Abstract

Random walk (RW) is a powerful tool in many graph processing, mining, and learning applications. This paper proposes ThunderRW, an efficient in-memory random walk engine. Unlike existing parallel systems, which focus on improving the performance of a single graph operation, ThunderRW supports massively parallel random walks. Its core design is motivated by our profiling results: common RW algorithms have as many as 73.1% of CPU pipeline slots stalled due to irregular memory access, significantly more memory stalls than conventional graph workloads such as BFS and SSSP incur. To improve memory efficiency, we first design a generic step-centric programming model named Gather-Move-Update to abstract different RW algorithms. Based on this programming model, we develop the step interleaving technique, which hides memory access latency by switching between the executions of different random walk queries. In our experiments, we use four representative RW algorithms, PPR, DeepWalk, Node2Vec, and MetaPath, to demonstrate the efficiency and programming flexibility of ThunderRW. Experimental results show that ThunderRW outperforms state-of-the-art approaches by an order of magnitude and that the step interleaving technique significantly reduces CPU pipeline stalls from 73.1% to 15.0%.
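
The abstract names a step-centric Gather-Move-Update model and a step interleaving technique without giving their interfaces. The sketch below is one plausible reading, not ThunderRW's actual API. It assumes that Gather collects the weights of the current vertex's out-edges, Move samples one of them, and Update records the step and decides whether the walk continues. The toy graph, the function names, and the round-robin driver are all hypothetical. The driver only illustrates the scheduling idea of advancing each query by one step before switching to the next; plain Python cannot show the memory latency hiding that the paper reports.

```python
import random
from collections import deque

# Hypothetical toy graph: vertex -> list of (neighbor, edge weight) pairs.
GRAPH = {
    0: [(1, 1.0), (2, 1.0)],
    1: [(2, 2.0)],
    2: [(0, 1.0), (1, 1.0)],
    3: [],  # dead end: no out-edges
}


def gather(graph, vertex):
    """Gather (assumed meaning): collect the weighted out-edges of the current vertex."""
    return graph[vertex]


def move(edges, rng):
    """Move (assumed meaning): sample one out-edge in proportion to its weight."""
    neighbors = [v for v, _ in edges]
    weights = [w for _, w in edges]
    return rng.choices(neighbors, weights=weights, k=1)[0]


def update(walk, next_vertex, num_steps):
    """Update (assumed meaning): record the step; return True if the walk continues."""
    walk.append(next_vertex)
    return len(walk) - 1 < num_steps


def run_interleaved(graph, sources, num_steps, seed=0):
    """Advance many walk queries one step at a time in round-robin order."""
    rng = random.Random(seed)
    walks = [[s] for s in sources]
    if num_steps <= 0:
        return walks
    active = deque(range(len(walks)))
    while active:
        i = active.popleft()           # switch to the next pending query
        edges = gather(graph, walks[i][-1])
        if not edges:
            continue                   # dead end: this query terminates
        if update(walks[i], move(edges, rng), num_steps):
            active.append(i)           # re-queue the query for its next step
    return walks
```

For example, `run_interleaved(GRAPH, [0, 1, 3], num_steps=4)` returns three walks: the two starting at vertices 0 and 1 each contain five vertices, and the one starting at the dead-end vertex 3 stays at `[3]`.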

Journal: Proceedings of the VLDB Endowment
Publication Date: Jul 1, 2021
Citations: 16

Similar Papers

Fargraph+: Excavating the parallelism of graph processing workload on RDMA-based far memory system
Jing Wang ... Minyi Guo
Journal of Parallel and Distributed Computing | VOL. 177
10 Mar 2023

Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications
Jilan Lin ... Yuan Xie
01 Nov 2021

Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture
Jing Wang ... Taolei Wang
01 May 2022

Random Walks on Huge Graphs at Cache Efficiency
Ke Yang ... Saravanan Thirumuruganathan
26 Oct 2021
