Toward Fast and Scalable Random Walks over Disk-Resident Graphs via Efficient I/O Management

Rui Wang, Yinlong Xu, Yongkun Li, Hong Xie, John C S Lui, Shuibing He

doi:10.1145/3533579

Abstract

Traditional graph systems mainly use an iteration-based model, which iteratively loads graph blocks into memory for analysis so as to reduce random I/Os. However, this iteration-based model limits the efficiency and scalability of running random walks, a fundamental technique for analyzing large graphs. In this article, we first propose a state-aware I/O model to improve the I/O efficiency of running random walks. We then develop a block-centric indexing and buffering scheme for managing walk data, and leverage an asynchronous walk updating strategy to improve random walk efficiency. We implement an I/O-efficient graph system, GraphWalker, which efficiently handles very large disk-resident graphs and scales to tens of billions of random walks on a single commodity machine. Experiments show that GraphWalker achieves more than an order of magnitude speedup over DrunkardMob, which is tailored for random walks and built on the classical graph system GraphChi, as well as over two state-of-the-art single-machine graph systems, Graphene and GraFSoft. Furthermore, when compared with the most recent distributed system, KnightKing, GraphWalker still achieves comparable performance with only a single machine, making it a more cost-effective alternative.
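The abstract names three ideas without giving details, so the following is only a toy, in-memory sketch of how they might fit together, not GraphWalker's actual implementation. It assumes that "state-aware" means loading the block that currently holds the most pending walks, that "block-centric" walk management means keeping one walk pool per block, and that "asynchronous walk updating" means letting each walk keep stepping for as long as it stays inside the loaded block. The function `run_walks` and its parameters (`adj`, `block_of`, `starts`, `walk_len`) are hypothetical names introduced for illustration.

```python
import random
from collections import defaultdict


def run_walks(adj, block_of, starts, walk_len, seed=0):
    """Simulate block-by-block random walks; returns (end vertices, block loads).

    adj:      dict mapping vertex -> list of out-neighbors (assumed layout)
    block_of: dict mapping vertex -> id of the block that stores it
    starts:   list of start vertices, one entry per walk
    walk_len: number of steps each walk should take
    """
    rng = random.Random(seed)

    # Block-centric walk pool: block id -> list of (current vertex, steps taken).
    pool = defaultdict(list)
    for v in starts:
        pool[block_of[v]].append((v, 0))

    finished = []
    block_loads = 0  # stands in for the number of block I/Os

    while pool:
        # State-aware choice (assumption): load the block with the most walks.
        b = max(pool, key=lambda k: len(pool[k]))
        walks = pool.pop(b)
        block_loads += 1

        for v, steps in walks:
            # Asynchronous updating (assumption): keep moving this walk while
            # it remains inside the loaded block, instead of one step per pass.
            while steps < walk_len and block_of[v] == b:
                neighbors = adj[v]
                if not neighbors:
                    steps = walk_len  # dead end: treat the walk as finished
                    break
                v = rng.choice(neighbors)
                steps += 1

            if steps >= walk_len:
                finished.append(v)
            else:
                # The walk left block b; park it in its new block's pool.
                pool[block_of[v]].append((v, steps))

    return finished, block_loads


# Two blocks with no cross-block edges, so each block is loaded exactly once.
adj = {0: [1], 1: [0], 2: [3], 3: [2]}
block_of = {0: 0, 1: 0, 2: 1, 3: 1}
ends, loads = run_walks(adj, block_of, starts=[0, 0, 2], walk_len=4)
```

In this sketch a walk that crosses into another block is parked until that block is chosen, which is why the counter of block loads, rather than the number of walk steps, is the quantity a disk-based system would try to keep small.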

Journal: ACM Transactions on Storage
Publication Date: Nov 11, 2022
Citations: 2

Similar Papers

An I/O-efficient disk-based graph system for scalable second-order random walk of large graphs
Hongzheng Li ... Lei Chen
Proceedings of the VLDB Endowment | VOL. 15
01 Apr 2022

Graph-XLL: a Graph Library for Extra Large Graph Analytics on a Single Machine
Jian Wu ... Venkatesh Srinivasan
01 Jul 2019

Fast incremental proximity search in large graphs
Purnamrita Sarkar ... Amit Prakash
01 Jan 2008

GO: Out-Of-Core Partitioning of Large Irregular Graphs
Gurneet Kaur ... Rajiv Gupta
01 Oct 2021
