A Structure-Aware Storage Optimization for Out-of-Core Concurrent Graph Processing

Xiaofei Liao,Ligang He,Bingsheng He,Jin Zhao,Lin Gu,Yu Zhang,Hai Jin

doi:10.1109/tc.2021.3098976

Abstract

With the huge demand for graph analytics in many real-world applications, massive iterative graph processing jobs are concurrently performed on the same graphs and suffer from significant high data access cost. To lower the data access cost toward high performance, several out-of-core concurrent graph processing solutions are recently designed to handle concurrent jobs by enabling these jobs to share the accesses of the same graph data. However, the set of active vertices in each partition are usually different for various concurrent jobs and also evolve with time, where some high-degree ones (or called <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">hub-vertices</i> ) of these active vertices require more iterations to converge due to the power-law property of real-world graphs. In consequence, existing solutions still suffer from much unnecessary I/O traffic, because they have to entirely load each partition into the memory for concurrent jobs even if most vertices in this partition are inactive and may be shared by a few jobs. In this paper, we propose an efficient structure-aware storage system, called GraphSO, for higher throughput of the execution of concurrent graph processing jobs. It can be integrated into existing out-of-core graph processing systems to promote the execution efficiency of concurrent jobs with lower I/O overhead. The key design of GraphSO is a fine-grained storage management scheme. Specifically, it logically divides the partitions of existing graph processing systems into a series of small same-sized chunks. At runtime, these small chunks with active vertices are judiciously loaded by GraphSO to construct new logical partitions (i.e., each logical partition is a subset of active chunks) for existing graph processing systems to handle, where the most-frequently-used chunks are preferentially loaded to construct the logical partitions and the other ones are delayed to wait to be required by more jobs. In this way, it can effectively spare the cost of loading the graph data associated with the inactive vertices with low repartitioning overhead and can also enable the loaded graph data to be fully shared by concurrent jobs. Moreover, GraphSO also designs a buffering strategy to efficiently cache the most-frequently-used chunks in the main memory to further minimize the I/O traffic by avoiding repeated load of them. Experimental results show that GraphSO improves the throughput of GridGraph, GraphChi, X-Stream, DynamicShards, LUMOS, Graphene, and Wonderland by 1.4-3.5 times, 2.1-4.3 times, 1.9-4.1 times, 1.9-2.9 times, 1.5-3.1 times, 1.3-1.5 times, and 1.3-2.7 times after integrating with them, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Structure-Aware Storage Optimization for Out-of-Core Concurrent Graph Processing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Journal: IEEE Transactions on Computers	Publication Date: Jul 1, 2022
Citations: 5

Similar Papers

GraphM
Jin Zhao ... Hai Jin
-
Jin Zhao, et. al.Jin Zhao ... Hai Jin
17 Nov 2019
17 Nov 2019

GraphCP: An I/O-Efficient Concurrent Graph Processing Framework
Xianghao Xu ... Yongxuan Zhang
-
Xianghao Xu, et. al.Xianghao Xu ... Yongxuan Zhang
25 Jun 2021
25 Jun 2021

GGraph: An Efficient Structure-Aware Approach for Iterative Graph Processing
Beibei Si ... Xiaofei Liao
IEEE Transactions on Big Data | VOL. 8
Beibei Si, et. al.Beibei Si ... Xiaofei Liao
26 Aug 2020
IEEE Transactions on Big Data | VOL. 8

LCCG
Jin Zhao ... Hai Jin
-
Jin Zhao, et. al.Jin Zhao ... Hai Jin
13 Nov 2021
13 Nov 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Structure-Aware Storage Optimization for Out-of-Core Concurrent Graph Processing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers