Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data

Minho Bae,Sangyoon Oh,Minjoong Jeong

doi:10.1109/access.2020.2987355

Minho Bae, Sangyoon Oh + Show 1 more

Open Access

https://doi.org/10.1109/access.2020.2987355

Copy DOI

Abstract

The increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology, where the size of each partition is balanced while the number of edge cuts is reduced. Moreover, a graph-partitioning algorithm should achieve high performance and scalability. In this study, we present a novel graph-partitioning algorithm that ensures a high edge cutting quality and excellent parallel processing performance. We apply formulas based on the label propagation algorithm to improve the quality of edge cuts and achieve fast convergence. In our approach, the necessity of applying the label propagation process for all vertices is removed, and the process is applied only for candidate vertices based on a score metric. Our proposed algorithm introduces a stabilization phase in which remote and highly connected vertices are relocated to prevent the algorithm from becoming trapped in local optima. Comparison results show that a prototype based on the proposed algorithm outperforms well-known parallel graph-partitioning frameworks in terms of speed and balance.

Highlights

Graph data have become increasingly important for applications in various fields, such as e-science, medical information systems, and social data management systems [1]
PARALLEL GRAPH-PARTITIONING ALGORITHM we describe the parallel processing of our graph-partitioning algorithm consisting of quick-converging label propagation’’ (QCLP), as well as the stabilization phase
Because QCLP and higher connectivity to remote vertices (HCRV) are similar but use different flows, we describe only those parts that differ from the QCLP phase

Summary

INTRODUCTION

Graph data have become increasingly important for applications in various fields, such as e-science, medical information systems, and social data management systems [1]. Many previous approaches for graph partitioning are based on a local search algorithm, such as the Kernighan–Lin (KL) [20] and Fiduccia–Mattheyses (FM) algorithms [21] These algorithms require considerable computation to obtain the optimal edge cut as the number of vertices, i.e., nodes, and partitions increases. We propose a novel parallel graphpartitioning algorithm that provides a low edge cut degree and high performance processing capability for large-scale graph data. Distributed machines need to share the new position, i.e., partition, or vertex score for the iteration in LP In this process, a correlation exists between the overhead of the data update frequency and the accuracy of the vertex position. M. Bae et al.: LP-Based Parallel Graph Partitioning for Large-Scale Graph Data vertex position being more accurate, which in turn increases the quality of the edge cut partitioning.

RELATED WORK

DEFINITION OF THE SCORE

10: Update T ScoreL

EXPERIMENT

Findings

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Parallel and External High Quality Graph Partitioning

-

01 Jan 2019
01 Jan 2019

OLPGP: An Optimized Label Propagation-Based Distributed Graph Partitioning Algorithm
Haoqing Ren ... Bin Wu
-
Haoqing Ren, et. al.Haoqing Ren ... Bin Wu
01 Jan 2021
01 Jan 2021

Distributed Application Global States Monitoring in PEGASUS DA Applied to Parallel Graph Partitioning
Adam Smyk ... Marek Tudruj
Concurrency and Computation: Practice and Experience | VOL. 33
Adam Smyk, et. al.Adam Smyk ... Marek Tudruj
13 Oct 2020
Concurrency and Computation: Practice and Experience | VOL. 33

Arbor: Efficient Large-Scale Graph Data Computing Model
Wei Zhou ... Bo Li
-
Wei Zhou, et. al.Wei Zhou ... Bo Li
01 Nov 2013
01 Nov 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access