Abstract

TuX² is a new distributed graph engine that bridges graph computation and distributed machine learning. TuX² inherits the benefits of an elegant graph computation model, efficient graph layout, and balanced parallelism to scale to billion-edge graphs, while being extended and optimized for distributed machine learning: it supports heterogeneity in the data model, Stale Synchronous Parallel (SSP) scheduling, and a new Mini-batch, Exchange, GlobalSync, and Apply (MEGA) programming model. TuX² further introduces a hybrid vertex-cut graph optimization and supports various consistency models for fault tolerance in machine learning. We have developed a set of representative distributed machine learning algorithms in TuX², covering both supervised and unsupervised learning. Compared to their implementations on distributed machine learning platforms, writing these algorithms in TuX² takes only about 25 percent of the code: our graph computation model hides the detailed management of data layout, partitioning, and parallelism from developers. An extensive evaluation of TuX², using large datasets with up to 64 billion edges, shows that TuX² outperforms PowerGraph and PowerLyra, two state-of-the-art distributed graph engines, by an order of magnitude, while beating two state-of-the-art distributed machine learning systems by at least 60 percent.
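To make the MEGA phases named above concrete, the following is a minimal single-process sketch of how a Mini-batch, Exchange, GlobalSync, Apply loop might be structured. It is illustrative only: the abstract names the phases, but the type and function signatures here (MegaProgram, Exchange, GlobalSync, Apply as written) are assumptions, not TuX²'s actual API.

    // Hypothetical sketch of a MEGA-style (Mini-batch, Exchange, GlobalSync,
    // Apply) execution loop; names and signatures are illustrative only.
    #include <cstdio>
    #include <vector>

    struct Edge { int src, dst; double weight; };

    // One user-defined MEGA program: Exchange runs per edge in a mini-batch,
    // GlobalSync aggregates shared state across workers, Apply updates vertices.
    struct MegaProgram {
        std::vector<double> vertex_value;   // per-vertex model parameters
        double global_accum = 0.0;          // state reduced in GlobalSync

        void Exchange(const Edge& e) {
            // propagate data along an edge (e.g., a gradient contribution)
            global_accum += e.weight * vertex_value[e.src];
        }
        void GlobalSync() {
            // in a real engine this would be a cross-worker reduction;
            // here it is a no-op placeholder for the single-process sketch
        }
        void Apply(int v) {
            // update a vertex using the synchronized global state
            vertex_value[v] += 0.01 * global_accum;
        }
    };

    int main() {
        MegaProgram p;
        p.vertex_value = {1.0, 2.0, 3.0};
        std::vector<Edge> batch = {{0, 1, 0.5}, {1, 2, 0.25}};  // one mini-batch

        p.global_accum = 0.0;
        for (const Edge& e : batch) p.Exchange(e);  // Exchange phase
        p.GlobalSync();                             // GlobalSync phase
        for (int v = 0; v < 3; ++v) p.Apply(v);     // Apply phase

        std::printf("v0=%.3f accum=%.3f\n", p.vertex_value[0], p.global_accum);
        return 0;
    }

In a real distributed engine, GlobalSync would perform a cross-worker reduction of shared state, and under SSP scheduling workers could proceed with a bounded amount of staleness rather than waiting on the no-op barrier shown here.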
