EDGES: An Efficient Distributed GraphEmbedding System on GPU clusters

Dongxu Yang,Junhong Liu,Junjie Lai

doi:10.1109/tpds.2020.3041219

Abstract

Graph embedding training models access parameters sparsely in a “one-hot” manner. Currently, the distributed graph embedding neural network is learned by data parallel with the parameter server, which suffers significant performance and scalability problems. In this article, we analyze the problems and characteristics of training this kind of models on distributed GPU clusters for the first time, and find that fixed model parameters scattered among different machine nodes are a major limiting factor for efficiency. Based on our observation, we develop an efficient distributed graph embedding system called EDGES, which can utilize GPU clusters to train large graph models with billions of nodes and trillions of edges using data and model parallelism. Within the system, we propose a novel dynamic partition architecture for training these models, achieving at least one half of communication reduction compared to existing training systems. According to our evaluations on real-world networks, our system delivers a competitive accuracy for the trained embeddings, and significantly accelerates the training process of the graph node embedding neural network, achieving a speedup of 7.23x and 18.6x over the existing fastest training system on single node and multi-node, respectively. As for the scalability, our experiments show that EDGES obtains a nearly linear speedup.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

EDGES: An Efficient Distributed GraphEmbedding System on GPU clusters

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Jan 1, 2020
Citations: 37

Similar Papers

Machine‐learning‐based methods for output‐only structural modal identification
Dawei Liu ... Zhiyi Tang
Structural Control and Health Monitoring | VOL. 28
Dawei Liu, et. al.Dawei Liu ... Zhiyi Tang
09 Sep 2021
Structural Control and Health Monitoring | VOL. 28

Visualizations of the training process of neural networks
Karlo Babic ... Ana Mestrovic
-
Karlo Babic, et. al.Karlo Babic ... Ana Mestrovic
01 May 2019
01 May 2019

Research on the application of genetic algorithm combined with the “cleft-overstep” algorithm for improving learning process of MLP neural network with special error surface
Cong Huu Nguyen ... Thanh Nga Thi Nguyen
-
Cong Huu Nguyen, et. al.Cong Huu Nguyen ... Thanh Nga Thi Nguyen
01 Jul 2011
01 Jul 2011

COREFERENT PAIRS DETECTION IN UKRAINIAN TEXTS USING A CONVOLUTIONAL NEURAL NETWORK
Sergiy Pogorilyy ... Artem Kramov
Visnyk Universytetu “Ukraina” | VOL. -
Sergiy Pogorilyy, et. al.Sergiy Pogorilyy ... Artem Kramov
01 Jan 2019
Visnyk Universytetu “Ukraina” | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

EDGES: An Efficient Distributed GraphEmbedding System on GPU clusters

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems