Abstract

A major bottleneck in distributed learning is the communication overhead of exchanging intermediate model update parameters between the worker nodes and the parameter server. Recently, it has been found that local gradients at different worker nodes are correlated, so distributed source coding (DSC) can be applied to improve communication efficiency by exploiting this correlation. However, exploiting gradient correlations in distributed learning is highly non-trivial because the correlation is unknown and time-varying. In this paper, we first propose a DSC framework for distributed learning, named successive Wyner-Ziv coding, based on quantization and Slepian-Wolf (SW) coding. We prove that the proposed framework achieves the theoretically minimum communication cost from an information-theoretic perspective. We also propose a low-complexity, adaptive DSC scheme for distributed learning, comprising a gradient statistics estimator, a rate controller, and a log-likelihood ratio (LLR) computer. The gradient statistics estimator estimates the gradient statistics online using only the quantized gradients from previous iterations, so it introduces no extra communication cost. The computational complexity of the rate controller and the LLR computer is reduced to grow linearly with the number of worker nodes by introducing a semi-analytical Monte Carlo simulation. Finally, we design a DSC-based distributed learning process and show that the extra delay introduced by DSC does not scale with the number of worker nodes.
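To make the quantize-then-estimate idea concrete, the sketch below shows how a parameter server might maintain an online estimate of cross-worker gradient correlation using only the quantized gradients it already receives, which is the kind of statistic a rate controller could consume. This is a minimal illustration under assumed choices (uniform quantization, an exponential moving average, and the names `quantize`, `dequantize`, `GradientStatsEstimator`); it is not the paper's actual scheme, which uses successive Wyner-Ziv/SW coding rather than plain quantization.

```python
# Minimal sketch (assumptions labeled): uniform quantizer plus an online
# cross-worker correlation estimator driven only by quantized gradients.
# All names and parameter choices here are illustrative, not from the paper.
import numpy as np

def quantize(grad, num_levels=16, clip=1.0):
    """Uniformly quantize a gradient vector to integer indices in [0, num_levels-1]."""
    step = 2 * clip / (num_levels - 1)
    clipped = np.clip(grad, -clip, clip)
    return np.round((clipped + clip) / step).astype(np.int32)

def dequantize(indices, num_levels=16, clip=1.0):
    """Map quantization indices back to representative gradient values."""
    step = 2 * clip / (num_levels - 1)
    return indices * step - clip

class GradientStatsEstimator:
    """Exponential moving estimate of the pairwise correlation between
    workers' gradients, computed only from their quantized values, so no
    extra communication beyond the quantized updates is needed."""
    def __init__(self, num_workers, momentum=0.9):
        self.momentum = momentum
        self.corr = np.eye(num_workers)  # running correlation estimate

    def update(self, quantized_grads, num_levels=16, clip=1.0):
        # Reconstruct approximate gradients from the received indices,
        # then blend the sample correlation into the running estimate.
        recon = np.stack([dequantize(q, num_levels, clip) for q in quantized_grads])
        sample_corr = np.corrcoef(recon)
        self.corr = self.momentum * self.corr + (1 - self.momentum) * sample_corr
        return self.corr

# Toy usage: two workers whose gradients share a common component.
rng = np.random.default_rng(0)
estimator = GradientStatsEstimator(num_workers=2)
for _ in range(100):
    common = rng.normal(size=1000)
    g1 = common + 0.3 * rng.normal(size=1000)
    g2 = common + 0.3 * rng.normal(size=1000)
    corr = estimator.update([quantize(g1), quantize(g2)])
print("estimated cross-worker correlation:", round(corr[0, 1], 3))
```

In this toy setup the estimator converges to a high correlation value, illustrating how gradient statistics can be tracked online from quantized updates alone; an actual rate controller would map such statistics to SW coding rates.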
