Abstract
Deep neural networks (DNNs) have achieved remarkable success in many fields. However, large-scale DNNs incur substantial storage costs when snapshots are saved to guard against frequent cluster failures, and significant communication overheads when models are transmitted in Federated Learning (FL). Recently, several approaches, such as Delta-DNN and LC-Checkpoint, reduce the size of DNN snapshot storage by compressing the difference between two neighboring versions of a DNN (a.k.a. the delta). However, we observe that existing approaches, which apply traditional global lossy quantization techniques to the DNN's delta, cannot fully exploit the data similarity, since parameter value ranges vary among layers. To fully exploit the similarity within the delta model and improve the compression ratio, we propose QD-Compressor, a quantization-based local-sensitive delta compression approach built on a layer-wise local-sensitive quantization scheme and an error feedback mechanism. Specifically, the quantizers and the number of quantization bits adapt across layers based on the value distribution and weighted entropy of the delta's parameters. To prevent quantization error from degrading the performance of the restored model, an error feedback mechanism dynamically corrects the quantization error during training. Experiments on multiple popular DNNs and datasets show that QD-Compressor achieves a 7×-40× compression ratio in the model snapshot compression scenario, higher than state-of-the-art approaches, and an 11×-15× compression ratio on the residual model in the Federated Learning compression scenario.
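To make the two core ideas concrete, the sketch below illustrates layer-wise delta quantization with error feedback in plain NumPy. It is a minimal illustration only: the bit-allocation heuristic `choose_bits` (a coarse histogram-entropy rule) and the per-layer uniform quantizer are assumptions standing in for the paper's weighted-entropy criterion and actual quantizer design, which the abstract does not fully specify.

```python
# Illustrative sketch of layer-wise delta quantization with error feedback.
# choose_bits and the uniform quantizer are simplifying assumptions, not the
# paper's exact method.
import numpy as np

def choose_bits(delta, candidates=(2, 4, 8)):
    """Pick a per-layer bit width from the entropy of a coarse histogram
    of this layer's delta values (a stand-in for weighted entropy)."""
    hist, _ = np.histogram(delta, bins=256)
    p = hist[hist > 0] / delta.size
    entropy = -np.sum(p * np.log2(p))
    # Wider value spread (higher entropy) -> more quantization bits.
    for b in candidates:
        if entropy <= b:
            return b
    return candidates[-1]

def quantize_layer(delta, bits):
    """Uniform quantization over this layer's own value range, so the
    step size adapts per layer instead of using one global range."""
    lo, hi = delta.min(), delta.max()
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = np.round((delta - lo) / scale).astype(np.uint8)
    return codes, lo, scale

def dequantize_layer(codes, lo, scale):
    return codes.astype(np.float32) * scale + lo

def compress_delta(old_params, new_params, feedback):
    """Compress per-layer deltas; fold the previous round's quantization
    error (error feedback) into the current delta before quantizing."""
    compressed, new_feedback = {}, {}
    for name in new_params:
        delta = new_params[name] - old_params[name] + feedback.get(name, 0.0)
        bits = choose_bits(delta)
        codes, lo, scale = quantize_layer(delta, bits)
        # Residual carried forward so the error is corrected over time.
        new_feedback[name] = delta - dequantize_layer(codes, lo, scale)
        compressed[name] = (codes, lo, scale, bits)
    return compressed, new_feedback
```

In this sketch, each layer is quantized against its own value range rather than a single global one, which is the property the abstract identifies as missing from global quantization schemes; the residual returned in `new_feedback` is added into the next delta, so quantization error does not accumulate in the restored model.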