Abstract
Recent advances in Convolutional Neural Networks (CNNs) have achieved remarkable success in numerous applications. The record-breaking performance of CNNs usually comes at prohibitive training cost, so all training data are typically processed on a powerful centralized server, which raises privacy concerns. Federated learning (FL) is a distributed machine learning method over mobile devices that trains a global model while keeping data decentralized on devices to preserve privacy. However, there are two major limitations to deploying FL on mobile clients. First, on the client side, the limited communication and computation resources of mobile devices cannot well support full training iterations. Second, on the server side, conventional FL aggregates a single common model for all clients without personalizing the model to each client, an important missing feature when clients have heterogeneous data distributions. In this work, we aim to enable low-cost personalized FL by focusing on the weight gradients, which are the most important parameters exchanged in FL and, meanwhile, dominate the computation and communication cost. We first observe that a client's calculated weight gradients are highly sparse, and that the sparse pattern in weight gradients can be predicted via very simple bit-wise operations on a sequence of bits (named a bit-stream) instead of expensive high-precision calculations. Furthermore, each client's uploaded weight gradients exhibit a unique pattern that reflects the distribution of its local training data. Guided by this pattern, each client can obtain a personalized aggregated model that fits its own data. Hence, we leverage bit-streams to predict weight-gradient sparsity for low-cost training on each device, and meanwhile use bit-streams to represent each client's unique sparse weight-gradient pattern, which guides model personalization.
In our experiments, the proposed framework improves computation efficiency by 3.5× on average (up to 4.2×) and reduces communication cost by 23% on average (up to 41%) while still achieving state-of-the-art personalized accuracy.
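The abstract does not spell out the bit-wise predictor, so the following is only a hypothetical sketch of the general idea: each weight keeps a short bit-stream of its recent gradient signs, and cheap XOR/popcount-style operations flag weights whose sign keeps flipping (i.e., gradients oscillating near zero) as "sparse" and skippable. The function name, window size, and flip threshold are all illustrative assumptions, not the paper's actual scheme.

```python
import numpy as np

def predict_sparse_mask(sign_bits: np.ndarray, window: int = 8) -> np.ndarray:
    """Hypothetical sketch: predict which weight gradients can be skipped
    using only bit-wise operations on each weight's gradient-sign bit-stream.

    sign_bits: uint8 array of shape (num_weights,); bit i of each entry is 1
    if the gradient was positive at step i (most recent step in the LSB).
    """
    # XOR each bit-stream with itself shifted by one to mark sign flips
    # between consecutive steps.
    flips = sign_bits ^ (sign_bits >> 1)
    pair_mask = (1 << (window - 1)) - 1  # window-1 adjacent bit pairs
    # Count flips inside the window (popcount per weight).
    flip_count = np.array([bin(int(f) & pair_mask).count("1") for f in flips])
    # Predict "sparse" when the sign flipped in most of the window:
    # the gradient is oscillating around zero, so its computation can be
    # skipped (assumed threshold: a strict majority of the pairs).
    return flip_count >= (window - 1) // 2 + 1
```

For example, a weight with alternating signs (`0b10101010`) would be predicted sparse, while one with a constant sign (`0b11111111`) would not; the actual predictor and thresholds in the paper may differ.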