Abstract
Federated learning is a decentralized learning approach that shows promise for preserving users' privacy by avoiding the sharing of local data. However, data heterogeneity across clients limits its application to wider scopes: heterogeneous data from diverse clients leads to weight divergence between local models and degrades the global performance of federated learning. To mitigate data heterogeneity, supplementing training data in federated learning has been explored and proven effective. However, traditional methods of supplementing data raise privacy concerns and increase learning costs. In this paper, we propose a solution that supplements training data with a generative model that is transparent to local clients. Both the training of the generative model and the storage of its supplementary data are kept on the server side. This approach avoids collecting auxiliary data directly from local clients, which reduces their privacy concerns and avoids imposing additional costs on them. To prevent unconstrained learning on the mixture of real and synthetic samples, we constrain the optimization of the global model with a distance term between the global model under training and the distribution of the aggregated global model. Extensive experiments have verified that the synthetic data from the generative model improve the performance of federated learning, especially in a heterogeneous environment.
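To make the described server-side procedure concrete, the following is a minimal sketch, not the authors' implementation, of one plausible reading of the abstract: the server aggregates client updates, then fine-tunes the global model on server-held synthetic samples while penalizing its distance from the aggregated weights. All names (server_update, synthetic_loader, lambda_dist) and the FedAvg-style equal-weight aggregation are assumptions for illustration only.

```python
# Hypothetical sketch of server-side supplementation with a distance constraint.
# Assumes: client_states is a list of state_dicts from local training, and
# synthetic_loader yields (x, y) batches drawn from the server-side generative model.
import copy
import torch
import torch.nn.functional as F

def server_update(global_model, client_states, synthetic_loader,
                  lambda_dist=0.1, lr=0.01, epochs=1):
    # 1) FedAvg-style aggregation of client weights (equal weighting assumed).
    avg_state = copy.deepcopy(client_states[0])
    for key in avg_state:
        avg_state[key] = torch.stack(
            [s[key].float() for s in client_states]).mean(dim=0)
    global_model.load_state_dict(avg_state)

    # Frozen copy of the aggregated weights serves as the anchor for the distance term.
    anchor = [p.detach().clone() for p in global_model.parameters()]

    # 2) Fine-tune on server-side synthetic data; the L2 distance to the
    #    aggregated model constrains how far the training model may drift.
    opt = torch.optim.SGD(global_model.parameters(), lr=lr)
    for _ in range(epochs):
        for x, y in synthetic_loader:
            opt.zero_grad()
            task_loss = F.cross_entropy(global_model(x), y)
            dist = sum(((p - a) ** 2).sum()
                       for p, a in zip(global_model.parameters(), anchor))
            (task_loss + lambda_dist * dist).backward()
            opt.step()
    return global_model
```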