Abstract

Federated Learning (FL) enables the multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data locally. Unlike the centralized training setting, the non-IID, imbalanced (statistical heterogeneity) and distribution shifted training data of FL is distributed in the federated network, which will increase the divergences between the local models and the global model, further degrading performance. In this paper, we propose a flexible clustered federated learning (CFL) framework named FlexCFL, in which we 1) group the training of clients based on the similarities between the clients’ optimization directions for lower training divergence; 2) implement an efficient newcomer device cold start mechanism for framework scalability and practicality; 3) flexibly migrate clients to meet the challenge of client-level data distribution shift. FlexCFL can achieve improvements by dividing joint optimization into groups of sub-optimization and can strike a balance between accuracy and communication efficiency in the distribution shift environment. The convergence and complexity are analyzed to demonstrate the efficiency of FlexCFL. We also evaluate FlexCFL on several open datasets and made comparisons with related CFL frameworks. The results show that FlexCFL can significantly improve absolute test accuracy by <inline-formula><tex-math notation="LaTeX">$+10.6\%$</tex-math></inline-formula> on FEMNIST compared with <i>FedAvg</i> , <inline-formula><tex-math notation="LaTeX">$+3.5\%$</tex-math></inline-formula> on FashionMNIST compared with <i>FedProx</i> , <inline-formula><tex-math notation="LaTeX">$+8.4\%$</tex-math></inline-formula> on MNIST compared with <i>FeSEM</i> , <inline-formula><tex-math notation="LaTeX">$+4.7\%$</tex-math></inline-formula> on Sentiment140 compare with <i>IFCA</i> . The experiment results show that FlexCFL is also communication efficient in the distribution shift environment.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call