Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

Moming Duan,Xianzhang Chen,Xinyuan Ji,Liang Liang,Duo Liu,Yujuan Tan,Yu Wu,Ao Ren

doi:10.1109/tpds.2021.3134263

Moming Duan, Xianzhang Chen + Show 6 more

Open Access

PDF Available

https://doi.org/10.1109/tpds.2021.3134263

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Federated Learning (FL) enables the multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data locally. Unlike the centralized training setting, the non-IID, imbalanced (statistical heterogeneity) and distribution shifted training data of FL is distributed in the federated network, which will increase the divergences between the local models and the global model, further degrading performance. In this paper, we propose a flexible clustered federated learning (CFL) framework named FlexCFL, in which we 1) group the training of clients based on the similarities between the clients’ optimization directions for lower training divergence; 2) implement an efficient newcomer device cold start mechanism for framework scalability and practicality; 3) flexibly migrate clients to meet the challenge of client-level data distribution shift. FlexCFL can achieve improvements by dividing joint optimization into groups of sub-optimization and can strike a balance between accuracy and communication efficiency in the distribution shift environment. The convergence and complexity are analyzed to demonstrate the efficiency of FlexCFL. We also evaluate FlexCFL on several open datasets and made comparisons with related CFL frameworks. The results show that FlexCFL can significantly improve absolute test accuracy by <inline-formula><tex-math notation="LaTeX">$+10.6\%$</tex-math></inline-formula> on FEMNIST compared with FedAvg , <inline-formula><tex-math notation="LaTeX">$+3.5\%$</tex-math></inline-formula> on FashionMNIST compared with FedProx , <inline-formula><tex-math notation="LaTeX">$+8.4\%$</tex-math></inline-formula> on MNIST compared with FeSEM , <inline-formula><tex-math notation="LaTeX">$+4.7\%$</tex-math></inline-formula> on Sentiment140 compare with IFCA . The experiment results show that FlexCFL is also communication efficient in the distribution shift environment.

Full Text