Enhancing Federated Learning With Server-Side Unlabeled Data by Adaptive Client and Data Selection

Yang Xu, Jianchun Liu, Lun Wang, Zhiyuan Wang, Liusheng Huang, Hongli Xu

doi:10.1109/tmc.2023.3265010

Abstract

Federated learning (FL) has been widely applied to collaboratively train deep learning (DL) models on massive numbers of end devices (i.e., clients). Due to limited storage capacity and high labeling cost, the data on each client may be insufficient for model training. Conversely, cloud datacenters hold large-scale unlabeled data, which are easy to collect from public sources (e.g., social media). Herein, we propose the Ada-FedSemi system, which leverages both on-device labeled data and in-cloud unlabeled data to boost the performance of DL models. In each round, local models are aggregated to produce pseudo-labels for the unlabeled data, which are utilized to enhance the global model. Considering that the number of participating clients and the quality of pseudo-labels have a significant impact on training performance, we introduce a multi-armed bandit (MAB) based online algorithm to adaptively determine the participating fraction and confidence threshold. Besides, to alleviate the impact of stragglers, we assign local models of different depths to heterogeneous clients. Extensive experiments on benchmark models and datasets show that, given the same resource budget, the model trained by Ada-FedSemi achieves 3% to 14.8% higher test accuracy than the baseline methods. When achieving the same test accuracy, Ada-FedSemi saves up to 48% of the training cost compared with the baselines. In scenarios with heterogeneous clients, the proposed HeteroAda-FedSemi can further speed up the training process by 1.3× to 1.5×.
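
The per-round step described in the abstract (aggregate local models, pseudo-label the unlabeled data, keep only confident labels) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the linear softmax classifier, the plain element-wise averaging in `average_weights`, the helper names, and the fixed threshold value. In Ada-FedSemi the confidence threshold and participating fraction are chosen adaptively by the MAB algorithm, which this sketch does not model.

```python
import math


def average_weights(client_weights):
    """Element-wise mean of per-client weight matrices (one row per class).

    Assumption: plain averaging stands in for the paper's aggregation step.
    """
    n = len(client_weights)
    rows = len(client_weights[0])
    cols = len(client_weights[0][0])
    return [
        [sum(w[r][c] for w in client_weights) / n for c in range(cols)]
        for r in range(rows)
    ]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pseudo_label(weights, samples, threshold):
    """Return (sample_index, label, confidence) for confident predictions only."""
    selected = []
    for i, x in enumerate(samples):
        # Linear classifier: one logit per class row.
        logits = [sum(wi * xi for wi, xi in zip(row, x)) for row in weights]
        probs = softmax(logits)
        conf = max(probs)
        if conf >= threshold:
            selected.append((i, probs.index(conf), conf))
    return selected


# Toy data (hypothetical): two clients, two classes, two features.
clients = [
    [[2.0, 0.0], [0.0, 2.0]],
    [[4.0, 0.0], [0.0, 4.0]],
]
global_w = average_weights(clients)
unlabeled = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]

# A high threshold drops the ambiguous middle sample.
picked = pseudo_label(global_w, unlabeled, threshold=0.9)
for idx, label, conf in picked:
    print(f"sample {idx}: pseudo-label {label}, confidence {conf:.3f}")
```

Raising the threshold yields fewer but more reliable pseudo-labels, while lowering it admits more samples at the risk of noisier labels. That trade-off is what motivates tuning the threshold online rather than fixing it in advance.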

More From: IEEE Transactions on Mobile Computing

Similar Papers

Enhancing Federated Learning with In-Cloud Unlabeled Data
Lun Wang ... Zhiyuan Wang
01 May 2022

A lightweight and privacy preserved federated learning ecosystem for analyzing verbal communication emotions in identical and non-identical databases
Muskan Chawla ... Shyama Barna Bhattacharjee
Measurement: Sensors | VOL. 34
26 Jun 2024

DESIGN: Online Device Selection and Edge Association for Federated Synergy Learning-enabled AIoT
Shucun Fu ... Dian Shen
ACM Transactions on Intelligent Systems and Technology
15 Jun 2024

A Greedy Agglomerative Framework for Clustered Federated Learning
Manan Mehta ... Chenhui Shao
IEEE Transactions on Industrial Informatics | VOL. 19
01 Dec 2023
