Auction-Driven Utility Maximization for Multiple Federated Learning in Smart Healthcare
Federated Learning (FL), a decentralized Machine Learning (ML) approach, allows Wireless Body Area Network (WBAN) users to collaboratively train models while maintaining the privacy of their health data. With the rise of ML-powered smart healthcare applications and increasing demand for diverse services, the simultaneous training of multiple FL models using data from WBANs is becoming feasible. However, managing multiple FL models with distinct learning objectives presents challenges in designing incentives and selecting suitable users—an area that remains largely unexplored. Thus, this paper introduces an auction-based incentive mechanism and WBAN users selection framework to enable the parallel training of multiple FL models, ensuring privacy of data. An optimization problem is formulated to maximize system utility, incorporating a cost model that includes data collection, computation, communication, and privacy. The proposed auction-based algorithm integrates factors such as local model accuracy, user reputation, and data volume to solve this problem efficiently. Simulations and real-world health data analysis demonstrate that this approach improves average utility by 15.9% and 18.08% compared to state-of-the-art methods.
- Conference Article
82
- 10.1109/icc40277.2020.9148815
- Jun 1, 2020
In this paper, the convergence time of federated learning (FL), when deployed over a realistic wireless network, is studied. In particular, with the considered model, wireless users transmit their local FL models (trained using their locally collected data) to a base station (BS). The BS, acting as a central controller, generates a global FL model using the received local FL models and broadcasts it back to all users. Due to the limited number of resource blocks (RBs) in a wireless network, only a subset of users can be selected and transmit their local FL model parameters to the BS at each learning step. Meanwhile, since each user has unique training data samples and the BS must wait to receive all users' local FL models to generate the global FL model, the FL performance and convergence time will be significantly affected by the user selection scheme. In consequence, it is necessary to design an appropriate user selection scheme that enables all users to execute an FL scheme and efficiently train it. This joint learning, wireless resource allocation, and user selection problem is formulated as an optimization problem whose goal is to minimize the FL convergence time while optimizing the FL performance. To address this problem, a probabilistic user selection scheme is proposed using which the BS will connect to the users, whose local FL models have large effects on its global FL model, with high probabilities. Given the user selection policy, the uplink RB allocation can be determined. To further reduce the FL convergence time, artificial neural networks (ANNs) are used to estimate the local FL models of the users that are not allocated any RBs for local FL model transmission, which enables the BS to include more users' local FL models to generate the global FL model so as to improve the FL convergence speed and performance. Simulation results show that the proposed ANN-based FL scheme can reduce the FL convergence time by up to 53.8%, compared to a standard FL algorithm.
- Research Article
392
- 10.1109/twc.2020.3042530
- Dec 11, 2020
- IEEE Transactions on Wireless Communications
In this paper, the convergence time of federated learning (FL), when deployed over a realistic wireless network, is studied. In particular, a wireless network is considered in which wireless users transmit their local FL models (trained using their locally collected data) to a base station (BS). The BS, acting as a central controller, generates a global FL model using the received local FL models and broadcasts it back to all users. Due to the limited number of resource blocks (RBs) in a wireless network, only a subset of users can be selected to transmit their local FL model parameters to the BS at each learning step. Moreover, since each user has unique training data samples, the BS prefers to include all local user FL models to generate a converged global FL model. Hence, the FL training loss and convergence time will be significantly affected by the user selection scheme. Therefore, it is necessary to design an appropriate user selection scheme that can select the users who can contribute toward improving the FL convergence speed more frequently. This joint learning, wireless resource allocation, and user selection problem is formulated as an optimization problem whose goal is to minimize the FL convergence time and the FL training loss. To solve this problem, a probabilistic user selection scheme is proposed such that the BS is connected to the users whose local FL models have significant effects on the global FL model with high probabilities. Given the user selection policy, the uplink RB allocation can be determined. To further reduce the FL convergence time, artificial neural networks (ANNs) are used to estimate the local FL models of the users that are not allocated any RBs for local FL model transmission at each given learning step, which enables the BS to improve the global model, the FL convergence speed, and the training loss. Simulation results show that the proposed approach can reduce the FL convergence time by up to 56% and improve the accuracy of identifying handwritten digits by up to 3%, compared to a standard FL algorithm.
- Conference Article
14
- 10.1109/spawc48557.2020.9154300
- May 1, 2020
In this paper, the problem of training federated learning (FL) algorithms over a wireless network with mobile users is studied. In the considered model, several mobile users and a network base station (BS) cooperatively perform an FL algorithm. In particular, the wireless mobile users train their local FL models and send the trained local FL model parameters to the BS. The BS will then integrate the received local FL models to generate a global FL model and send it back to all users. Due to the limited training time at each iteration, the number of users that can transmit their local FL models to the BS will be affected by changes in the users' locations and wireless channels. In this paper, this joint learning, user selection, and wireless resource allocation problem is formulated as an optimization problem whose goal is to minimize the FL loss function, which captures the FL performance, while meeting the transmission delay requirement. To solve this problem, a closed-form expression for the expected convergence rate of the FL algorithm is first derived to quantify the impact of the users' mobility and wireless factors on FL. Then, based on the expected FL convergence rate, the user selection and uplink resource allocation is optimized at each FL iteration so as to minimize the FL loss function while satisfying the FL parameter transmission delay requirement. Simulation results show that the proposed approach can reduce the FL loss function value by up to 20% compared to a standard FL algorithm.
- Research Article
1
- 10.3390/electronics14081535
- Apr 10, 2025
- Electronics
With the requirements of government data protection regulations and industrial concerns regarding data protection and privacy, the security level required for data privacy and protection has increased. This has led researchers to investigate techniques that can train cybersecurity machine learning (ML) models without sharing personal data. Federated Learning (FL) is a newly developed decentralized and distributed ML mechanism that emphasize privacy. In this technique, a learning algorithm is trained without collecting or exchanging sensitive data from distributed client models running at different locations. With the rapid increase in the number of cybersecurity attacks reported in the aviation industry in the last two decades, strong, dynamic, and effective countermeasures are required to protect the aviation industry and air passengers against such attacks, which can most of the time lead to catastrophic situations. This paper proposes and implements an FL model for identifying cyberattacks on a Software Defined Network (SDN)-based aeronautical communication networks. The machine learning model used in the FL architecture is a Deep Neural Network (DNN) model. The publicly available National Security Laboratory–Knowledge Discovery and Datamining (NSL-KDD) dataset was employed to train and validate the proposed FL model. The simulation results illustrated that the FL-based system can accurately and effectively identify potential cybersecurity attacks and minimize the risk of data and service exposure without degrading model performance. A comparison was also made between the FL and non-FL machine learning models. Preliminary results demonstrated that the FL model outperformed the non-FL machine learning approaches. FL reached an accuracy of 96%, compared to 76% and 83% for NFL.
- Conference Article
105
- 10.1109/globecom38437.2019.9013160
- Dec 1, 2019
In this paper, the problem of training federated learning (FL) algorithms over a realistic wireless network is studied. In particular, in the considered model, wireless users perform an FL algorithm that trains their local FL models using their own data and send the trained local FL models to a base station (BS) that will generate a global FL model and send it back to the users. Since all training parameters are transmitted over wireless links, the quality of the training will be affected by wireless factors such as packet errors and availability of wireless resources. Meanwhile, due to the limited wireless bandwidth, the BS must select an appropriate subset of users to execute the FL learning algorithm so as to build a global FL model accurately. This joint learning, wireless resource allocation, and user selection problem is formulated as an optimization problem whose goal is to minimize an FL loss function that captures the performance of the FL algorithm. To address this problem, a closed-form expression for the expected convergence rate of the FL algorithm is first derived to quantify the impact of wireless factors on FL. Then, based on the expected convergence rate of the FL algorithm, the optimal transmit power for each user is derived, under a given user selection and uplink resource block (RB) allocation scheme. Finally, the user selection and uplink RB allocation is optimized so as to minimize the FL loss function. Simulation results show that the proposed joint federated learning and communication framework can reduce the FL loss function value by up to 10% and 16%, respectively, compared to 1) an optimal user selection algorithm with random resource allocation and 2) a random user selection and resource allocation algorithm.
- Research Article
50
- 10.1016/j.iswa.2022.200064
- May 1, 2022
- Intelligent Systems with Applications
• We propose an FL model to predict a client’s (or loan requester’s) financial situation by considering variant local epochs for the data holders of the clients (e.g., banks, financial organizations). • We leverage FL strategy that consider customer’s local resources to assign computational task during training. Particularly, the local computational tasks of each FL client is assigned based on their data volume, bandwidth, and network availability. • We analyze our prediction model by considering various batch sizes and client numbers for the training phase. • To the end, we visualize the performance of our FL model comparing with a centralized model, and also with a mean local model, and the best local model in an FL process. In recent years, as economic stability is shaking, and the unemployment rate is growing high due to the COVID-19 effect, assigning credit scoring by predicting consumers’ financial conditions has become more crucial. The conventional machine learning (ML) and deep learning approaches need to share customer’s sensitive information with an external credit bureau to generate a prediction model that opens up the door of privacy leakage. A recently invented privacy-preserving distributed ML scheme referred to as Federated learning (FL) enables generating a target model without sharing local information through on-device model training on edge resources. In this paper, we propose an FL-based application to predict customers’ financial issues by constructing a global learning model that is evolved based on the local models of the distributed agents. The local models are generated by the network agents using their on-device data and local resources. We used the FL concept because the learning strategy does not require sharing any data with the server or any other agent that ensures the preservation of customers’ sensitive data. To that end, we enable partial works from the weak agents that eliminate the issue if the model convergence is retarded due to straggler agents. We also leverage asynchronous FL that cut off the extra waiting time during global model generation. We simulated the performance of our FL model considering a popular dataset, Give me Some Credit (Freshcorn, 2017). We evaluated our proposed method considering a a different number of stragglers and setting up various computational tasks (e.g., local epoch, batch size), and simulated the training loss and testing accuracy of the prediction model. Finally, we compared the F1-score of our proposed model with the existing centralized and decentralized approaches. Our results show that our proposed model achieves an almost identical F1-score as like centralized model even when we set up a skew-level of more than 80 % and outperforms the state-of-the-art FL models by obtaining an average of 5 ∼ 6 % higher accuracy when we have resource-constrained agents within a learning environment.
- Research Article
10
- 10.1016/j.ophtha.2024.10.017
- Apr 1, 2025
- Ophthalmology
Privacy Preserving Technology using Federated Learning and Blockchain in protecting against Adversarial Attacks for Retinal Imaging
- Research Article
1539
- 10.1109/twc.2020.3024629
- Oct 2, 2020
- IEEE Transactions on Wireless Communications
In this article, the problem of training federated learning (FL) algorithms over a realistic wireless network is studied. In the considered model, wireless users execute an FL algorithm while training their local FL models using their own data and transmitting the trained local FL models to a base station (BS) that generates a global FL model and sends the model back to the users. Since all training parameters are transmitted over wireless links, the quality of training is affected by wireless factors such as packet errors and the availability of wireless resources. Meanwhile, due to the limited wireless bandwidth, the BS needs to select an appropriate subset of users to execute the FL algorithm so as to build a global FL model accurately. This joint learning, wireless resource allocation, and user selection problem is formulated as an optimization problem whose goal is to minimize an FL loss function that captures the performance of the FL algorithm. To seek the solution, a closed-form expression for the expected convergence rate of the FL algorithm is first derived to quantify the impact of wireless factors on FL. Then, based on the expected convergence rate of the FL algorithm, the optimal transmit power for each user is derived, under a given user selection and uplink resource block (RB) allocation scheme. Finally, the user selection and uplink RB allocation is optimized so as to minimize the FL loss function. Simulation results show that the proposed joint federated learning and communication framework can improve the identification accuracy by up to 1.4%, 3.5% and 4.1%, respectively, compared to: 1) An optimal user selection algorithm with random resource allocation, 2) a standard FL algorithm with random user selection and resource allocation, and 3) a wireless optimization algorithm that minimizes the sum packet error rates of all users while being agnostic to the FL parameters.
- Research Article
24
- 10.3390/electronics12194074
- Sep 28, 2023
- Electronics
During the COVID-19 pandemic, the urgency of effective testing strategies had never been more apparent. The fusion of Artificial Intelligence (AI) and Machine Learning (ML) models, particularly within medical imaging (e.g., chest X-rays), holds promise in smart healthcare systems. Deep Learning (DL), a subset of AI, has exhibited prowess in enhancing classification accuracy, a crucial aspect in expediting COVID-19 diagnosis. However, the journey to harness DL’s potential is rife with challenges: notably, the intricate landscape of medical data privacy. Striking a balance between utilizing patient data for insights while upholding privacy is formidable. Federated Learning (FL) emerges as a solution by enabling collaborative model training across decentralized data sources, thus bypassing data centralization and preserving data privacy. This study presents a tailored, collaborative FL architecture for COVID-19 screening via chest X-ray images. Designed to facilitate cooperation among medical institutions, the framework ensures patient data remain localized, eliminating the need for direct data sharing. Addressing imbalanced and non-identically distributed data, the architecture is a robust solution. Implementation entails localized and fog-computing-based FL models. Localized models utilize Convolutional Neural Networks (CNNs) on institution-specific datasets, while the FL model, refined iteratively, takes precedence in the final classification. Intriguingly, the global FL model, fortified by fog computing, emerges as the frontrunner in classification after weight refinement, surpassing local models. Validation within the COLAB platform gauges the model’s performance through metrics such as accuracy, precision, recall, and F1-score. Remarkably, the proposed model excels across these metrics, solidifying its efficacy. This research navigates the confluence of AI, FL, and medical imaging, unveiling insights that could reshape healthcare delivery. The study enriches scientific discourse by addressing data privacy in collaborative learning and carries potential implications for enhanced patient care.
- Research Article
- 10.3389/fnagi.2026.1766599
- Mar 9, 2026
- Frontiers in Aging Neuroscience
IntroductionCloud-based artificial intelligence (AI) combined with smart-health technology presents a powerful tool to passively monitor disease severity. However, current methods raise privacy concerns as they require transmitting patient data to the cloud. A potential solution is Federated Learning (FL), which only shares the weights of locally trained neural networks (NNs) instead of user data. Here, we simulated an FL system to demonstrate its application for evaluating Parkinson’s disease (PD) severity in a smart-home scenario.MethodsRetrospective data including 89 people with PD wore an accelerometer on the lower-back at home for 7 days at 18-month intervals over 6 years. Patient characteristics (age, sex, and body mass index) and clinical measures of PD were additionally collected, including the Movement Disorder Society Unified Parkinson’s Disease Rating Scale (MDS-UPDRS)-Part III. Real-world daily gait measures along with these patient characteristics were used to predict the MDS-UPDRS-III score. For FL, a local model was trained for each participant, and a global model (an aggregation of these local models) was tested on unseen participants.ResultsThe performance of a simulated FL system was compared with that of a traditional Machine Learning (ML) approach in which patient data were shared. The traditional ML approach had a mean absolute error (MAE) of 10.43. The global FL model had a similar MAE of 10.22 but was underfitted, and the mean MAE of the local, personalised models was 4.83. Shapley Additive exPlanations (SHAP) analysis showed that while the participants’ age and sex were very important in traditional ML, this was not the case for the local FL models, leading to a decrease in global model performance. Here, we show that reserving a small number of participants from the system and including them in training data for all local models restored the importance of these features and improved global FL performance (MAE = 9.26) but reduced local performance (MAE = 6.83).ConclusionThis exploratory study shows that our proposed approach enables FL to achieve similar accuracy to traditional Machine Learning without sharing any patient data but with costs to the local performance, leading towards a smart-home system that prioritises personalisation and patient privacy.
- Conference Article
8
- 10.1109/icdcs54860.2022.00088
- Jul 1, 2022
In Internet-of-Vehicles (IoV), smart vehicles can efficiently process various sensing data through federated learning (FL) - a privacy-preserving distributed machine learning (ML) approach that allows collaborative development of the shared ML model without any data exchange. However, traditional FL approaches suffer from poor security against the system noise, e.g., due to low-quality trained data, wireless channel errors, and malicious vehicles generating erroneous results, which affects the accuracy of the developed ML model. To address this problem, we propose a novel FL model based on the concept of Lagrange coded computing (LCC) - a coded distributed computing (CDC) scheme that enables enhancing the system security. In particular, we design the first L-CoFL (Lagrange coded FL) model to improve the accuracy of FL computations in the presence of lowquality trained data and wireless channel errors, and guarantee the system security against malicious vehicles. We apply the proposed L-CoFL model to predict the traffic slowness in IoV and verify the superior performance of our model through extensive simulations.
- Research Article
13
- 10.1109/access.2023.3289220
- Jan 1, 2023
- IEEE Access
The past decade has seen substantial growth in the prevalence and capabilities of wearable devices. For instance, recent human activity recognition (HAR) research has explored using wearable devices in applications such as remote monitoring of patients, detection of gait abnormalities, and cognitive disease identification. However, data collection poses a major challenge in developing HAR systems, especially because of the need to store data at a central location. This raises privacy concerns and makes continuous data collection difficult and expensive due to the high cost of transferring data from a user’s wearable device to a central repository. Considering this, we explore the adoption of federated learning (FL) as a potential solution to address the privacy and cost issues associated with data collection in HAR. More specifically, we investigate the performance and behavioral differences between FL and deep learning (DL) HAR models, under various conditions relevant to real-world deployments. Namely, we explore the differences between the two types of models when (i) using data from different sensor placements, (ii) having access to users with data from heterogeneous sensor placements, (iii) considering bandwidth efficiency, and (iv) dealing with data with incorrect labels. Our results show that FL models suffer from a consistent performance deficit in comparison to their DL counterparts, but achieve these results with much better bandwidth efficiency. Furthermore, we observe that FL models exhibit very similar responses to those of DL models when exposed to data from heterogeneous sensor placements. Finally, we show that the FL models are more robust to data with incorrect labels than their centralized DL counterparts.
- Conference Article
4
- 10.1109/iwcmc55113.2022.9825004
- May 30, 2022
Federated Learning (FL) is one of the hot research topics, and it utilizes Machine Learning (ML) in a distributed manner without directly accessing private data on clients. How-ever, FL faces many challenges, including the difficulty to obtain high accuracy, high communication cost between clients and the server, and security attacks related to adversarial ML. To tackle these three challenges, we propose an FL algorithm inspired by evolutionary techniques. The proposed algorithm groups clients randomly in many clusters, each with a model selected randomly to explore the performance of different models. The clusters are then trained in a repetitive process where the worst performing cluster is removed in each iteration until one cluster remains. In each iteration, some clients are expelled from clusters either due to using poisoned data or low performance. The surviving clients are exploited in the next iteration. The remaining cluster with surviving clients is then used for training the best FL model (i.e., remaining FL model). Communication cost is reduced since fewer clients are used in the final training of the FL model. To evaluate the performance of the proposed algorithm, we conduct a number of experiments using FEMNIST dataset and compare the result against the random FL algorithm. The experimental results show that the proposed algorithm outperforms the baseline algorithm in terms of accuracy, communication cost, and security.
- Conference Article
182
- 10.1109/iwqos52092.2021.9521274
- Jun 25, 2021
Federated learning (FL) has recently emerged as a promising distributed machine learning (ML) paradigm. Practical needs of the right to be forgotten and countering data poisoning attacks call for efficient techniques that can remove, or unlearn, specific training data from the trained FL model. Existing unlearning techniques in the context of ML, however, are no longer in effect for FL, mainly due to the inherent distinction in the way how FL and ML learn from data. Therefore, how to enable efficient data removal from FL models remains largely under-explored. In this paper, we take the first step to fill this gap by presenting FedEraser, the first federated unlearning method-ology that can eliminate the influence of a federated client’s data on the global FL model while significantly reducing the time used for constructing the unlearned FL model. The basic idea of FedEraser is to trade the central server’s storage for unlearned model’s construction time, where FedEraser reconstructs the unlearned model by leveraging the historical parameter updates of federated clients that have been retained at the central server during the training process of FL. A novel calibration method is further developed to calibrate the retained updates, which are further used to promptly construct the unlearned model, yielding a significant speed-up to the reconstruction of the unlearned model while maintaining the model efficacy. Experiments on four realistic datasets demonstrate the effectiveness of FedEraser, with an expected speed-up of 4× compared with retraining from the scratch. We envision our work as an early step in FL towards compliance with legal and ethical criteria in a fair and transparent manner.
- Research Article
47
- 10.1016/j.future.2023.10.013
- Oct 31, 2023
- Future Generation Computer Systems
FederatedTrust: A solution for trustworthy federated learning