A one-shot federated transfer learning method using random forests (FTRF) is developed to improve prediction accuracy at a target data site by leveraging information from auxiliary sites. Both theoretical and numerical results show that the proposed federated transfer learning approach is at least as accurate as a model trained on the target data alone, regardless of possible data heterogeneity, including imbalanced and non-IID data distributions across sites and model misspecification. FTRF can evaluate the similarity between the target and auxiliary sites, enabling the target site to autonomously select information from more similar sites to enhance its predictive performance. To ensure communication efficiency, FTRF adopts a model-averaging strategy that requires a single round of communication between the target and auxiliary sites; only fitted models from the auxiliary sites are sent to the target site. Unlike traditional model averaging, FTRF incorporates both the predicted outcomes from other sites and the original variables when estimating the model-averaging weights, yielding variable-dependent weights that better exploit the auxiliary models to improve prediction. Five real-world data examples show that FTRF reduces prediction error by 2-40% compared with methods that do not utilize auxiliary information.
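The workflow described above can be sketched as follows. This is a minimal illustrative sketch, not the authors' exact FTRF algorithm: the synthetic data, site shifts, and the use of a second random forest as the variable-dependent combiner are all assumptions introduced here for illustration.

```python
# Hypothetical sketch of one-shot federated transfer with random forests.
# The weighting scheme below (a forest fit on stacked predictions plus the
# original covariates) is an illustrative stand-in for FTRF's weight estimation.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

def make_site(n, shift):
    """Simulate one site's data; `shift` induces heterogeneity across sites."""
    X = rng.normal(size=(n, 5))
    y = X[:, 0] + shift * X[:, 1] + 0.1 * rng.normal(size=n)
    return X, y

# Auxiliary sites each fit a local model once; only the fitted models are
# shipped to the target (a single round of communication).
aux_models = []
for shift in (0.8, 1.2, -1.0):  # heterogeneous auxiliary sites
    Xa, ya = make_site(500, shift)
    aux_models.append(
        RandomForestRegressor(n_estimators=50, random_state=0).fit(Xa, ya)
    )

# Target site: a smaller local sample and its own local model.
Xt, yt = make_site(100, 1.0)
target_model = RandomForestRegressor(n_estimators=50, random_state=0).fit(Xt, yt)

def features(X):
    """Stack local/auxiliary predictions with the original variables, so the
    learned combination weights can depend on x."""
    preds = [m.predict(X) for m in [target_model] + aux_models]
    return np.column_stack(preds + [X])

# The target estimates the variable-dependent weighting on its own data.
combiner = RandomForestRegressor(n_estimators=50, random_state=0).fit(features(Xt), yt)

# Prediction at the target site combines all shipped models.
Xnew, ynew = make_site(200, 1.0)
pred = combiner.predict(features(Xnew))
```

Because the combiner sees the original covariates alongside the site-level predictions, it can down-weight an auxiliary model in regions of the feature space where that site's data distribution differs from the target's, which is the intuition behind variable-dependent weighting.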