Identifying vulnerabilities in software code is crucial for ensuring the security of modern systems. However, manual detection requires expert knowledge and is time-consuming, underscoring the need for automated techniques. In this paper, we present SecureQwen, a novel vulnerability detection tool that leverages large language models (LLMs) with a context length of 64K tokens to identify potential security threats in large-scale Python codebases. Built on a decoder-only transformer architecture, SecureQwen captures complex relationships between code tokens, enabling accurate classification of vulnerable code sequences across 14 Common Weakness Enumerations (CWEs), including OS Command Injection, SQL Injection, Improper Check or Handling of Exceptional Conditions, Path Traversal, Broken or Risky Cryptographic Algorithm, Deserialization of Untrusted Data, and Cleartext Transmission of Sensitive Information. We evaluate SecureQwen on a large Python dataset of over 1.875 million function-level code snippets drawn from multiple sources, including GitHub repositories, Codeparrot's dataset, and synthetic data generated by GPT-4o. The experimental evaluation demonstrates strong performance, with F1 scores ranging from 84% to 99%. The results indicate that SecureQwen effectively detects vulnerabilities in both human-written and AI-generated code.
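To make the described setup concrete, the sketch below illustrates function-level CWE classification with a decoder-only LLM in the style the abstract describes. It is a minimal illustration, not the authors' released pipeline: the checkpoint name `Qwen/Qwen2-0.5B`, the 15-way label space (14 CWE classes plus a "not vulnerable" class), and the `classify_function` helper are all illustrative assumptions.

```python
# Minimal sketch of function-level vulnerability classification with a
# decoder-only LLM, assuming a HuggingFace-style sequence-classification head.
# The checkpoint name and label layout are hypothetical stand-ins, not the
# SecureQwen artifacts described in the paper.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "Qwen/Qwen2-0.5B"  # hypothetical stand-in checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME,
    num_labels=15,  # assumed: 14 CWE classes + one "not vulnerable" class
)
# Decoder-only models often lack a pad token; reuse EOS so the
# classification head can locate the last real token.
model.config.pad_token_id = tokenizer.pad_token_id or tokenizer.eos_token_id

def classify_function(code: str) -> int:
    """Return the predicted class index for one function-level snippet."""
    # Truncate to the model's long context window (64K tokens in the paper).
    inputs = tokenizer(code, truncation=True, max_length=65536, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    return int(logits.argmax(dim=-1))

# Example: a snippet exhibiting a potential OS Command Injection pattern.
snippet = "import os\ndef run(cmd):\n    os.system(cmd)"
print(classify_function(snippet))
```

In this setup, each function-level snippet is scored independently, which matches the abstract's framing of classifying individual code sequences; how the original system maps logits to CWE labels is an assumption here.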