Provable Privacy Guarantees Research Articles

Transportation networks are essential to the operation of societies and economies. Protecting sensitive information is an important concern in sustainable transport when mining transportation data. In data mining, differential privacy (DP) provides provable privacy guarantees for releasing sensitive data by introducing randomness into query results. However, it suffers from significant accuracy loss when the query has high sensitivity (e.g., triangle counting). The reason is that the range of the random perturbation applied to each query result in DP is too large: it consists of all possible output values for the query, which form a large or even unbounded interval. Yet when perturbation is imposed only in a small neighborhood of the true query result, the randomness-based similarity measure used in DP fails. We therefore introduce fuzziness into DP to formulate new models that use fuzzy similarity measures and cause smaller disturbance. In this article, we establish a novel and general theory of private data analysis, fuzzy differential privacy (FDP). FDP aims to achieve a more flexible tradeoff between the accuracy of outputs and the level of privacy protection. It combines DP with fuzzy set theory by introducing fuzziness into the query results and characterizing similarities between outputs via multiple fuzzy similarity measures. From this perspective, DP can be viewed as a special case of FDP with a probabilistic similarity measure. Compared with DP, FDP has three advantages: 1) most fuzzy similarity measures in FDP support the sliding-window perturbation strategies we propose, which perturb the query result only within a small neighborhood; 2) FDP adds noise to the query results according to only a fraction of all possible neighboring datasets; and 3) the fuzzy similarity, which takes values in [0,1], quantifies the privacy protection level intuitively. These three points enable more accurate outputs while providing provable and intuitive privacy guarantees. For subgraph counting, the state-of-the-art DP method is the ladder framework. We illustrate FDP mechanisms by applying them to a common subgraph-counting task: triangle and 4-clique counting. Experiments show that FDP is effective and efficient, with smaller output errors than DP.
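To make the accuracy problem described in the abstract above concrete, here is a minimal sketch of the standard Laplace mechanism applied to a counting query. It is not the FDP mechanism or the ladder framework from the article; the function names, the 100-node graph size, and the privacy budget are illustrative assumptions. Under edge-level neighboring graphs, adding one edge can create up to n - 2 triangles, so the noise scale for triangle counting grows with the number of nodes n.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def laplace_mechanism(true_value, sensitivity, epsilon, rng):
    # Standard epsilon-DP Laplace mechanism: noise scale = sensitivity / epsilon.
    return true_value + laplace_noise(sensitivity / epsilon, rng)


def count_triangles(edges):
    # `edges` is assumed to list each undirected edge exactly once.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    # Every triangle is seen once from each of its three edges.
    return sum(len(adj[u] & adj[v]) for u, v in edges) // 3


def mean_abs_error(sensitivity, epsilon, rng, trials=2000):
    # Empirical mean absolute error of the mechanism on a query answer of 0.
    return sum(abs(laplace_mechanism(0.0, sensitivity, epsilon, rng))
               for _ in range(trials)) / trials


if __name__ == "__main__":
    rng = random.Random(0)
    k4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
    print("triangles in K4:", count_triangles(k4))
    n = 100  # hypothetical graph size
    print("error, sensitivity 1:    ", mean_abs_error(1, 1.0, rng))
    print("error, sensitivity n - 2:", mean_abs_error(n - 2, 1.0, rng))
```

The expected absolute error of Laplace noise equals its scale, so the high-sensitivity query is perturbed roughly n - 2 times more than a sensitivity-1 count at the same epsilon. This is the gap that perturbing only within a small neighborhood is meant to narrow.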

When sharing relational databases with other parties, a database owner aims not only to provide a high-quality (high-utility) database to the recipients but also to have (i) privacy guarantees for the data entries and (ii) liability guarantees (via fingerprinting) in case of unauthorized redistribution. However, (i) and (ii) are orthogonal objectives: when sharing a database with multiple recipients, privacy via data sanitization requires adding noise once (and sharing the same noisy version with all recipients), whereas liability via unique fingerprint insertion requires adding different noise to each shared copy to distinguish the recipients. Although achieving (i) and (ii) together is possible in a naïve way (e.g., differentially-private database perturbation or synthesis followed by fingerprinting), this approach significantly degrades the utility of the shared databases. In this paper, we achieve privacy and liability guarantees simultaneously by proposing a novel entry-level differentially-private (DP) fingerprinting mechanism for relational databases that does not cause large utility degradation. The proposed mechanism fulfills the privacy and liability requirements by leveraging the randomized nature of fingerprinting and transforming it into provable privacy guarantees. Specifically, we devise a bit-level random response scheme that achieves a differential privacy guarantee for arbitrary data entries when sharing the entire database, and, based on this, we develop an ε-entry-level DP fingerprinting mechanism. We theoretically analyze the connections between privacy, fingerprint robustness, and database utility by deriving closed-form expressions. We also propose a sparse vector technique-based solution to control the cumulative privacy loss when fingerprinted copies of a database are shared with multiple recipients. We experimentally show that our mechanism achieves strong fingerprint robustness (e.g., the fingerprint cannot be compromised even if a malicious recipient modifies or distorts more than half of the entries in its fingerprinted copy) and higher database utility than various baseline methods (e.g., the application-dependent utility of the database shared by the proposed mechanism is higher than that of the considered baselines).
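The bit-level idea mentioned in the abstract above can be illustrated with textbook randomized response applied independently to each bit of an integer entry. This is only a sketch under assumptions: the paper's actual scheme, its fingerprint embedding, and its parameter choices are not reproduced here, and the function names and the flip probability 1 / (e^ε + 1) are the standard textbook choice rather than something stated in the abstract.

```python
import math
import random


def flip_probability(epsilon):
    # Flipping a bit with probability p = 1 / (e^eps + 1) makes the
    # likelihood ratio (1 - p) / p equal to e^eps for that single bit.
    return 1.0 / (math.exp(epsilon) + 1.0)


def randomized_response_bits(value, num_bits, epsilon, rng):
    # Independently flip each of the lowest `num_bits` bits of `value`.
    p = flip_probability(epsilon)
    mask = 0
    for i in range(num_bits):
        if rng.random() < p:
            mask |= 1 << i
    return value ^ mask


if __name__ == "__main__":
    rng = random.Random(42)
    entry = 0b1011  # hypothetical 4-bit database entry
    noisy = randomized_response_bits(entry, 4, 1.0, rng)
    print(format(entry, "04b"), "->", format(noisy, "04b"))
```

Because each recipient's copy can be produced with fresh random flips, the same randomness that hides an entry could also serve as a distinguishing mark, which is the connection between fingerprinting and privacy that the abstract describes. Note that the per-bit guarantee does not by itself give the entry-level bound; that analysis belongs to the paper.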

Articles published on Provable Privacy Guarantees

Task-Agnostic Privacy-Preserving Representation Learning for Federated Learning against Attribute Inference Attacks

PKDGAN: Private Knowledge Distillation With Generative Adversarial Networks

DProvDB: Differentially Private Query Processing with Multi-Analyst Provenance

Trajectory Data Collection with Local Differential Privacy

Privacy-Preserving federated learning: An application for big data load forecast in buildings

Fuzzy Differential Privacy Theory and Its Applications in Subgraph Counting

Privacy-Preserving Database Fingerprinting

Differentially private block coordinate descent

No Free Lunch Theorem for Security and Utility in Federated Learning

PADP-FedMeta: A personalized and adaptive differentially private federated meta learning mechanism for AIoT

Covariance’s Loss is Privacy’s Gain: Computationally Efficient, Private and Accurate Synthetic Data

Federating recommendations using differentially private prototypes

Fairness and Cost Constrained Privacy-Aware Record Linkage

Aggregation and Transformation of Vector-Valued Messages in the Shuffle Model of Differential Privacy

Preserving User Privacy in Personalized Networks

Secure DNA Motif-Finding Method Based on Sampling Candidate Pruning

A Comparison of Algorithms for Solving the Multiagent Simple Temporal Problem

Obfuscation of images via differential privacy: From facial images to general images

Differentially private regression analysis with dynamic privacy allocation

Privacy of Dependent Users Against Statistical Matching
