Locally Private High-Dimensional Crowdsourced Data Release Based on Copula Functions

Teng Wang, Wei Yu, Shusen Yang, Xuebin Ren, Xinyu Yang

doi:10.1109/tsc.2019.2961092

Abstract

With the increasing popularity of crowdsourcing services, high-dimensional crowdsourced data provides a wealth of knowledge. Nonetheless, unprecedented privacy threats to participants have emerged, due to complex correlations among multiple attributes and the vulnerabilities of untrusted crowdsourcing servers. Differential privacy-based paradigms have been proposed to release privacy-preserving datasets with statistical approximation. However, most existing schemes are limited when facing highly correlated attributes, and cannot prevent privacy threats from untrusted crowdsourcing servers. To address these issues, we propose two novel solutions, namely LoCop and DR_LoCop, which guarantee local differential privacy based on the randomized response technique while synthesizing and releasing high-dimensional crowdsourced data with high data utility. In particular, LoCop leverages copula theory to synthesize high-dimensional crowdsourced data from univariate marginal distributions and attribute dependence. The univariate marginal distributions are estimated by a Lasso-based regression algorithm from aggregated privacy-preserving bit strings. Dependencies among attributes are modeled as a multivariate Gaussian copula. Building on LoCop, the enhanced solution DR_LoCop not only takes advantage of the C-vine copula to reflect conditional dependencies among high-dimensional attributes, but also achieves dimension reduction. Extensive experiments on real-world datasets demonstrate that our solutions substantially outperform the state-of-the-art techniques in terms of both data utility and computational overhead.
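The two building blocks named in the abstract, randomized response on the user side and copula-based synthesis on the server side, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single binary attribute perturbed bit by bit (the paper aggregates bit strings and estimates marginals with Lasso regression), only two attributes, and a correlation `rho` that is simply given rather than estimated privately. All function names are hypothetical.

```python
import math
import random
from statistics import NormalDist


def perturb_bit(bit, epsilon, rng):
    """Binary randomized response: keep the true bit with probability e^eps / (e^eps + 1)."""
    keep = math.exp(epsilon) / (math.exp(epsilon) + 1.0)
    return bit if rng.random() < keep else 1 - bit


def estimate_frequency(reports, epsilon):
    """Unbiased estimate of the true fraction of 1s from the perturbed reports."""
    keep = math.exp(epsilon) / (math.exp(epsilon) + 1.0)
    observed = sum(reports) / len(reports)
    # E[observed] = f * keep + (1 - f) * (1 - keep), solved for f.
    return (observed - (1.0 - keep)) / (2.0 * keep - 1.0)


def inverse_cdf(marginal, u):
    """Map u in [0, 1] to a category index via the cumulative marginal distribution."""
    cumulative = 0.0
    for index, prob in enumerate(marginal):
        cumulative += prob
        if u <= cumulative:
            return index
    return len(marginal) - 1  # guard against floating-point rounding


def sample_gaussian_copula(marginal_a, marginal_b, rho, n, rng):
    """Draw n synthetic (a, b) records from a bivariate Gaussian copula (assumed rho)."""
    phi = NormalDist().cdf
    records = []
    for _ in range(n):
        g1, g2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        z1 = g1
        z2 = rho * g1 + math.sqrt(1.0 - rho * rho) * g2  # corr(z1, z2) = rho
        records.append((inverse_cdf(marginal_a, phi(z1)),
                        inverse_cdf(marginal_b, phi(z2))))
    return records


# Usage: 10,000 users, 30% of whom hold bit 1, each reporting with epsilon = 1.
rng = random.Random(0)
true_bits = [1] * 3000 + [0] * 7000
reports = [perturb_bit(b, 1.0, rng) for b in true_bits]
f_hat = min(max(estimate_frequency(reports, 1.0), 0.0), 1.0)  # clip to [0, 1]

# The server synthesizes records from the estimated marginal and an assumed dependence.
synthetic = sample_gaussian_copula([1.0 - f_hat, f_hat], [0.5, 0.5], 0.8, 5, rng)
```

The estimate is clipped to [0, 1] because the debiasing step can overshoot slightly on finite samples. A smaller `epsilon` gives stronger privacy and a noisier estimate.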

Journal: IEEE Transactions on Services Computing
Publication Date: Jan 10, 2020
Citations: 23

Similar Papers

Copula-Based Multi-Dimensional Crowdsourced Data Synthesis and Release with Local Privacy
Xinyu Yang ... Teng Wang
01 Dec 2017

Local Differential Privacy Protection of High-Dimensional Perceptual Data by the Refined Bayes Network
Chunhua Ju ... Gongxing Wu
Sensors | VOL. 20
29 Apr 2020

Bivariate copulas functions for flood frequency analysis
Norizzati Salleh ... Fadhilah Yusof
01 Jan 2015

Differential Privacy-Based Location Protection in Spatial Crowdsourcing
Jianhao Wei ... Jin Zhang
IEEE Transactions on Services Computing | VOL. 15
01 Jan 2021
