Abstract

Truth discovery is an effective way to identify the aggregated truth of each task from multiple observations provided by workers of varying reliability. However, existing studies do not sufficiently protect individuals’ privacy, as they either guarantee only weaker versions of local differential privacy (LDP) or may assume that the tasks are independent. In this paper, we investigate, for the first time, the problem of truth discovery while achieving rigorous LDP for each worker with continuous inputs and without the independence assumption. We present a locally differentially private truth discovery approach called <i>PrivTDSI</i>, based on sampling and inference, with solid privacy and utility guarantees. In <i>PrivTDSI</i>, the server first determines which values of each worker should be sampled according to a sample proportion and sends the indices of these values to each worker. Then, each worker adds noise to the sampled values for privacy protection and uploads them to the server. After receiving the noisy sampled values from all the workers, the server first infers the unsampled values and then conducts truth discovery based on both the noisy sampled values and the inferred values. In particular, to determine the sample proportion, we formulate a <i>constrained nonlinear programming</i> problem and give a closed-form solution to it. Moreover, to determine which values of each worker should be sampled while avoiding the situation where the values of some workers or tasks are not sampled at all, we develop a two-stage sampling method called <i>TOSS</i>. Furthermore, to infer the unsampled values accurately, we design a quality-aware inference method based on matrix factorization called <i>QualityMF</i>. Experimental results on two real-world datasets and a synthetic dataset demonstrate the effectiveness of <i>PrivTDSI</i>.
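The sampling and perturbation steps described above can be illustrated with a minimal sketch. It is not the paper's TOSS or its noise mechanism, whose details the abstract does not give. The function names `two_stage_mask` and `perturb_row`, the known value range `[lo, hi]`, and the use of Laplace noise calibrated to the number of sampled values are all assumptions made for illustration.

```python
import numpy as np


def two_stage_mask(n_workers, n_tasks, proportion, rng):
    """Hypothetical coverage-then-fill sampler (an assumption, not TOSS itself).

    Returns a boolean (n_workers, n_tasks) mask in which every worker and
    every task has at least one sampled entry.
    """
    mask = np.zeros((n_workers, n_tasks), dtype=bool)
    # Stage 1 (coverage): one random task per worker, then one random worker
    # for every task that is still uncovered.
    mask[np.arange(n_workers), rng.integers(0, n_tasks, size=n_workers)] = True
    for j in np.flatnonzero(~mask.any(axis=0)):
        mask[rng.integers(0, n_workers), j] = True
    # Stage 2 (fill): add uniformly random unsampled cells up to the target count.
    target = int(round(proportion * n_workers * n_tasks))
    extra = target - int(mask.sum())
    if extra > 0:
        free = np.flatnonzero(~mask.ravel())
        chosen = rng.choice(free, size=extra, replace=False)
        rows, cols = np.unravel_index(chosen, mask.shape)
        mask[rows, cols] = True
    return mask


def perturb_row(values, sampled, epsilon, lo, hi, rng):
    """Worker side: Laplace noise on the sampled values only (assumed mechanism).

    With k sampled values, each clipped to [lo, hi], the L1 sensitivity of the
    uploaded vector is k * (hi - lo), so the noise scale is k * (hi - lo) / epsilon.
    Unsampled positions are returned as NaN and are never uploaded.
    """
    k = int(sampled.sum())
    scale = (hi - lo) * k / epsilon
    noisy = np.full(values.shape, np.nan)
    noisy[sampled] = np.clip(values[sampled], lo, hi) + rng.laplace(0.0, scale, size=k)
    return noisy
```

Under these assumptions, a smaller sample proportion means fewer uploaded values per worker and therefore less noise per value, at the cost of more values that the server has to infer. This is the trade-off that the choice of sample proportion has to balance.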
