Clustering-oriented privacy-preserving data publishing

Weiwei Ni,Zhihong Chong

doi:10.1016/j.knosys.2012.05.012

Abstract

Privacy-preserving data publishing has attracted considerable research interests in recent years. One of the problems in such practices is how to trade-off between data utility and privacy protection. This problem heavily deteriorates when the published data are used to do cluster analysis; clustering demands differences between singles for grouping while privacy preserving aims to hide single identifications. In this paper, a mixed mode data obfuscation method AENDO is proposed, which provides a tradeoff strategy from a novel view. The underlying principle is to keep nearest neighborhood structures of data points while data are obfuscated. In particular, for each data point, AENDO differentiates its attributes into neighboring dispersed attributes and neighboring concentrated ones. Furthermore, pertinent statistical data substitution and data swapping strategies are applied to these attributes, respectively. An extensive set of experiments on UCI data sets are provided to assess the effectiveness of our solution, including comparing AENDO with RBT which is one of the best methods on maintaining data usability for clustering. Our results demonstrate that AENDO behaves similarly with RBT on maintaining data utility for clustering, while it outperforms NeNDS by a factor of approximate 10%. Meanwhile, it delivers better anti-inferring effect compared with RBT and NeNDS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Clustering-oriented privacy-preserving data publishing

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Journal: Knowledge-Based Systems	Publication Date: May 31, 2012
Citations: 15

Similar Papers

Personal big data pricing method based on differential privacy
Yuncheng Shen ... Yuming Jiang
Computers & Security | VOL. 113
Yuncheng Shen, et. al.Yuncheng Shen ... Yuming Jiang
01 Nov 2021
Computers & Security | VOL. 113

Extreme-Centroid Tree for Outlier Detection
Panote Songwattanasiri ... Krung Sinapiromsaran
-
Panote Songwattanasiri, et. al.Panote Songwattanasiri ... Krung Sinapiromsaran
12 Nov 2015
12 Nov 2015

Personalized Trajectory Privacy Protection Method Based on User-Requirement
Zhaowei Hu ... Jianpei Zhang
International Journal of Cooperative Information Systems | VOL. 27
Zhaowei Hu, et. al.Zhaowei Hu ... Jianpei Zhang
01 Sep 2018
International Journal of Cooperative Information Systems | VOL. 27

DP-QIC: A differential privacy scheme based on quasi-identifier classification for big data publication
Si Chen ... Mang Su
Soft Computing | VOL. 25
Si Chen, et. al.Si Chen ... Mang Su
12 Mar 2021
Soft Computing | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clustering-oriented privacy-preserving data publishing

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems