A Novel Privacy Disclosure Risk Measure and Optimizing Privacy Preserving Data Publishing Techniques

Marmar Orooji

doi:10.31390/gradschool_dissertations.5013

Abstract

A tremendous amount of individual-level data is generated each day, with a wide variety of uses. This data often contains sensitive information about individuals, which can be disclosed by “adversaries”. Even when direct identifiers such as social security numbers are masked, an adversary may be able to recognize an individual's identity for a data record by looking at the values of quasi-identifiers (QID), known as identity disclosure, or can uncover sensitive attributes (SA) about an individual through attribute disclosure. In data privacy field, multiple disclosure risk measures have been proposed. These share two drawbacks: they do not consider identity and attribute disclosure concurrently, and they make restrictive assumptions on an adversary's knowledge and disclosure target by assuming certain attributes are QIDs and SAs with clear boundary in between. In this study, we present a Flexible Adversary Disclosure Risk (FADR) measure that addresses these limitations, by presenting a single combined metric of identity and attribute disclosure, and considering all scenarios for an adversary’s knowledge and disclosure targets while providing the flexibility to model a specific disclosure preference. In addition, we employ FADR measure to develop our novel “RU Generalization” algorithm that anonymizes a sensitive dataset to be able to publish the data for public access while preserving the privacy of individuals in the dataset. The challenge is to preserve privacy without incurring excessive information loss. Our RU Generalization algorithm is a greedy heuristic algorithm, which aims at minimizing the combination of both disclosure risk and information loss, to obtain an optimized anonymized dataset. We have conducted a set of experiments on a benchmark dataset from 1994 Census database, to evaluate both our FADR measure and RU Generalization algorithm. We have shown the robustness of our FADR measure and the effectiveness of our RU Generalization algorithm by comparing with the benchmark anonymization algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Privacy Disclosure Risk Measure and Optimizing Privacy Preserving Data Publishing Techniques

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Privacy and confidentiality management for the microaggregation disclosure control method
Traian Marius Truta ... Farshad Fotouhi
-
Traian Marius Truta, et. al.Traian Marius Truta ... Farshad Fotouhi
30 Oct 2003
30 Oct 2003

Privacy Preserving Data Publishing through Slicing
Shivani Rohilla
American Journal of Networks and Communications | VOL. 4
Shivani RohillaShivani Rohilla
01 Jan 2015
American Journal of Networks and Communications | VOL. 4

A Measure of Disclosure Risk for Microdata
C J Skinner ... M J Elliot
Journal of the Royal Statistical Society Series B: Statistical Methodology | VOL. 64
C J Skinner, et. al.C J Skinner ... M J Elliot
01 Oct 2002
Journal of the Royal Statistical Society Series B: Statistical Methodology | VOL. 64

Measuring Disclosure Risk with Entropy in Population Based Frequency Tables
Laszlo Antal ... Mark Elliot
-
Laszlo Antal, et. al.Laszlo Antal ... Mark Elliot
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Privacy Disclosure Risk Measure and Optimizing Privacy Preserving Data Publishing Techniques

Abstract

Talk to us

Similar Papers