Utility-Based Anonymization Using Generalization Boundaries to Protect Sensitive Attributes

Abou-El-Ela Abdou Hussien,Nagy Ramadan Darwish,Hesham A Hefny

doi:10.4236/jis.2015.63019

Abou-El-Ela Abdou Hussien, Nagy Ramadan Darwish + Show 1 more

Open Access

https://doi.org/10.4236/jis.2015.63019

Copy DOI

Journal: Journal of Information Security	Publication Date: Jan 1, 2015
Citations: 20	License type: CC BY 4.0

Affiliation: Cairo University

Abstract

Privacy preserving data mining (PPDM) has become more and more important because it allows sharing of privacy sensitive data for analytical purposes. A big number of privacy techniques were developed most of which used the k-anonymity property which have many shortcomings, so other privacy techniques were introduced (l-diversity, p-sensitive k-anonymity, (α, k)-anonymity, t-closeness, etc.). While they are different in their methods and quality of their results, they all focus first on masking the data, and then protecting the quality of the data. This paper is concerned with providing an enhanced privacy technique that combines some anonymity techniques to maintain both privacy and data utility by considering the sensitivity values of attributes in queries using sensitivity weights which determine taking in account utility-based anonymization and then only queries having sensitive attributes whose values exceed threshold are to be changed using generalization boundaries. The threshold value is calculated depending on the different weights assigned to individual attributes which take into account the utility of each attribute and those particular attributes whose total weights exceed the threshold values is changed using generalization boundaries and the other queries can be directly published. Experiment results using UT dallas anonymization toolbox on real data set adult database from the UC machine learning repository show that although the proposed technique preserves privacy, it also can maintain the utility of the publishing data.

Highlights

Many organizations collect and hold very large volumes of data like hospitals, credit card companies, real estate companies and search engines
Second: Modified Data Model According to our Utility-Based Anonymization Using Generalization Boundaries to protect Sensitive Attributes Depending on Attributes Sensitivity Weights developed in this paper
As shown in the related work, many techniques have been developed to automatically determine which part of database needs scrambling; others have been developed to scrambling database using generalization and suppression at all which leads to very height information loss and others making scrambling using generalization boundaries

Summary

Introduction

Many organizations collect and hold very large volumes of data like hospitals, credit card companies, real estate companies and search engines. They would like to publish the data for the purposes of data mining. Data mining is a technique for automatically and intelligently extracting information or knowledge from very large amount of data [1] [2]. When these data are released, it contains a lot of sensitive information. Most techniques use some form of transformation on the original data in order to maintain the privacy preservation. The transformed dataset could be available for mining and must achieve privacy requirements without affecting the mining benefits

Related Research Areas

K-Anonymity Technique

Constrained K-Anonymity

Sensitivity-Based Anonymity

Utility-Based Anonymization

UT DALLAS Anonymization Toolbox

26 K 33 k 66 k 34 k

Data Utility Measures

Proposed Technique

Main Procedures

Proposed Algorithm

Experimental Results and Analysis

Conclusions and Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Utility-Based Anonymization Using Generalization Boundaries to Protect Sensitive Attributes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Information Security

Lead the way for us

Similar Papers

PrivGuard: Sensitivity Guided Anonymization based PPDM with Automatic Selection of Sensitive Attributes
V. Uma Rani ... M. Sreenivasa Rao
-
V. Uma Rani, et. al.V. Uma Rani ... M. Sreenivasa Rao
19 Mar 2021
19 Mar 2021

Transformation approach for boolean attributes in privacy preserving data mining
Rupinder Kaur ... Meenakshi Bansal
-
Rupinder Kaur, et. al.Rupinder Kaur ... Meenakshi Bansal
01 Sep 2015
01 Sep 2015

Sensitive attribute based non-homogeneous anonymization for privacy preserving data mining
P Usha ... S Sathishkumar
-
P Usha, et. al.P Usha ... S Sathishkumar
01 Feb 2014
01 Feb 2014

HiMod-Pert: Histogram Modification Based Perturbation Approach for Privacy Preserving Data Mining
... Ravi Gulati
-
, et. al. ... Ravi Gulati
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Utility-Based Anonymization Using Generalization Boundaries to Protect Sensitive Attributes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Information Security