Attribute Segregation based on Feature Ranking Framework for Privacy Preserving Data Mining

R Praveena Priyadarsini, M L Valarmathi, S Sivakumari

doi:10.17485/ijst/2015/v8i17/77584


Open Access

https://doi.org/10.17485/ijst/2015/v8i17/77584


Journal: Indian Journal of Science and Technology	Publication Date: Aug 6, 2015
Citations: 3	License type: cc-by

Abstract

Attributes in macro-data have to be segregated based on their sensitivity for privacy preservation purposes. Automating this attribute segregation becomes complicated in high-dimensional datasets and data streams. In this work, the information or correlation of each attribute with respect to the target class attribute is measured using the Information Gain [IG], Gain Ratio [GR] and Pearson Correlation [PC] ranker-based feature selection methods, and these values are used to segregate the attributes as Sensitive Attributes [SA], Quasi Identifiers [QI] and Non-Sensitive [NS] Attributes. The segregated attributes are subjected to various levels of privacy preservation using the proposed Double Layer Perturbation [DLP] and Single Layer Perturbation [SLP] algorithms to form the level-1 perturbed datasets. The level-1 perturbed dataset is further perturbed by applying the SLP algorithm to form the level-2 and level-3 privacy-preserved datasets. The multiple versions of the Adult dataset created in this way are distributed to data seekers based on their trust levels in a Multi Trust Level [MTL] environment. The privacy-preserved dataset versions created using the proposed algorithms are evaluated based on their utility, distortion and purity metrics. The results show that the ranker methods automatically identify attributes with sensitive content as either SA or QI, and that the proposed perturbed datasets have good utility on selected classification and clustering algorithms when compared to the original and L-diversified datasets. Also, the distortion values of these datasets indicate that they can prevent diversity attacks.
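The ranking step described in the abstract can be illustrated with a minimal sketch. It computes Information Gain and Gain Ratio for categorical attributes against a class attribute, then buckets the attributes by score. The abstract does not state the cut-off values or which score range maps to which category, so the thresholds (`hi`, `lo`), the rule "higher score means more sensitive", the toy records, and all function names here are assumptions for illustration only, not the authors' method.

```python
import math
from collections import Counter, defaultdict

def entropy(items):
    """Shannon entropy (in bits) of a list of discrete values."""
    total = len(items)
    return -sum((n / total) * math.log2(n / total) for n in Counter(items).values())

def information_gain(values, labels):
    """IG = H(class) - H(class | attribute) for one categorical attribute."""
    groups = defaultdict(list)
    for v, y in zip(values, labels):
        groups[v].append(y)
    conditional = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - conditional

def gain_ratio(values, labels):
    """GR = IG / split information, where split information = H(attribute)."""
    split_info = entropy(values)
    return information_gain(values, labels) / split_info if split_info > 0 else 0.0

def segregate(scores, hi, lo):
    """Assumed rule (not from the paper): score >= hi is SA, >= lo is QI, else NS."""
    return {name: "SA" if s >= hi else "QI" if s >= lo else "NS"
            for name, s in scores.items()}

# Hypothetical toy data: four records, three attributes, one class attribute.
labels = [">50K", ">50K", "<=50K", "<=50K"]
attributes = {
    "attr_a": ["x", "x", "y", "y"],  # fully determines the class
    "attr_b": ["p", "q", "p", "q"],  # independent of the class
    "attr_c": ["m", "m", "m", "n"],  # partially informative
}

ig_scores = {name: information_gain(vals, labels) for name, vals in attributes.items()}
gr_scores = {name: gain_ratio(vals, labels) for name, vals in attributes.items()}
classes = segregate(ig_scores, hi=0.5, lo=0.1)  # illustrative thresholds only
```

A Pearson Correlation ranker would slot into the same `segregate` step once the attributes and the class are numerically encoded; how the three rankers' outputs are combined is not described in the abstract and is left open here.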


