Optimizing PPDM in asynchronous sparse data using random projection

R Raja Kumar,G.V Uma,J Indumathi

doi:10.1109/iri.2008.4583066

Abstract

Privacy is fetching a progressively more imperative issue in several data-mining applications dealing with sensitive data especially in health care, security, financial, behavioral etc., Most of the existing techniques are managing a Secure Two-Party Computation model, where two parties, each having a private database, want to cooperatively conduct data-mining operations on the union of their data. The problem we are pinning down for Privacy Preserving Data Mining(PPDM), is how a data owner can release a version of its confidential data with guarantees that the original sensitive information cannot be re-identified while the analytic properties of the data are preserved. In this paper we work to investigate the leeway of using multiplicative random projection sparse matrices for privacy preserving data in datasets which gets incremented asynchronously over time from various sources. The data stream is asynchronous. This work proposes the use of random projections with a sparse matrix to maintain a sketch of a collection of high-dimensional data-streams that are updated asynchronously. This sketch allows us to estimate L2 (Euclidean) distances and dot products with high accuracy. We have also proposed a conceptual architecture for implementing the privacy preservation techniques especially the Sparse Random Projection Matrix technique in incremental data to improve the level of privacy protection. We have tested to see that the perturbed data still preserves certain statistical characteristics of the data as the original unperturbed data. At this juncture we have proposed a generic projection based sketch for incremental data stream which can be used not only for this application but also can be used for any other applications, which supports incremental data bases. We have traced the origin of PPDM, the definition of privacy preservation in data mining, and the implications of benchmark privacy doctrine in information detection and advocate a few policies for PPDM based on these privacy principles. These are vital for the development and deployment of methodological solutions. This will let vendors and developers to construct unyielding information reuse and integration (IRI) in PPDM. We pursue to capitalize on the reuse of PPDM information by crafting easy, affluent, and reusable knowledge depictions and accordingly investigates tactics for amalgamate this knowledge into heritage systems and make advances in the upcoming of PPDM.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimizing PPDM in asynchronous sparse data using random projection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improved Bounds on the Dot Product under Random Projection and Random Sign Projection
Ata Kaban
-
Ata KabanAta Kaban
10 Aug 2015
10 Aug 2015

Efficient Model for Privacy Preserving Classification Of Data Streams
P Rajendra Prasad, Et Al
Turkish Journal of Computer and Mathematics Education (TURCOMAT) | VOL. 12
P Rajendra Prasad, Et AlP Rajendra Prasad, Et Al
11 Apr 2021
Turkish Journal of Computer and Mathematics Education (TURCOMAT) | VOL. 12

A Comparison of the Effects of K-Anonymity on Machine Learning Algorithms
Hayden Wimmer ... Loreen Powell
International Journal of Advanced Computer Science and Applications | VOL. 5
Hayden Wimmer, et. al.Hayden Wimmer ... Loreen Powell
01 Jan 2014
International Journal of Advanced Computer Science and Applications | VOL. 5

Privacy preservation in data mining using hybrid perturbation methods: an application to bankruptcy prediction in banks
Kunta Ramu ... V Ravi
International Journal of Data Analysis Techniques and Strategies | VOL. 1
Kunta Ramu, et. al.Kunta Ramu ... V Ravi
01 Jan 2009
International Journal of Data Analysis Techniques and Strategies | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimizing PPDM in asynchronous sparse data using random projection

Abstract

Talk to us

Similar Papers