Matrix Decomposition Techniques for Data Privacy

Jun Zhang,Shuting Xu,Jie Wang

doi:10.4018/978-1-60566-010-3.ch185

Abstract

Data mining technologies have now been used in commercial, industrial, and governmental businesses, for various purposes, ranging from increasing profitability to enhancing national security. The widespread applications of data mining technologies have raised concerns about trade secrecy of corporations and privacy of innocent people contained in the datasets collected and used for the data mining purpose. It is necessary that data mining technologies designed for knowledge discovery across corporations and for security purpose towards general population have sufficient privacy awareness to protect the corporate trade secrecy and individual private information. Unfortunately, most standard data mining algorithms are not very efficient in terms of privacy protection, as they were originally developed mainly for commercial applications, in which different organizations collect and own their private databases, and mine their private databases for specific commercial purposes. In the cases of inter-corporation and security data mining applications, data mining algorithms may be applied to datasets containing sensitive or private information. Data warehouse owners and government agencies may potentially have access to many databases collected from different sources and may extract any information from these databases. This potentially unlimited access to data and information raises the fear of possible abuse and promotes the call for privacy protection and due process of law. Privacy-preserving data mining techniques have been developed to address these concerns (Fung et al., 2007; Zhang, & Zhang, 2007). The general goal of the privacy-preserving data mining techniques is defined as to hide sensitive individual data values from the outside world or from unauthorized persons, and simultaneously preserve the underlying data patterns and semantics so that a valid and efficient decision model based on the distorted data can be constructed. In the best scenarios, this new decision model should be equivalent to or even better than the model using the original data from the viewpoint of decision accuracy. There are currently at least two broad classes of approaches to achieving this goal. The first class of approaches attempts to distort the original data values so that the data miners (analysts) have no means (or greatly reduced ability) to derive the original values of the data. The second is to modify the data mining algorithms so that they allow data mining operations on distributed datasets without knowing the exact values of the data or without direct accessing the original datasets. This article only discusses the first class of approaches. Interested readers may consult (Clifton et al., 2003) and the references therein for discussions on distributed data mining approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Matrix Decomposition Techniques for Data Privacy

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Privacy Preserving Data Mining Techniques: Current Scenario and Future Prospects
Majid Bashir Malik ... M Asger Ghazi
-
Majid Bashir Malik, et. al.Majid Bashir Malik ... M Asger Ghazi
01 Nov 2012
01 Nov 2012

Research Progress on Software Engineering Data Mining Technology
Fengxian Deng
-
Fengxian DengFengxian Deng
01 Jan 2015
01 Jan 2015

A Survey: Privacy Preservation Techniques in Data Mining
Amit Ganatra ... Hina Vaghashia
International Journal of Computer Applications | VOL. 119
Amit Ganatra, et. al.Amit Ganatra ... Hina Vaghashia
18 Jun 2015
International Journal of Computer Applications | VOL. 119

Introduction to 3DM: Domain-Oriented Data-Driven Data Mining
Guoyin Wang
-
Guoyin WangGuoyin Wang
17 May 2008
17 May 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Matrix Decomposition Techniques for Data Privacy

Abstract

Talk to us

Similar Papers