Abstract

Online Social Network (OSN) data is often collected by the third parties for various purposes. One of the problems in such practices is how to measure the privacy breach to assure secure users. However, the recent works on privacy estimation are not systematic enough and are mainly focus on the traditional datasets, such as bank data and hospital data. Compared with these closed environments, the open APIs and lower register barriers make OSNs an open environment. Thus the openness of OSN makes more User Generated Content (UGC) like blogs and remarks be achieved easily by adversaries. In this paper, we analyzed the background knowledge in OSNs and proposed a general privacy estimation model facing OSNs data based on linear regression. In particular, our model takes the content knowledge of adversary into consideration. Considered the high dimension of content knowledge, which could cause high computational overhead, we optimized our model by Principal Component Analysis (PCA).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call