Prevent Identity Disclosure Research Articles

Digital health data collection is vital for healthcare and medical research, but the data contain sensitive information about patients, which makes collection challenging. To collect health data without privacy breaches, the data must be secured between the data owner and the collector. Existing data collection studies make overly stringent assumptions, such as relying on a third-party anonymizer or a private channel between the data owner and the collector. The third-party involvement makes these approaches more susceptible to privacy attacks and therefore less applicable to privacy-preserving healthcare data collection. This article proposes a novel privacy-preserving data collection protocol that anonymizes healthcare data without using a third-party anonymizer or a private channel for data transmission. A clustering-based k-anonymity model is adopted to efficiently prevent identity disclosure attacks, and communication between the data owners and the collector is restricted to elected representatives of each equivalent group of data owners. We also identify a privacy attack, known as "leader collusion", in which the elected representatives may collaborate to violate an individual's privacy. We propose solutions for such collusion and for sensitive attribute protection. A greedy heuristic method is devised to efficiently handle data owners who join or leave the anonymization process dynamically. Furthermore, we present the potential privacy attacks on the proposed protocol together with a theoretical analysis. Extensive experiments are conducted on real-world datasets, and the results suggest that our solution outperforms state-of-the-art techniques in terms of privacy protection and computational complexity.
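To make the idea of clustering-based k-anonymity concrete, here is a minimal Python sketch. It is not the protocol from the article: it assumes a single numeric quasi-identifier (age), and the helper names generalize_ages and is_k_anonymous are hypothetical. Records are sorted, cut greedily into clusters of at least k, and each age is replaced by its cluster's range, so every published value is shared by at least k records.

```python
from collections import Counter


def generalize_ages(ages, k):
    """Greedy sketch (assumed, not the article's algorithm):
    sort the ages, cut them into clusters of at least k records,
    and replace each age with the (min, max) range of its cluster."""
    if k < 1 or len(ages) < k:
        raise ValueError("need at least k records")
    # Indices of the records, ordered by age
    order = sorted(range(len(ages)), key=lambda i: ages[i])
    clusters = [order[i:i + k] for i in range(0, len(order), k)]
    # A short trailing cluster is folded into the previous one
    if len(clusters) > 1 and len(clusters[-1]) < k:
        tail = clusters.pop()
        clusters[-1].extend(tail)
    generalized = [None] * len(ages)
    for cluster in clusters:
        low, high = ages[cluster[0]], ages[cluster[-1]]
        for i in cluster:
            generalized[i] = (low, high)
    return generalized


def is_k_anonymous(quasi_ids, k):
    """True if every distinct quasi-identifier value occurs at least k times."""
    return all(count >= k for count in Counter(quasi_ids).values())


ages = [23, 25, 31, 34, 35, 52, 58]
published = generalize_ages(ages, k=3)
print(published)
print(is_k_anonymous(published, 3))
```

With seven records and k = 3, the seventh record cannot form its own cluster, so it joins the second one. Handling leftover records this way is one simple design choice; the article's greedy heuristic for owners who join or leave dynamically may differ.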

The development of several popular social networks and the publication of social network data have led to the risk of leaking sensitive and confidential information about individuals. This requires privacy preservation before the publication of user data available from an Online Social Network (OSN) presence. Numerous algorithms have been proposed for preserving the privacy of social network users' information, such as K-anonymity and L-diversity. Previous work has shown good results based on the concept of adding edges and noise nodes to achieve K-anonymity and L-diversity. K-anonymization techniques are able to prevent identity disclosure of users but are not sufficient to prevent the disclosure of users' sensitive information. In this direction, a number of techniques for preserving the sensitive information of social network users have been proposed. Although these techniques achieve anonymity reasonably well, they also lead to a substantial change in the original structure of the OSNs. In this article, the problems of preventing sensitive attribute disclosure and reducing the number of noisy nodes are addressed by perturbing the sensitive attributes. Existing research uses L-diversity to prevent sensitive attribute disclosure, which results in skewness and similarity attacks. We address skewness attacks by removing duplicate noisy nodes from the final dataset that OSN service providers publish for stakeholders. All the information about duplicate nodes is stored in a table named the Reference Attribute Table (RAT). This table is accessible only to the service providers, for the purpose of de-anonymizing user data. The proposed technique has been extensively evaluated using five metrics, viz. APL, ACSPL, RRTI, number of noisy nodes, and information loss, on four real-time OSN datasets, namely CORA, ARNET, DBLP, and Twitter. The APL and RRTI results show that there is less change in the structure of the datasets after anonymization. The ACSPL results show that the proposed technique is able to preserve sensitive attributes in the datasets. The maximum number of noisy nodes across all four datasets is 5.4% and the maximum information loss is 2.2%. The evaluation results show that the proposed technique ensures privacy preservation with less loss of information and thus preserves the utility of the published data.
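The gap between K-anonymity and sensitive attribute protection mentioned above can be illustrated with a small check for distinct L-diversity. This is a generic sketch, not the technique proposed in the article; the function name is_distinct_l_diverse and the toy records are assumptions made for illustration. Each record is a pair of a generalized quasi-identifier and a sensitive value.

```python
from collections import defaultdict


def is_distinct_l_diverse(records, l):
    """True if every equivalence class (records sharing a quasi-identifier)
    contains at least l distinct sensitive values."""
    sensitive_by_class = defaultdict(set)
    for quasi_id, sensitive in records:
        sensitive_by_class[quasi_id].add(sensitive)
    return all(len(values) >= l for values in sensitive_by_class.values())


# Hypothetical toy data: both classes have two records (2-anonymous),
# but everyone in the first class shares the same sensitive value.
records = [
    ("20-29", "flu"),
    ("20-29", "flu"),
    ("30-39", "flu"),
    ("30-39", "asthma"),
]
print(is_distinct_l_diverse(records, 2))
```

The first class hides who is who, yet an adversary still learns that every member has the same sensitive value. Distinct L-diversity flags this case; it does not by itself address the skewness and similarity attacks the abstract refers to.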

Articles published on Prevent Identity Disclosure

Enhancing Utility in Anonymized Data against the Adversary’s Background Knowledge

An anonymization-based privacy-preserving data collection protocol for digital health data.

Local generalization and bucketization technique for personalized privacy preservation

Social Networks Privacy Preservation: A Novel Framework

Privacy Preservation in Resource-Constrained IoT Devices Using Blockchain—A Survey

Practical anonymity models on protecting private weighted graphs

Preventing Identity Disclosure in Social Networks Using Intersected Node

De-Identification of Health Data in Big Data using a Novel Bio-Inspired Apoptosis Algorithm

Anonymizied Approach to Preserve Privacy of Published Data Through Record Elimination

Protecting Privacy Against Record Linkage Disclosure: A Bounded Swapping Approach for Numeric Data
