Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

Yaling Zhang,Jin Han

doi:10.1371/journal.pone.0248737

Yaling Zhang, Jin Han

Open Access

PDF Available

https://doi.org/10.1371/journal.pone.0248737

Copy DOI

Export

Save

Cite

Journal: PLOS ONE	Publication Date: Mar 23, 2021
Citations: 10	License type: CC BY 4.0

Affiliation: Xi'an University of Technology

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user’s individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity.

Highlights

Data mining is used to extract some potentially useful information from a large amount of valid information
In order to solve the above problems, this paper proposes a privacy budget allocation method based on the gaussian kernel function and applies it to the fuzzy Cmeans algorithm to ensure the availability of clustered data while solving the problem of privacy leakage
The core idea of this algorithm is that in the iteration of fuzzy C-means clustering, the privacy budget allocation method based on gaussian weight is adopted to realize differential privacy protection for each cluster center point

Summary

Introduction

Data mining is used to extract some potentially useful information from a large amount of valid information. In order to solve the above problems, this paper proposes a privacy budget allocation method based on the gaussian kernel function and applies it to the fuzzy Cmeans algorithm to ensure the availability of clustered data while solving the problem of privacy leakage. It provides a theoretical guarantee for users to use fuzzy C-means, which can promote the great research and wide application of fuzzy C-means in academic and industry. The fuzzy C-means algorithm main steps are: Input: dataset D 1⁄4 fxigni1⁄41, k Output: U and C

1: U is randomly initialized

17: Cbest Ct 18: return Cbest

Experimental setup

Evaluation metrics

Experimental results and analysis

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Child Health Dataset Publishing and Mining Based on Differential Privacy Preservation
Wenyu Li ... Siqi Wang
Mathematics | VOL. 12
Wenyu Li, et. al.Wenyu Li ... Siqi Wang
12 Aug 2024
Mathematics | VOL. 12

Low-cohesion differential privacy protection for industrial Internet
Jun Hou ... Sainan Zhang
The Journal of Supercomputing | VOL. 76
Jun Hou, et. al.Jun Hou ... Sainan Zhang
01 Jan 2020
The Journal of Supercomputing | VOL. 76

Differential Privacy Algorithm for Integrated Energy System Based on Improved K-means
Zhengquan Lv ... Yijun Chen
-
Zhengquan Lv, et. al.Zhengquan Lv ... Yijun Chen
17 Sep 2021
17 Sep 2021

A Differential Privacy Support Vector Machine Classifier Based on Dual Variable Perturbation
Yaling Zhang ... Shangping Wang
IEEE Access | VOL. 7
Yaling Zhang, et. al.Yaling Zhang ... Shangping Wang
01 Jan 2019
IEEE Access | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLOS ONE