Multi-Party Verifiable Privacy-Preserving Federated k-Means Clustering in Outsourced Environment

Ruiqi Hou,Fei Tang,Guowei Ling,Shikai Liang

doi:10.1155/2021/3630312

Abstract

As a commonly used algorithm in data mining, clustering has been widely applied in many fields, such as machine learning, information retrieval, and pattern recognition. In reality, data to be analyzed are often distributed to multiple parties. Moreover, the rapidly increasing data volume puts heavy computing pressure on data owners. Thus, data owners tend to outsource their own data to cloud servers and obtain data analysis results for the federated data. However, the existing privacy-preserving outsourced k -means schemes cannot verify whether participants share consistent data. Considering the scenarios with multiple data owners and sensitive information security in an outsourced environment, we propose a verifiable privacy-preserving federated k -means clustering scheme. In this article, cloud servers and participants perform k -means clustering algorithm over encrypted data without exposing private data and intermediate results in each iteration. In particular, our scheme can verify the shares from participants when updating the cluster centers based on secret sharing, hash function and blockchain, so that our scheme can resist inconsistent share attacks by malicious participants. Finally, the security and experimental analysis are carried out to show that our scheme can protect private data and get high-accuracy clustering results.

Highlights

Data mining technology can be used to analyze and extract potentially valuable information from large collections of data
As a wellknown clustering algorithm, k-means clustering [3] algorithm has the advantages of simple process and good clustering results and it can assign data into k clusters based on the distances from cluster centers
We propose a multi-party verifiable privacy-preserving federated k-means scheme for horizontally partitioned data

Summary

Introduction

Data mining technology can be used to analyze and extract potentially valuable information from large collections of data. Vaidya and Clifton [10] firstly proposed the multi-party privacy-preserving k-means clustering protocol on vertically partitioned data, where the secure distance computation and comparison are supported by the secure permutation scheme and homomorphic encryption. Liu et al [23], following the framework in [24], presented a privacy-preserving outsourced k-means clustering protocol that one party outsourced the distance computation to a cloud server without revealing both the data and clustering results to any party and cloud server. Jiang et al [25] introduced an efficient two-party privacy-preserving k-means clustering protocol, and this scheme can compute distance safely using subprotocols in [26] and update cluster centers using garbled circuit proposed in [27]. We propose a multi-party verifiable privacy-preserving federated k-means scheme for horizontally partitioned data.

Preliminaries

Participants

Our Construction

Step 1

Step 2

Step 3

Security Analysis

Performance Analysis

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Security and Communication Networks	Publication Date: Dec 28, 2021
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multi-Party Verifiable Privacy-Preserving Federated k-Means Clustering in Outsourced Environment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Security and Communication Networks

Lead the way for us

Similar Papers

Review of the state-of-the-art methods for Privacy Preserved Classification in Outsourced Environment
Vijayendra Sanjay Gaikwad ... V M Thakare
-
Vijayendra Sanjay Gaikwad, et. al.Vijayendra Sanjay Gaikwad ... V M Thakare
01 Feb 2020
01 Feb 2020

An Extensive Review and Possible Attack on the Privacy Preserving Ranked Multi-Keyword Search for Multiple Data Owners in Cloud Computing
N Deepa ... P Vijayakumar
-
N Deepa, et. al.N Deepa ... P Vijayakumar
01 Nov 2017
01 Nov 2017

Secure access of multiple keywords over encrypted data in cloud environment using ECC-PKI and ECC ElGamal
Sourabh Prakash ... Nitish Andola
-
Sourabh Prakash, et. al.Sourabh Prakash ... Nitish Andola
01 Nov 2017
01 Nov 2017

Asymptotically Optimal and Secure Multiwriter/Multireader Similarity Search
Hyunsoo Kwon ... Changhee Hahn
IEEE Access | VOL. 10
Hyunsoo Kwon, et. al.Hyunsoo Kwon ... Changhee Hahn
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Party Verifiable Privacy-Preserving Federated k-Means Clustering in Outsourced Environment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Security and Communication Networks