Abstract

Data mining entails the discovery of unexpected but reusable knowledge from large unorganized datasets. Among the many available data-mining algorithms, association rule mining (ARM) is very common. It was developed to aggregate all data into one site and subsequently mine them. In recent years, organizations in different fields have been required to collaborate to create new value. However, data mining among and within organizations has raised privacy and confidentiality concerns. In our scheme, parties cannot share anything other than the number of records, including the candidate itemset. This study focuses on the private-set intersection instead of the scalar product and shows that this intersection enables organizations to execute ARM on vertically partitioned data, allowing flexible information sharing while preserving privacy without increasing communication and computation costs. Furthermore, we focus on the fact that the number of protocol rounds among parties can be reduced and present three use cases in which the proposed scheme works more effectively than the existing schemes.

Highlights

  • The ubiquity of internet-of-things (IoT) devices and the people who use them has generated tremendous amounts of data worldwide

  • This study shows that the private-set intersection enables the execution of Association rule mining (ARM) on vertically partitioned data without changing communication and computation costs

  • 1: Distro stands for distribution. 2: frequent itemset mining (FIM) means Frequent Itemset Mining. 3: ARM means Association Rule Mining

Read more

Summary

INTRODUCTION

The ubiquity of internet-of-things (IoT) devices and the people who use them has generated tremendous amounts of data worldwide. Several schemes have been proposed to implement mining in distributed data environments These schemes have been broadly separated into those that use secure multi-party computation (SMC) and those that use cryptographic techniques. Many PPARM schemes [16,17,18,19,20,21,22,23,24,25] have been proposed to find frequent attribute sets within vertically distributed datasets without revealing private data. This study proposes a scheme in which ARM can be performed only by the data owners without outsourcing data storage and mining. The proposed ARM scheme (i.e., VC02) uses a scalar product for information sharing among parties [16]. This study shows that the private-set intersection enables the execution of ARM on vertically partitioned data without changing communication and computation costs

A COMPARISON OF MAIN FEATURES IN RELATED PPARM SCHEMES
Mining Results
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call