Privacy Preserving Data Mining Using Random Decision Tree Over Partition Data: Survey

Nashwan Adnan Othman,Mustafa Zuhaer Nayef Al-Dabagh

doi:10.1051/itmconf/20224201010

Abstract

The development of data mining with data protection and data utility can manage distributed data efficiently. This paper revisits the concepts and techniques of privacy-preserving Random Decision Tree (RDT). In existing systems, cryptography-based techniques are effective at managing distributed information. Privacy-preserving RDT handles distributed information efficiently. Privacy-preserving RDT gives better precision data mining while preserving information and reducing the calculation time. This paper deals with this headway in privacy-preserving data mining technology utilizing emphasized approach of RDT. RDT gives preferable productivity and information privacy than cryptographic technique. Various data mining tasks utilize RDT, like classification, relapse, ranking, and different classifications. Privacy-preserving RDT utilizes both randomization and the cryptographic method, giving information privacy for some decision tree-based learning tasks; this is an effective technique for data mining with privacy-preserving distributed information. Thus, in horizontal partitioning of the dataset, parties gather information for various entities but have data for all attributes. On the other hand, various associations may gather different data about a similar set of people. Thus, in vertically partitioned data, all parties gather data for the same collection of items. In all of these cases, both horizontal and vertical partitioning of datasets is somewhat inaccurate.

Highlights

Data Mining finds exciting data patterns, and insights from extensive databases
Information privacy for various associations is paramount to expand their business since almost all organizations must share data without compromising privacy
This paper looks at randomization and cryptographic methods applied to sensitive information

Summary

Introduction

There are two phases in privacy-preserving data mining, the first is information collection, and the second is information publishing. Define a tree by randomly selecting a feature without utilizing any training information. RDT gives a better answer for the distributed data mining in concepts of privacypreserving because of these reasons; random formation of the tree gives more security because to get prior information, one should find the entire classification model and cases. Its proficiency is its ability to maintain privacy and accuracy yet lessen computation time compared to existing algorithms. It uses an Iterative Dichotomiser 3 (ID3) and Boosting algorithm within an RDT, including a privacy-preserving algorithm. Classification can be defined as storing information with similar features in the same class

Random Decision Trees Definition

Random Decision Trees Architecture

Privacy-Preserving Data Mining

Vertically Partition Data

Horizontally Partition Data

Privacy-preserving Random Decision Tree algorithm

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Privacy Preserving Data Mining Using Random Decision Tree Over Partition Data: Survey

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences

Lead the way for us

Journal: ITM Web of Conferences	Publication Date: Jan 1, 2022
License type: CC BY 4.0

Similar Papers

Privacy Preserving Association Rule Mining in Vertically Partitioned Databases
K Sandhya Rani ... N V.Muthulakshmi
International Journal of Computer Applications | VOL. 39
K Sandhya Rani, et. al.K Sandhya Rani ... N V.Muthulakshmi
29 Feb 2012
International Journal of Computer Applications | VOL. 39

Data privacy in knowledge discovery
...
-
, et. al. ...
01 Jan 2009
01 Jan 2009

Fuzzy random decision tree (FRDT) framework for privacy preserving data mining
L Sumalatha ... P Uma Sankar
-
L Sumalatha, et. al.L Sumalatha ... P Uma Sankar
01 Jul 2016
01 Jul 2016

Communication-Efficient Hybrid Federated Learning for E-Health With Horizontal and Vertical Data Partitioning.
Hai Zhao ... Shiqiang Wang
IEEE transactions on neural networks and learning systems | VOL. PP
Hai Zhao, et. al.Hai Zhao ... Shiqiang Wang
01 Jan 2024
IEEE transactions on neural networks and learning systems | VOL. PP

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Privacy Preserving Data Mining Using Random Decision Tree Over Partition Data: Survey

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences