Differential privacy based classification model for mining medical data stream using adaptive random forest

Hayder K Fatlawi,Attila Kiss

doi:10.2478/ausi-2021-0001

Abstract

Abstract Most typical data mining techniques are developed based on training the batch data which makes the task of mining the data stream represent a significant challenge. On the other hand, providing a mechanism to perform data mining operations without revealing the patient’s identity has increasing importance in the data mining field. In this work, a classification model with differential privacy is proposed for mining the medical data stream using Adaptive Random Forest (ARF). The experimental results of applying the proposed model on four medical datasets show that ARF mostly has a more stable performance over the other six techniques.

Highlights

A series of researches and projects in medical science, and information technology (IT) are starting a relationship between the healthcare industry and the IT industry that rapidly leads to a better and interactive relation among patients, their doctors, and health institutions
Zhang et al [22] used two mechanisms of noise: Laplace and exponential for providing privacy. They utilized lower noise sensitivity to avoid a high impact on split point choosing. They applied the proposed model on only one dataset, and the results showed more stability in classification accuracy compared with three other algorithms
Stream data faces many constraints as follow: (1) infinite arrival of data samples make storing them impossible, (2) the fast arrival of data samples requires dealing with each sample in real-time, (3) the possibility of changing items’ distribution overtime in which the old data would be useless for the current status

Summary

Introduction

A series of researches and projects in medical science, and information technology (IT) are starting a relationship between the healthcare industry and the IT industry that rapidly leads to a better and interactive relation among patients, their doctors, and health institutions. Computing Classification System 1998: H.2.8, I.2.1 Mathematics Subject Classification 2010: 68P25, 97R40 Key words and phrases: ensemble methods, bagging, privacy-preserving protocol. One of the most remarkable challenges facing data mining is privacy preservation. Privacy is an important component of medical data processing, as many health institutions refrain from providing this data to the public, due to the fear of compromising patient privacy. Providing a mechanism to carry out data mining operations, without revealing the patient’s identity has recently taken place in the interest of researchers

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Acta Universitatis Sapientiae, Informatica	Publication Date: Jun 1, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Differential privacy based classification model for mining medical data stream using adaptive random forest

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Acta Universitatis Sapientiae, Informatica

Lead the way for us

Similar Papers

Improving the Efficiency of Ensemble Classifier Adaptive Random Forest with Meta Level Learning for Real-Time Data Streams
Monika Arya ... Chaitali Choudhary
-
Monika Arya, et. al.Monika Arya ... Chaitali Choudhary
01 Jan 2020
01 Jan 2020

Adaptive random forests for evolving data stream classification
Heitor M Gomes ... Talel Abdessalem
Machine Learning | VOL. 106
Heitor M Gomes, et. al.Heitor M Gomes ... Talel Abdessalem
13 Jun 2017
Machine Learning | VOL. 106

Recurring concept meta-learning for evolving data streams
Robert Anderson ... Albert Bifet
Expert Systems with Applications | VOL. 138
Robert Anderson, et. al.Robert Anderson ... Albert Bifet
20 Jul 2019
Expert Systems with Applications | VOL. 138

Measuring the Effectiveness of Adaptive Random Forest for Handling Concept Drift in Big Data Streams.
Abdulaziz O Alqabbany ... Aqil M Azmi
Entropy | VOL. 23
Abdulaziz O Alqabbany, et. al.Abdulaziz O Alqabbany ... Aqil M Azmi
04 Jul 2021
Entropy | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Differential privacy based classification model for mining medical data stream using adaptive random forest

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Acta Universitatis Sapientiae, Informatica