Privacy-Preserving Data Mining of Medical Data Using Data Separation-Based Techniques

Gang Kou, Yi Peng, Yong Shi, Zhengxin Chen

doi:10.2481/dsj.6.s429

Abstract

The CODATA Data Science Journal is a peer-reviewed, open access, electronic journal that publishes papers on the management, dissemination, use and reuse of research data and databases across all research domains, including science, technology, the humanities and the arts. The scope of the journal includes descriptions of data systems, their implementations and their publication, applications, infrastructures, software, legal, reproducibility and transparency issues, and the availability and usability of complex datasets, with a particular focus on the principles, policies and practices for open data. All data is in scope, whether born digital or converted from other sources.

Highlights

Data mining or knowledge discovery in databases (KDD), which focuses on the extraction of useful knowledge from large amounts of data, has steadily attracted researchers and practitioners from various fields.
Privacy preservation is an important issue in medical data mining.
This paper investigates data separation techniques in medical data classification.

Summary

INTRODUCTION

Data mining or knowledge discovery in databases (KDD), which focuses on the extraction of useful knowledge from large amounts of data, has steadily attracted researchers and practitioners from various fields. Privacy issues have been raised since as early as 1989, when the first KDD workshop was held in Detroit, Michigan. Privacy is an especially important issue in medical data mining. The objective of this paper is to apply data separation-based techniques to preserve privacy in the classification of medical data. In the vertical partition approach, each site uses a portion of the attributes to compute its results, and the distributed results are assembled at a central trusted party using a majority-vote ensemble method. In the horizontal partition approach, each site computes results on its own data, and a central trusted party is responsible for integrating these results. We implement these two approaches using two medical datasets from the UCI Machine Learning Repository: the Wisconsin prognostic breast cancer dataset and the heart-disease dataset. The section explains why and how we use vertical and horizontal separation techniques to protect the privacy of medical data.
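The two separation schemes described above can be illustrated with a minimal sketch. It is not the authors' implementation: the nearest-centroid classifier, the function names, and the toy records are all assumptions made for illustration, and the sketch further assumes that class labels are available to every site in the vertical case and that the horizontal results are also integrated by majority vote (the text specifies majority voting only for the vertical approach). The point it shows is that the central trusted party receives only predicted labels, never the raw attributes held at each site.

```python
from collections import Counter
from statistics import mean


def train_centroids(rows, labels):
    """Fit a nearest-centroid model: one mean vector per class label."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    return {label: [mean(col) for col in zip(*members)]
            for label, members in by_class.items()}


def predict_centroid(model, row):
    """Return the label whose centroid is closest (squared Euclidean)."""
    def dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(row, centroid))
    return min(model, key=lambda label: dist(model[label]))


def project(rows, columns):
    """Keep only the attribute columns that a given site is allowed to see."""
    return [[row[c] for c in columns] for row in rows]


def majority(votes):
    """Majority vote; ties go to the label that was voted first."""
    return Counter(votes).most_common(1)[0][0]


def vertical_partition_vote(train_rows, train_labels, test_rows, site_columns):
    """Each site trains on its own attribute subset; only labels are shared."""
    site_predictions = []
    for columns in site_columns:
        model = train_centroids(project(train_rows, columns), train_labels)
        site_predictions.append(
            [predict_centroid(model, r) for r in project(test_rows, columns)])
    # The central trusted party combines one vote per site for each record.
    return [majority(votes) for votes in zip(*site_predictions)]


def horizontal_partition_vote(site_data, test_rows):
    """Each site trains on its own records (all attributes); votes are merged.

    Assumption: integration by majority vote, as in the vertical case.
    """
    site_predictions = []
    for rows, labels in site_data:
        model = train_centroids(rows, labels)
        site_predictions.append([predict_centroid(model, r) for r in test_rows])
    return [majority(votes) for votes in zip(*site_predictions)]


# Hypothetical toy records with three numeric attributes (not real patient data).
train_rows = [[1.0, 1.2, 0.9], [0.8, 1.0, 1.1], [5.0, 4.8, 5.2], [5.2, 5.1, 4.9]]
train_labels = ["benign", "benign", "malignant", "malignant"]
test_rows = [[1.1, 0.9, 1.0], [4.9, 5.0, 5.1]]

# Vertical: three sites, each holding a single attribute column.
vertical_result = vertical_partition_vote(
    train_rows, train_labels, test_rows, site_columns=[[0], [1], [2]])

# Horizontal: three sites, each holding its own records with all attributes.
site_data = [
    ([[1.0, 1.2, 0.9], [5.0, 4.8, 5.2]], ["benign", "malignant"]),
    ([[0.8, 1.0, 1.1], [5.2, 5.1, 4.9]], ["benign", "malignant"]),
    ([[1.1, 1.1, 1.0], [4.8, 5.0, 5.0]], ["benign", "malignant"]),
]
horizontal_result = horizontal_partition_vote(site_data, test_rows)
```

Using an odd number of sites with two classes avoids tied votes; with an even number of sites, a tie-breaking rule would have to be chosen explicitly.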

PRIVACY-PRESERVING MEDICAL DATA MINING

Horizontal Data Separation Experiment

Findings

CONCLUSION


Journal: Data Science Journal	Publication Date: Jan 1, 2007
Citations: 18	License type: CC BY 4.0
