Abstract

As big data analysis becomes one of the main driving forces for productivity and economic growth, concern about individual privacy disclosure increases as well, especially for applications that access medical or health data containing personal information. Most contemporary techniques for privacy-preserving data publishing rest on a simple assumption: the data of concern are complete, i.e., contain no missing values. This is rarely the case in the real world. This paper presents our efforts to inspect the effect of missing values on medical data privacy. In particular, we examined the US FAERS dataset, a public dataset of adverse drug events released by the US FDA. Following the presumption of the current anonymization paradigm, namely that the data contain no missing values, we investigated three intuitive strategies for anonymizing the FAERS dataset: including missing values, excluding missing values, and executing imputation. Our results demonstrate the inadequacy of these intuitive strategies in handling data with a massive amount of missing values. Accordingly, we propose a new strategy, consolidation, together with a corresponding privacy protection model and anonymization algorithm. Experimental results show that our method can prevent privacy disclosure while sustaining data utility for adverse drug reaction (ADR) signal detection.
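
To make the three intuitive strategies concrete, the following is a minimal Python/pandas sketch on a hypothetical toy table; the column names and values are illustrative assumptions, not the actual FAERS schema or the paper's consolidation algorithm.

import pandas as pd

# Toy records with quasi-identifiers (age, sex) and missing values,
# mimicking the sparsity found in adverse-event reports.
df = pd.DataFrame({
    "age":  [34, None, 51, None, 42],
    "sex":  ["F", "M", None, "F", None],
    "drug": ["D1", "D2", "D1", "D3", "D2"],
})

# Strategy 1: exclude records containing missing values before anonymizing.
excluded = df.dropna(subset=["age", "sex"])

# Strategy 2: include missing values, treating "missing" as its own
# category so such records can still be grouped during anonymization.
included = df.fillna({"age": -1, "sex": "UNKNOWN"})

# Strategy 3: impute missing values from the observed data
# (here, the mean age and the most frequent sex).
imputed = df.copy()
imputed["age"] = imputed["age"].fillna(imputed["age"].mean())
imputed["sex"] = imputed["sex"].fillna(imputed["sex"].mode()[0])

print(excluded, included, imputed, sep="\n\n")

Exclusion discards a large share of records when missingness is pervasive, while inclusion and imputation distort the groups on which anonymization operates; these trade-offs motivate the consolidation strategy proposed in the paper.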
