Generation of Surrogates for De-Identification of Electronic Health Records.

Aipeng Chen,Chandini Nekkantti,Jitendra Jonnagaddala,Siaw‐Teng Liaw

doi:10.3233/shti190185

Abstract

Unstructured electronic health records are valuable resources for research. Before they are shared with researchers, protected health information needs to be removed from these unstructured documents to protect patient privacy. The main steps involved in removing protected health information are accurately identifying sensitive information in the documents and removing the identified information. To keep the documents as realistic as possible, the step of omitting sensitive information is often followed by replacement of identified sensitive information with surrogates. In this study, we present an algorithm to generate surrogates for unstructured electronic health records. We used this algorithm to generate realistic surrogates on a Health Science Alliance corpus, which is constructed specifically for the use of development of automated de-identification systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generation of Surrogates for De-Identification of Electronic Health Records.

Abstract

Talk to us

Similar Papers

More From: Studies in health technology and informatics

Lead the way for us

Journal: Studies in health technology and informatics	Publication Date: Aug 22, 2019
Citations: 4

Similar Papers

Electronic Health Records: Promises and Realities: Part III: Information Privacy and Accuracy: Zero and GIGO Won't Do
William B Millard
Annals of Emergency Medicine | VOL. 56
William B MillardWilliam B Millard
22 Sep 2010
Annals of Emergency Medicine | VOL. 56

An Empirical Test of GRUs and Deep Contextualized Word Representations on De-Identification.
Kahyun Lee ... Özlem Uzuner
Studies in health technology and informatics | VOL. 264
Kahyun Lee, et. al.Kahyun Lee ... Özlem Uzuner
21 Aug 2019
Studies in health technology and informatics | VOL. 264

An Extensible De-Identification Framework for Privacy Protection of Unstructured Health Information: Creating Sustainable Privacy Infrastructures.
Stefano Braghin ... Spiros Antonatos
Studies in health technology and informatics | VOL. 264
Stefano Braghin, et. al.Stefano Braghin ... Spiros Antonatos
21 Aug 2019
Studies in health technology and informatics | VOL. 264

Impact of De-Identification on Clinical Text Classification Using Traditional and Deep Learning Classifiers.
...
Studies in health technology and informatics | VOL. 264
, et. al. ...
21 Aug 2019
Studies in health technology and informatics | VOL. 264

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generation of Surrogates for De-Identification of Electronic Health Records.

Abstract

Talk to us

Similar Papers

More From: Studies in health technology and informatics