Carrell et al. Respond to "Observational Research and the EHR"

D S Carrell,J Chubak,S Halgrim,D S M Buist,D.-T Tran,G Savova,W W Chapman

doi:10.1093/aje/kwt444

Abstract

We appreciate Dr. Chute's thoughtful comments and perspectives on the broader landscape of clinical informatics (1). Three points he makes regarding clinical natural language processing (NLP) of electronic health record (EHR) data have important implications for future research. First, the availability of clinical text for secondary use is a critical first step toward realizing its potential. As Dr. Chute notes, we are witnessing a transition from a past constrained by data to a future that may be overwhelmed by it. Yet in most institutions, researcher access to EHR text lags far behind access to structured data. Barriers to accessing clinical text tend to be less about technical challenges than about perceptions—we would argue, misperceptions—of risks to patient privacy once text is made available outside the EHR. Correcting potential misperceptions is a critical first step in improving access. Difficulties in accessing EHR text across multiple institutions is an additional challenge. Health information exchange addresses this issue from the perspective of care delivery but does not necessarily address aggregating the clinical narrative for research. Longitudinally incomplete records, as Dr. Chute points out, retard algorithm performance. Our own work illustrates this. Four of our NLP algorithm's 5 case identification failures were due to the absence of electronic versions of relevant chart notes. Overcoming access challenges would benefit observational investigations from small to large. Consider, for example, the potential impact on postmarket drug safety surveillance through the Food and Drug Administration's Sentinel Initiative (2, 3) if structured data from the more than 130 million covered lives from 18 organizations could be augmented with text-based EHR information. Second, advancing clinical NLP methods will benefit from open source, sharable resources and software. Dr. Chute mentions the important contributions made by the Strategic Health IT [information technology] Advanced Research Projects (SHARP). Like others, we have benefitted from SHARP-produced tools, including software modules that detect when concepts of interest are qualified by negation or uncertainty—essential details for interpreting cancer screening and diagnosis narratives. Indeed, in the spirit of advancing portability and reproducibility, we have contributed to the Apache cTAKES (4) NLP system. Also essential are sharable samples of clinical text that have been annotated for syntactic and semantic content. Availability of such annotated text in the general domain since the 1990s (5) propelled NLP far beyond the current state of clinical NLP. Achieving parity in the clinical domain, particularly in the application of powerful machine learning approaches, will require additional sharable resources. To accelerate the broad use of clinical NLP technologies, developers should also strive to make their systems more easily portable, including to settings where informatics expertise is limited, thereby extending their potential impact. Third, protecting patient privacy is critically important when using EHR data for research. We agree with Dr. Chute that text deidentification will play an important role in facilitating access while protecting patient privacy. Methods to address the trace amounts of residual identifiers that automated deidentification tools overlook are needed (6). That said, machine processing has inherent advantages over manual abstraction. Though automated systems may “touch” far greater quantities of clinical text, they entail substantially less exposure of patient information than manual abstraction because relatively little human review is needed. Institutional review boards appreciate this. Indeed, the Group Health (Seattle, Washington) institutional review board welcomed the adoption of NLP methods and the construction of a research database for EHR text because they were convinced this would reduce threats to patient privacy. We also point out that, once text-based information has been locally extracted, it can be normalized to standardized coding schemes, free of direct patient identifiers, allowing it to be more easily sharable.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Carrell et al. Respond to "Observational Research and the EHR"

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology

Lead the way for us

Journal: American Journal of Epidemiology	Publication Date: Jan 30, 2014
Citations: 3

Similar Papers

Development and assessment of a natural language processing model to identify residential instability in electronic health records' unstructured data: a comparison of 3 integrated healthcare delivery systems.
Elham Hatef ... Christopher Rouillard
JAMIA open | VOL. 5
Elham Hatef, et. al.Elham Hatef ... Christopher Rouillard
07 Jan 2022
JAMIA open | VOL. 5

Natural Language Processing Basics.
Naveen Arivazhagan ... Tielman T Van Vleck
Clinical Journal of the American Society of Nephrology | VOL. 18
Naveen Arivazhagan, et. al.Naveen Arivazhagan ... Tielman T Van Vleck
08 Feb 2023
Clinical Journal of the American Society of Nephrology | VOL. 18

Natural Language Processing Improves Identification of Colorectal Cancer Testing in the Electronic Medical Record
Joshua C Denny ... Lisa Bastarache
Medical Decision Making | VOL. 32
Joshua C Denny, et. al.Joshua C Denny ... Lisa Bastarache
10 Mar 2011
Medical Decision Making | VOL. 32

Applications of Natural Language Processing in Clinical Research and Practice
Yanshan Wang ... Ahmad Tafti
-
Yanshan Wang, et. al.Yanshan Wang ... Ahmad Tafti
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Carrell et al. Respond to "Observational Research and the EHR"

Abstract

Talk to us

Similar Papers

More From: American Journal of Epidemiology