The Food and Drug Administration Biologics Effectiveness and Safety Initiative Facilitates Detection of Vaccine Administrations From Unstructured Data in Medical Records Through Natural Language Processing.

Matthew Deady,Jeno Pizarro,Douglas Billings,Amalia A Plotogea,Kerry Cook,Patrick Saunders-Hastings,Hussein Ezzeldin,Barbee I Whitaker,Steven A Anderson,Artur Belov

doi:10.3389/fdgth.2021.777905

Matthew Deady, Jeno Pizarro + Show 8 more

Open Access

PDF Available

https://doi.org/10.3389/fdgth.2021.777905

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Introduction: The Food and Drug Administration Center for Biologics Evaluation and Research conducts post-market surveillance of biologic products to ensure their safety and effectiveness. Studies have found that common vaccine exposures may be missing from structured data elements of electronic health records (EHRs), instead being captured in clinical notes. This impacts monitoring of adverse events following immunizations (AEFIs). For example, COVID-19 vaccines have been regularly administered outside of traditional medical settings. We developed a natural language processing (NLP) algorithm to mine unstructured clinical notes for vaccinations not captured in structured EHR data.Methods: A random sample of 1,000 influenza vaccine administrations, representing 995 unique patients, was extracted from a large U.S. EHR database. NLP techniques were used to detect administrations from the clinical notes in the training dataset [80% (N = 797) of patients]. The algorithm was applied to the validation dataset [20% (N = 198) of patients] to assess performance. Full medical charts for 28 randomly selected administration events in the validation dataset were reviewed by clinicians. The NLP algorithm was then applied across the entire dataset (N = 995) to quantify the number of additional events identified.Results: A total of 3,199 administrations were identified in the structured data and clinical notes combined. Of these, 2,740 (85.7%) were identified in the structured data, while the NLP algorithm identified 1,183 (37.0%) administrations in clinical notes; 459 were not also captured in the structured data. This represents a 16.8% increase in the identification of vaccine administrations compared to using structured data alone. The validation of 28 vaccine administrations confirmed 27 (96.4%) as “definite” vaccine administrations; 18 (64.3%) had evidence of a vaccination event in the structured data, while 10 (35.7%) were found solely in the unstructured notes.Discussion: We demonstrated the utility of an NLP algorithm to identify vaccine administrations not captured in structured EHR data. NLP techniques have the potential to improve detection of vaccine administrations not otherwise reported without increasing the analysis burden on physicians or practitioners. Future applications could include refining estimates of vaccine coverage and detecting other exposures, population characteristics, and outcomes not reliably captured in structured EHR data.

Highlights

The Food and Drug Administration Center for Biologics Evaluation and Research conducts post-market surveillance of biologic products to ensure their safety and effectiveness
We developed and applied a simple natural language processing (NLP) algorithm to extract vaccine administration data from the clinical notes in a large academic health system’s electronic health records (EHRs)
A rapid validation exercise of 28 NLP-identified vaccine administration events produced a positive predictive value (PPV) of 96.4% (Wilson 95% Wilson confidence intervals (95% CIs) range: 82.3–99.4%), validating the utility of the NLP algorithm to accurately detect vaccine administrations in the unstructured clinical data with minimal false positives

Summary

Introduction

The Food and Drug Administration Center for Biologics Evaluation and Research conducts post-market surveillance of biologic products to ensure their safety and effectiveness. Studies have found that common vaccine exposures may be missing from structured data elements of electronic health records (EHRs), instead being captured in clinical notes. This impacts monitoring of adverse events following immunizations (AEFIs). Vaccines are rigorously evaluated for safety prior to licensure, there is the possibility of adverse events following immunizations (AEFIs) occurring with exposure post-licensure in a larger general population, compared to limited exposure in the pre-licensure clinical trials. Rigorous post-market vaccine safety surveillance studies that are powered to detect rare AEs are needed to address some of the limitations of randomized clinical trials [7, 8]

Objectives

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Digital Health	Publication Date: Dec 22, 2021
Citations: 5	License type: CC BY 4.0

R Discovery Prime

The Food and Drug Administration Biologics Effectiveness and Safety Initiative Facilitates Detection of Vaccine Administrations From Unstructured Data in Medical Records Through Natural Language Processing.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Frontiers in Digital Health

Lead the way for us

Similar Papers

Table_1.docx
-
-
--
22 Dec 2021
22 Dec 2021

The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification.
Hadi Kharrazi ... Cynthia M Boyd
Journal of the American Geriatrics Society | VOL. 66
Hadi Kharrazi, et. al.Hadi Kharrazi ... Cynthia M Boyd
04 Jul 2018
Journal of the American Geriatrics Society | VOL. 66

Development and validation of natural language processing (NLP) algorithm for detection of distant versus local breast cancer recurrence and metastatic site.
Yasmin Karimi ... Douglas W Blayney
Journal of Clinical Oncology | VOL. 38
Yasmin Karimi, et. al.Yasmin Karimi ... Douglas W Blayney
20 May 2020
Journal of Clinical Oncology | VOL. 38

Identifying Diabetes Related-Complications in a Real-World Free-Text Electronic Medical Records in Hebrew Using Natural Language Processing Techniques.
Ariel Israel ... Shlomo Vinker
Journal of diabetes science and technology | VOL. -
Ariel Israel, et. al.Ariel Israel ... Shlomo Vinker
30 Jan 2024
Journal of diabetes science and technology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

The Food and Drug Administration Biologics Effectiveness and Safety Initiative Facilitates Detection of Vaccine Administrations From Unstructured Data in Medical Records Through Natural Language Processing.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Frontiers in Digital Health