Abstract

Objective: To develop deep learning models to recognize ophthalmic examination components from clinical notes in electronic health records (EHR) using a weak supervision approach.

Methods: A corpus of 39,099 ophthalmology notes weakly labeled for 24 examination entities was assembled from the EHR of one academic center. Four pre-trained transformer-based language models (DistilBert, BioBert, BlueBert, and ClinicalBert) were fine-tuned for this named entity recognition task and compared to a baseline regular expression model. Models were evaluated on the weakly labeled test dataset, a human-labeled sample of that set, and a human-labeled independent dataset.

Results: On the weakly labeled test set, all transformer-based models had recall > 0.93, with precision varying from 0.815 to 0.843; the baseline model had lower recall (0.769) and precision (0.682). On the human-annotated sample, the baseline model had high recall (0.962, 95% CI 0.955–0.067) with variable precision across entities (0.081–0.999), whereas the Bert models had recall ranging from 0.771 to 0.831 and precision ≥ 0.973. On the independent dataset, precision was 0.926 and recall 0.458 for BlueBert; the baseline model had better recall (0.708, 95% CI 0.674–0.738) but worse precision (0.399, 95% CI 0.352–0.451).

Conclusion: We developed the first deep learning system to recognize eye examination components from clinical notes, leveraging a novel opportunity for weak supervision. Transformer-based models had high precision on human-annotated labels, whereas the baseline model had poor precision but higher recall. This system may be used to improve cohort and feature identification using free-text notes. Our weakly supervised approach may help amass large datasets of domain-specific entities from EHRs in many fields.
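
The Methods paragraph describes fine-tuning pre-trained transformer encoders for token-level named entity recognition on weakly labeled notes. As an illustration only (the abstract does not give the authors' actual code, label scheme, checkpoints, or hyperparameters), a minimal sketch of that kind of setup, assuming the Hugging Face Transformers library and an invented two-entity BIO tag set, might look like this:

    # Illustrative sketch; entity names, checkpoint choice, and hyperparameters are assumptions.
    from transformers import (AutoTokenizer, AutoModelForTokenClassification,
                              TrainingArguments, Trainer)

    # Hypothetical subset of the 24 examination entities, in a BIO tagging scheme.
    labels = ["O", "B-VISUAL_ACUITY", "I-VISUAL_ACUITY", "B-IOP", "I-IOP"]
    label2id = {l: i for i, l in enumerate(labels)}
    id2label = {i: l for l, i in label2id.items()}

    model_name = "emilyalsentzer/Bio_ClinicalBERT"  # one example clinical BERT checkpoint
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForTokenClassification.from_pretrained(
        model_name, num_labels=len(labels), id2label=id2label, label2id=label2id)

    def tokenize_and_align(example):
        """Tokenize a pre-split note and align word-level weak labels to subword tokens."""
        enc = tokenizer(example["tokens"], is_split_into_words=True,
                        truncation=True, max_length=512)
        aligned, prev = [], None
        for wid in enc.word_ids():
            if wid is None:
                aligned.append(-100)                      # ignore special tokens in the loss
            elif wid != prev:
                aligned.append(example["ner_tags"][wid])  # label only the first subword
            else:
                aligned.append(-100)
            prev = wid
        enc["labels"] = aligned
        return enc

    # train_ds / eval_ds would be datasets built from the weakly labeled notes, e.g.:
    # args = TrainingArguments(output_dir="eye-exam-ner", learning_rate=5e-5,
    #                          per_device_train_batch_size=16, num_train_epochs=3)
    # trainer = Trainer(model=model, args=args,
    #                   train_dataset=train_ds.map(tokenize_and_align),
    #                   eval_dataset=eval_ds.map(tokenize_and_align),
    #                   tokenizer=tokenizer)
    # trainer.train()

In the weak-supervision setting the abstract describes, the word-level ner_tags passed to tokenize_and_align would come from automatically generated labels rather than human annotation, with human-labeled samples reserved for evaluation.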
