Identification of asthma control factor in clinical notes using a hybrid deep learning model

Bhavani Singh Agnikula Kshatriya,Jungwon Yoon,Elham Sagheb,Young Juhn,Sunghwan Sohn,Hee Yun Seol,Chung-Il Wi

doi:10.1186/s12911-021-01633-4

Abstract

BackgroundThere are significant variabilities in guideline-concordant documentation in asthma care. However, assessing clinician’s documentation is not feasible using only structured data but requires labor-intensive chart review of electronic health records (EHRs). A certain guideline element in asthma control factors, such as review inhaler techniques, requires context understanding to correctly capture from EHR free text.MethodsThe study data consist of two sets: (1) manual chart reviewed data—1039 clinical notes of 300 patients with asthma diagnosis, and (2) weakly labeled data (distant supervision)—27,363 clinical notes from 800 patients with asthma diagnosis. A context-aware language model, Bidirectional Encoder Representations from Transformers (BERT) was developed to identify inhaler techniques in EHR free text. Both original BERT and clinical BioBERT (cBERT) were applied with a cost-sensitivity to deal with imbalanced data. The distant supervision using weak labels by rules was also incorporated to augment the training set and alleviate a costly manual labeling process in the development of a deep learning algorithm. A hybrid approach using post-hoc rules was also explored to fix BERT model errors. The performance of BERT with/without distant supervision, hybrid, and rule-based models were compared in precision, recall, F-score, and accuracy.ResultsThe BERT models on the original data performed similar to a rule-based model in F1-score (0.837, 0.845, and 0.838 for rules, BERT, and cBERT, respectively). The BERT models with distant supervision produced higher performance (0.853 and 0.880 for BERT and cBERT, respectively) than without distant supervision and a rule-based model. The hybrid models performed best in F1-score of 0.877 and 0.904 over the distant supervision on BERT and cBERT.ConclusionsThe proposed BERT models with distant supervision demonstrated its capability to identify inhaler techniques in EHR free text, and outperformed both the rule-based model and BERT models trained on the original data. With a distant supervision approach, we may alleviate costly manual chart review to generate the large training data required in most deep learning-based models. A hybrid model was able to fix BERT model errors and further improve the performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Informatics and Decision Making	Publication Date: Nov 1, 2021
Citations: 14	License type: open-access

R Discovery Prime

R Discovery Prime

Identification of asthma control factor in clinical notes using a hybrid deep learning model

Abstract

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making

Lead the way for us

Similar Papers

Engineering Document Summarization Using Sentence Representations Generated by Bidirectional Language Model
Yan Jin ... Yunjian Qiu
-
Yan Jin, et. al.Yan Jin ... Yunjian Qiu
17 Aug 2021
17 Aug 2021

Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT
Shoya Wada ... Yasushi Matsumura
Artificial Intelligence In Medicine | VOL. 153
Shoya Wada, et. al.Shoya Wada ... Yasushi Matsumura
05 May 2024
Artificial Intelligence In Medicine | VOL. 153

Deep Learning Identification of Asthma Inhaler Techniques in Clinical Notes.
Bhavani Singh Agnikula Kshatriya ... Chung-Il Wi
Proceedings. IEEE International Conference on Bioinformatics and Biomedicine | VOL. 2020
Bhavani Singh Agnikula Kshatriya, et. al.Bhavani Singh Agnikula Kshatriya ... Chung-Il Wi
16 Dec 2020
Proceedings. IEEE International Conference on Bioinformatics and Biomedicine | VOL. 2020

Multifaceted Natural Language Processing Task-Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation.
Kyungmo Kim ... Jinwook Choi
JMIR medical informatics | VOL. 12
Kyungmo Kim, et. al.Kyungmo Kim ... Jinwook Choi
30 Oct 2024
JMIR medical informatics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of asthma control factor in clinical notes using a hybrid deep learning model

Abstract

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making