Background: The tenth revision of the International Classification of Diseases (ICD-10) is widely used for epidemiological research and health management. The ICD-10 clinical modification (CM) and procedure coding system (PCS) were developed to capture more clinical detail through an expanded set of diagnosis and procedure codes and are applied in diagnosis-related groups for reimbursement. This expansion made manual coding time-consuming and less accurate. State-of-the-art models using deep contextual word embeddings have been applied to automatic multilabel text classification of ICD-10. Beyond the discharge diagnoses (DD) used as input, performance can be improved by appropriate preprocessing of text from other document types, such as medical history, comorbidity and complication, surgical method, and special examination.

Objective: This study aims to combine a contextual language model with rule-based preprocessing methods to develop a model for ICD-10 multilabel classification.

Methods: We retrieved electronic health records from a medical center. We first compared different word embedding methods and then compared the preprocessing methods using the best-performing embeddings. To predict ICD-10-CM codes, we compared biomedical bidirectional encoder representations from transformers (BioBERT), clinical generalized autoregressive pretraining for language understanding (Clinical XLNet), the label tree-based attention-aware deep model for high-performance extreme multilabel text classification (AttentionXML), and word-to-vector (Word2Vec). To compare different preprocessing methods for ICD-10-CM, we included DD, medical history, and comorbidity and complication as inputs, and we compared prediction performance with different preprocessing steps: ICD-10 definition training, external cause code removal, number conversion, and combination code filtering. For ICD-10-PCS, the model was trained using different combinations of DD, surgical method, and keywords of special examination. The micro F1 score and the micro area under the receiver operating characteristic curve (AUROC) were used to compare model performance across preprocessing methods.

Results: BioBERT achieved an F1 score of 0.701 and outperformed the other models (Clinical XLNet, AttentionXML, and Word2Vec). For ICD-10-CM, the F1 score increased significantly from 0.749 (95% CI 0.744-0.753) to 0.769 (95% CI 0.764-0.773) with ICD-10 definition training, external cause code removal, number conversion, and combination code filtering. For ICD-10-PCS, the F1 score increased significantly from 0.670 (95% CI 0.663-0.678) to 0.726 (95% CI 0.719-0.732) with the combination of DD, surgical methods, and keywords of special examination. With our preprocessing methods, the model achieved the highest AUROC of 0.853 (95% CI 0.849-0.855) for ICD-10-CM and 0.831 (95% CI 0.827-0.834) for ICD-10-PCS.

Conclusions: The performance of our model, which combines a pretrained contextualized language model with rule-based preprocessing, is better than that of the state-of-the-art model for both ICD-10-CM and ICD-10-PCS. This study highlights the importance of rule-based preprocessing methods grounded in professional coders' coding rules.
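To make the classification setup described in the Methods concrete, the following is a minimal sketch (not the authors' implementation) of fine-tuning a BioBERT checkpoint for multilabel ICD-10-CM prediction with the Hugging Face transformers library; the checkpoint name, label-set size, input text, and 0.5 decision threshold are illustrative assumptions.

```python
# Minimal multilabel ICD-10-CM classification sketch with BioBERT.
# NUM_CODES, the checkpoint, and the threshold are assumptions for illustration.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

NUM_CODES = 1000  # hypothetical size of the ICD-10-CM label set

tokenizer = AutoTokenizer.from_pretrained("dmis-lab/biobert-base-cased-v1.1")
model = AutoModelForSequenceClassification.from_pretrained(
    "dmis-lab/biobert-base-cased-v1.1",
    num_labels=NUM_CODES,
    problem_type="multi_label_classification",  # trains with BCEWithLogitsLoss
)

# Discharge diagnoses (plus any preprocessed history/comorbidity text) as input
text = "Type 2 diabetes mellitus with diabetic nephropathy; essential hypertension"
inputs = tokenizer(text, truncation=True, max_length=512, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits
probs = torch.sigmoid(logits)              # one independent probability per code
predicted_idx = (probs > 0.5).nonzero()    # codes whose probability exceeds 0.5
```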
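The preprocessing steps are named but not specified in the abstract. As one illustration, a rule-based filter for "external cause code removal" might drop ICD-10-CM Chapter 20 codes (external causes of morbidity, V00-Y99) from the assigned code list; both this interpretation and the helper below are assumptions, not the authors' rule.

```python
import re

# Assumption: "external cause code removal" drops ICD-10-CM Chapter 20
# (external causes of morbidity, V00-Y99); these codes all start with V, W, X, or Y.
EXTERNAL_CAUSE = re.compile(r"^[VWXY]\d{2}")

def remove_external_cause_codes(codes):
    """Keep only codes outside ICD-10-CM Chapter 20 (hypothetical helper)."""
    return [c for c in codes if not EXTERNAL_CAUSE.match(c)]

print(remove_external_cause_codes(["E11.21", "I10", "W19.XXXA", "V43.52XA"]))
# -> ['E11.21', 'I10']
```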
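Similarly, the evaluation metrics named in the Methods (micro F1 and micro AUROC) can be computed for multilabel predictions as in this toy sketch; the label and score arrays are fabricated solely to show the calls.

```python
# Micro-averaged F1 and AUROC for multilabel ICD-10 predictions (toy data).
import numpy as np
from sklearn.metrics import f1_score, roc_auc_score

# Rows = discharge records, columns = ICD-10 codes (1 = code assigned)
y_true = np.array([[1, 0, 1], [0, 1, 0]])
y_score = np.array([[0.9, 0.2, 0.7], [0.1, 0.8, 0.4]])  # sigmoid probabilities
y_pred = (y_score > 0.5).astype(int)

micro_f1 = f1_score(y_true, y_pred, average="micro")
micro_auroc = roc_auc_score(y_true, y_score, average="micro")
print(f"micro F1 = {micro_f1:.3f}, micro AUROC = {micro_auroc:.3f}")
```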