Clinical Electronic Medical Records Research Articles

To address the problem of poor entity recognition performance caused by the lack of Chinese annotation in clinical electronic medical records, this paper proposes a multi-medical entity recognition method F-MNER using a fusion technique combining BART, Bi-LSTM, and CRF. First, after cleaning, encoding, and segmenting the electronic medical records, the obtained semantic representations are dynamically fused using a bidirectional autoregressive transformer (BART) model. Then, sequential information is captured using a bidirectional long short-term memory (Bi-LSTM) network. Finally, the conditional random field (CRF) is used to decode and output multi-task entity recognition. Experiments are performed on the CCKS2019 dataset, with micro avg Precision, macro avg Recall, weighted avg Precision reaching 0.880, 0.887, and 0.883, and micro avg F1-score, macro avg F1-score, weighted avg F1-score reaching 0.875, 0.876, and 0.876 respectively. Compared with existing models, our method outperforms the existing literature in three evaluation metrics (micro average, macro average, weighted average) under the same dataset conditions. In the case of weighted average, the Precision, Recall, and F1-score are 19.64%, 15.67%, and 17.58% higher than the existing BERT-BiLSTM-CRF model respectively. Experiments are performed on the actual clinical dataset with our MF-MNER, the Precision, Recall, and F1-score are 0.638, 0.825, and 0.719 under the micro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.685, 0.800, and 0.733 under the macro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.647, 0.825, and 0.722 under the weighted avg evaluation mechanism. The above results show that our method MF-MNER can integrate the advantages of BART, Bi-LSTM, and CRF layers, significantly improving the performance of downstream named entity recognition tasks with a small amount of annotation, and achieving excellent performance in terms of recall score, which has certain practical significance. Source code and datasets to reproduce the results in this paper are available at https://github.com/xfwang1969/MF-MNER .

Read full abstract

IntroductionThe proliferation and use of electronic medical records (EMR) in the clinical setting now provide a rich source of clinical data that can be leveraged to support research on patient outcomes, comparative effectiveness, and health systems research. Once the large volume and variety of data that robust clinical EMRs provide is aggregated, the suitability of the data for research purposes must be addressed. Therefore, the purpose of this paper is two-fold. First, we present a stepwise framework capable of guiding initial data quality assessment when matching multiple data sources regardless of context or application. Then, we demonstrate a use case of initial analysis of a longitudinal data repository of electronic health record data that illustrates the first four steps of the framework, and report results. MethodsA six-step data quality assessment framework is proposed and described that includes the following data quality assessment steps: (1) preliminary analysis, (2) documentation–longitudinal concordance, (3) breadth, (4) data element presence, (5) density, and (6) prediction. The six-step framework was applied to the Transport Data Mart—a data repository that contains over 28,000 records for patients that underwent interhospital transfer that includes EMRs from the sending hospitalization, transport, and receiving hospitalization. ResultsThere were a total of 9557 log entries of which 8139 were successfully matched to corresponding hospital encounters. 2832 were successfully mapped to both the sending and receiving hospital encounters (resulting in a 93% automatic matching rate), with 590 including air medical transport EMR data representing a complete case for testing. Results from Step 2 indicate that once records are identified and matched, there appears to be relatively limited drop-off of additional records when the criteria for matching increases, indicating the a proportion of records consistently contain nearly complete data. Measures of central tendency used in Step 3 and 4 exhibit a right skewness suggesting that a small proportion of records contain the highest number of repeated measures for the measured variables. ConclusionsThe proposed six-step data quality assessment framework is useful in establishing the metadata for a longitudinal data repository that can be replicated by other studies. There are practical issues that need to be addressed including the data quality assessments—with the most prescient being the need to establish data quality metrics for benchmarking acceptable levels of EMR data inclusiveness through testing and application.

Read full abstract

Clinical Electronic Medical Records Research Articles

Articles published on Clinical Electronic Medical Records

MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records.

User Perceptions of Academic and Clinical Electronic Medical Records in an Undergraduate Nursing Program

Tibetan Medical Named Entity Recognition Study for Tibetan Clinical Electronic Medical Records

An Improved Method for Named Entity Recognition and Its Application to CEMR

Identification of validated case definitions for medical conditions used in primary care electronic medical record databases: a systematic review

Harnessing the Power of Big Data to Improve Graduate Medical Education: Big Idea or Bust?

Data quality assessment framework to assess electronic medical record data for use in research

CSER and eMERGE: current and potential state of the display of genetic information in the electronic health record.

Combination of symptoms, syndrome and disease: treatment of refractory diabetic gastroparesis.

Marrying Work and Technical Artifact within the Healthcare Organization: A Work Network Approach

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Clinical Electronic Medical Records Research Articles

Articles published on Clinical Electronic Medical Records

MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records.

User Perceptions of Academic and Clinical Electronic Medical Records in an Undergraduate Nursing Program

Tibetan Medical Named Entity Recognition Study for Tibetan Clinical Electronic Medical Records

An Improved Method for Named Entity Recognition and Its Application to CEMR

Identification of validated case definitions for medical conditions used in primary care electronic medical record databases: a systematic review

Harnessing the Power of Big Data to Improve Graduate Medical Education: Big Idea or Bust?

Data quality assessment framework to assess electronic medical record data for use in research

CSER and eMERGE: current and potential state of the display of genetic information in the electronic health record.

Combination of symptoms, syndrome and disease: treatment of refractory diabetic gastroparesis.

Marrying Work and Technical Artifact within the Healthcare Organization: A Work Network Approach