A hybrid approach for named entity recognition in Chinese electronic medical record

Bin Ji,Jie Yu,Qingbo Wu,Rui Liu,Yusong Tan,Jiaju Wu,Shasha Li

doi:10.1186/s12911-019-0767-2

Abstract

BackgroundWith the rapid spread of electronic medical records and the arrival of medical big data era, the application of natural language processing technology in biomedicine has become a hot research topic.MethodsIn this paper, firstly, BiLSTM-CRF model is applied to medical named entity recognition on Chinese electronic medical record. According to the characteristics of Chinese electronic medical records, obtain the low-dimensional word vector of each word in units of sentences. And then input the word vector to BiLSTM to realize automatic extraction of sentence features. And then CRF performs sentence-level word tagging. Secondly, attention mechanism is added between the BiLSTM and the CRF to construct Attention-BiLSTM-CRF model, which can leverage document-level information to alleviate tagging inconsistency. In addition, this paper proposes an entity auto-correct algorithm to rectify entities according to historical entity information. At last, a drug dictionary and post-processing rules are well-built to rectify entities, to further improve performance.ResultsThe final F1 scores of the BiLSTM-CRF and Attention-BiLSTM-CRF model on given test dataset are 90.15 and 90.82% respectively, both of which are higher than 89.26%, which is the best F1 score on the test dataset except ours.ConclusionOur approach can be used to recognize medical named entity on Chinese electronic medical records and achieves the state-of-the-art performance on the given test dataset.

Highlights

With the rapid spread of electronic medical records and the arrival of medical big data era, the application of natural language processing technology in biomedicine has become a hot research topic
By adding attention mechanism into BiLSTM-conditional random field (CRF), we construct Attention-BiLSTM-CRF model and apply it to Named Entity Recognition (NER) in Chinese Electronic medical record (EMR), which aims at alleviating tagging inconsistency problem by leveraging document-level information
The test results of the two neural network models on the given test dataset are shown in Table 3, which are provided by the Conference on Knowledge Graph and Semantic Computing (CCKS) 2018 evaluation platform [19], and the definition of strict index can be found on this evaluation platform, too

Summary

Introduction

With the rapid spread of electronic medical records and the arrival of medical big data era, the application of natural language processing technology in biomedicine has become a hot research topic. Named Entity Recognition (NER) is a basic task in Natural Language Processing (NLP). NER is to recognize three kinds of named entity, which are name, place, and organization [1]. With rapid development of electronic medical records and clinical information, doctors need information-based. Different hospitals and even different doctors may name the same entity differently; secondly, there may be several names for one entity, e.g. a drug can have tens of trade names; thirdly, new entities are constantly being created; last but not least, usage of Chinese is flexible. Some words cannot be judged as named entities without context, and there is no space between Chinese characters as boundary mark

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC medical informatics and decision making	Publication Date: Apr 1, 2019
Citations: 49	License type: open-access

R Discovery Prime

R Discovery Prime

A hybrid approach for named entity recognition in Chinese electronic medical record

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC medical informatics and decision making

Lead the way for us

Similar Papers

A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition
Bin Ji ... Rui Liu
-
Bin Ji, et. al.Bin Ji ... Rui Liu
21 Dec 2018
21 Dec 2018

Combining External Medical Knowledge for Improving Obstetric Intelligent Diagnosis: Model Development and Validation
Kunli Zhang ... Tao Liu
JMIR medical informatics | VOL. 9
Kunli Zhang, et. al.Kunli Zhang ... Tao Liu
10 May 2021
JMIR medical informatics | VOL. 9

Named Entity Recognition in Chinese Electronic Medical Record Using Attention Mechanism
Menglong Li ... Jing Chen
-
Menglong Li, et. al.Menglong Li ... Jing Chen
01 Jul 2019
01 Jul 2019

Entity relationship extraction from Chinese electronic medical records based on feature augmentation and cascade binary tagging framework.
Xiaoqing Lu ... Shudong Xia
Mathematical Biosciences and Engineering | VOL. 21
Xiaoqing Lu, et. al.Xiaoqing Lu ... Shudong Xia
01 Jan 2023
Mathematical Biosciences and Engineering | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A hybrid approach for named entity recognition in Chinese electronic medical record

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC medical informatics and decision making