An Attention-Based BiLSTM-CRF Model for Chinese Clinic Named Entity Recognition

Guohua Wu, Zhen Wang, Zhongru Wang, Zhen Zhang, Guangen Tang

doi:10.1109/access.2019.2935223


Open Access

https://doi.org/10.1109/access.2019.2935223


Abstract

Clinic Named Entity Recognition (CNER) aims to recognize named entities such as body parts, diseases, and symptoms in Electronic Health Records (EHRs), which can benefit many intelligent biomedical systems. In recent years, increasing attention has been paid to end-to-end CNER with recurrent neural networks (RNNs), especially long short-term memory networks (LSTMs). However, capturing long-range dependencies remains a great challenge for RNNs. Moreover, Chinese presents additional challenges: it uses logograms rather than an alphabet, its words are ambiguous, and it has no explicit word boundaries. In this work, we present a BiLSTM-CRF model with a self-attention mechanism (Att-BiLSTM-CRF) for the Chinese CNER task, which aims to address these problems. The self-attention mechanism can learn long-range dependencies by establishing a direct connection between each pair of characters. To learn more semantic information about Chinese characters, we propose a novel fine-grained character-level representation method. We also introduce part-of-speech (POS) labeling information into our model to capture the semantic information in the input sentence. We evaluate performance on the CCKS-2017 Shared Task 2 dataset, and the experimental results indicate that our model outperforms other state-of-the-art methods.

Highlights

Clinic Named Entity Recognition (CNER) is a basic and important natural language processing (NLP) task in clinical and digital health research.
To learn more semantic information about Chinese characters, we propose a novel fine-grained character-level representation method.
The contributions of this paper can be summarized as follows: 1) We propose an Att-BiLSTM-CRF model, a bidirectional long short-term memory (BiLSTM) network with self-attention and a CRF layer, to perform the Chinese CNER task.

Summary

INTRODUCTION

Clinic Named Entity Recognition (CNER) is a basic and important natural language processing (NLP) task in clinical and digital health research. Among earlier approaches, Hu et al. [24] used an ensemble of rule-based, CRF, and LSTM-based models for the Chinese CNER task, and enlarged the training data with a self-training algorithm to improve performance. Xia and Wang [31] proposed a BiLSTM-CRF model with self-training and an ensemble learning algorithm for the same task. We use a novel fine-grained character-level representation method to obtain more semantic information about Chinese characters, and we introduce POS labeling information into our model to learn semantic information about input sentences. The structure of our model was described

EMBEDDING LAYER
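
The abstract states that part-of-speech (POS) information is introduced into the model alongside character-level representations. The sketch below shows one plausible way to combine the two: each character inherits the POS tag of the word that contains it, and the character vector and the POS vector are concatenated. The embedding tables, dimensions, tag names, and the concatenation itself are assumptions made for illustration. The paper's fine-grained character representation is not described in this summary and is not reproduced here.

```python
import random


def build_table(symbols, dim, rng):
    """Hypothetical embedding table: one small random vector per symbol."""
    return {s: [rng.uniform(-0.1, 0.1) for _ in range(dim)] for s in symbols}


def char_pos_pairs(tagged_words):
    """Spread each word's POS tag over the characters it contains."""
    return [(ch, pos) for word, pos in tagged_words for ch in word]


def embed(tagged_words, char_table, pos_table):
    """Concatenate the character vector and the POS vector per character."""
    return [
        char_table[ch] + pos_table[pos]  # list concatenation, not addition
        for ch, pos in char_pos_pairs(tagged_words)
    ]


# Illustrative input only: a segmented, POS-tagged phrase.
# The tags "n" and "v" are placeholders, not the paper's tag set.
tagged = [("头部", "n"), ("疼痛", "v")]

rng = random.Random(0)
char_table = build_table("头部疼痛", 4, rng)  # assumed 4-dim character vectors
pos_table = build_table(["n", "v"], 2, rng)   # assumed 2-dim POS vectors

vectors = embed(tagged, char_table, pos_table)
print(len(vectors), len(vectors[0]))
```

In a trained model the tables would be learned parameters; here they are random so that the example stays self-contained.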

BILSTM LAYER

SELF-ATTENTION LAYER
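
The abstract describes self-attention as learning long-range dependencies by connecting each character directly to every other character. A minimal sketch of that idea follows, using generic scaled dot-product attention with identity projections (no learned weights). This formulation is an assumption chosen for brevity; the summary does not give the authors' exact attention function.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(hidden):
    """Scaled dot-product self-attention with identity projections.

    hidden: list of T vectors (lists of d floats), e.g. one BiLSTM
    output per character. Returns (context, weights), where
    weights[i][j] is how strongly position i attends to position j.
    """
    d = len(hidden[0])
    scale = math.sqrt(d)
    context, weights = [], []
    for query in hidden:
        # Every position is scored against every other position directly,
        # so distance in the sequence plays no role.
        scores = [sum(a * b for a, b in zip(query, key)) / scale for key in hidden]
        w = softmax(scores)
        weights.append(w)
        context.append(
            [sum(w[j] * hidden[j][c] for j in range(len(hidden))) for c in range(d)]
        )
    return context, weights


# Toy input: 4 "characters" with 3-dimensional hidden states.
# The first and last states are identical on purpose.
hidden_states = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
]
context, weights = self_attention(hidden_states)
print([round(w, 3) for w in weights[0]])
```

In the toy input, the first position attends to the last position as strongly as to itself, even though they sit at opposite ends of the sequence. A recurrent layer would have to carry that information through every intermediate step.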

CRF LAYER
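
A CRF layer on top of a BiLSTM is usually decoded with the Viterbi algorithm, which finds the tag sequence with the highest combined emission and transition score. The sketch below is a generic Viterbi decoder, not the authors' implementation. The three-tag scheme, the scores, and the omission of start and stop transitions are assumptions made to keep the example short.

```python
def viterbi(emissions, transitions):
    """Return (best_path, best_score) for a linear-chain CRF.

    emissions[t][k]: score of tag k at position t (e.g. from the encoder).
    transitions[j][k]: score of moving from tag j to tag k.
    Start and stop transitions are left out for brevity.
    """
    n_tags = len(emissions[0])
    score = list(emissions[0])
    backptr = []
    for emit in emissions[1:]:
        new_score, ptr = [], []
        for k in range(n_tags):
            # Best previous tag for arriving at tag k.
            best_j = max(range(n_tags), key=lambda j: score[j] + transitions[j][k])
            new_score.append(score[best_j] + transitions[best_j][k] + emit[k])
            ptr.append(best_j)
        score = new_score
        backptr.append(ptr)
    # Follow the back-pointers from the best final tag.
    best_last = max(range(n_tags), key=lambda k: score[k])
    path = [best_last]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, score[best_last]


# Hypothetical tag set and scores for a two-character input.
tags = ["O", "B", "I"]
emissions = [
    [2.0, 1.0, 0.0],  # position 0 prefers O
    [1.0, 0.9, 1.2],  # position 1, taken alone, prefers I
]
transitions = [
    [0.0, 0.0, -10.0],  # O -> I is heavily penalized
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
]
path, best = viterbi(emissions, transitions)
print([tags[k] for k in path], best)
```

Picking the best tag at each position independently would give O followed by I, which is not a valid label sequence. The transition penalty steers the decoder to O followed by O instead, which is the kind of constraint a CRF layer contributes.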

Findings

CONCLUSION


Journal: IEEE Access
Publication Date: Jan 1, 2019
Citations: 61
License type: CC BY 4.0


