Multi-task learning for Chinese clinical named entity recognition with external knowledge

Ming Cheng,Shufeng Xiong,Pan Liang,Jianbo Gao,Fei Li

doi:10.1186/s12911-021-01717-1

Abstract

BackgroundNamed entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale labeled medical data. However, labeled data is expensive to obtain, and these data-driven methods are difficult to handle rare and unseen entities.MethodsTo tackle these problems, this study presents a novel multi-task deep neural network model for Chinese NER in the medical domain. We incorporate dictionary features into neural networks, and a general secondary named entity segmentation is used as auxiliary task to improve the performance of the primary task of named entity recognition.ResultsIn order to evaluate the proposed method, we compare it with other currently popular methods, on three benchmark datasets. Two of the datasets are publicly available, and the other one is constructed by us. Experimental results show that the proposed model achieves 91.07% average f-measure on the two public datasets and 87.05% f-measure on private dataset.ConclusionsThe comparison results of different models demonstrated the effectiveness of our model. The proposed model outperformed traditional statistical models.

Highlights

With rapid development of Electronic Medical Records (EMRs) systems, there has been an increasing interest in applying text mining and information extraction to the EMRs
The main contributions of this article are as follows: (1) We present a multi-task learning framework which jointly trains a model to perform entity segmentation with cross-entropy loss and entity recognition task with Conditional Random Fields (CRF)
The Chinese clinical Named entity recognition (NER) task is usually known as a sequence labelling task, while Named Entity Segmentation (NES) task is considered as binary classification task of whether a token is entity or not

Summary

Introduction

With rapid development of Electronic Medical Records (EMRs) systems, there has been an increasing interest in applying text mining and information extraction to the EMRs. Among the medical texts mining tasks, NER is a fundamental task which locates the mentions of named entities and classifies them (e.g. symptoms, tests, drugs, operations and diseases, etc.) in unstructured medical/healthcare. The deep models usually require a large amount of labeled data for training, while manual annotation is time-consuming. In order to alleviate the dependence of large annotation data, some researchers proposed to integrate prior knowledge into the models [9]. Named entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale labeled medical data. Labeled data is expensive to obtain, and these data-driven methods are difficult to handle rare and unseen entities

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Informatics and Decision Making	Publication Date: Dec 1, 2021
Citations: 3	License type: open-access

R Discovery Prime

R Discovery Prime

Multi-task learning for Chinese clinical named entity recognition with external knowledge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making

Lead the way for us

Similar Papers

Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation
Weijie Wang ... An Fang
JMIR Medical Informatics | VOL. 11
Weijie Wang, et. al.Weijie Wang ... An Fang
10 May 2023
JMIR Medical Informatics | VOL. 11

Leveraging Multi-source knowledge for Chinese clinical named entity recognition via relational graph convolutional network.
Ying Xiong ... Jun Yan
Journal of Biomedical Informatics | VOL. 128
Ying Xiong, et. al.Ying Xiong ... Jun Yan
01 Apr 2022
Journal of Biomedical Informatics | VOL. 128

Chinese clinical named entity recognition via multi-head self-attention based BiLSTM-CRF
Ying An ... Jianxin Wang
Artificial Intelligence in Medicine | VOL. 127
Ying An, et. al.Ying An ... Jianxin Wang
18 Mar 2022
Artificial Intelligence in Medicine | VOL. 127

Chinese Clinical Named Entity Recognition Based on Stroke-Level and Radical-Level Features
Feng Zhou ... Mingyang Li
-
Feng Zhou, et. al.Feng Zhou ... Mingyang Li
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-task learning for Chinese clinical named entity recognition with external knowledge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making