A weakly supervised method for named entity recognition of Chinese electronic medical records.

Meng Li,Huajian Zhou,Kuang Zhang,Jing Ying,Chunrong Gao

doi:10.1007/s11517-023-02871-6

Abstract

The field of Chinese medical natural language processing faces a significant challenge in training accurate entity recognition models due to the limited availability of high-quality labeled data. In response, we propose a joint training model, MCBERT-GCN-CRF, which achieves high performance in identifying medical-related entities in Chinese electronic medical records. Additionally, we introduce CM-NER, a 5-step framework that effectively mitigates the effects of noise in weakly labeled data and establishes a principled connection between supervised and weakly supervised named entity recognition. We demonstrate significant improvements in recall rate and accuracy. Our approach outperforms traditional fully supervised pre-training models and other state-of-the-art methods by suppressing noise in weakly labeled data. Our proposed framework achieves an F1 score of 86.29% on the CCKS-2019 dataset, significantly higher than pre-trained model baselines ranging from 74.17 to 83.06%, and higher than the top-performing named entity recognition supervised learning models in the CCKS-2019 competition. Our results demonstrate the effectiveness of our proposed framework and highlight the potential of leveraging unlabeled data to train accurate models for named entity recognition in Chinese medical natural language processing. This research has significant implications for advancing natural language processing techniques in the medical domain and improving patient care.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A weakly supervised method for named entity recognition of Chinese electronic medical records.

Abstract

Talk to us

Similar Papers

More From: Medical & biological engineering & computing

Lead the way for us

Similar Papers

Deep Learning for Natural Language Processing
Jiajun Zhang ... Chengqing Zong
-
Jiajun Zhang, et. al.Jiajun Zhang ... Chengqing Zong
01 Jan 2019
01 Jan 2019

Chinese new word identification: a latent discriminative model with global features
...
Journal of Computer Science and Technology | VOL. 26
, et. al. ...
11 Jan 2011
Journal of Computer Science and Technology | VOL. 26

Research on Sustainable Mining Engineering
Xiao Guang Yue ... Guang Zhang
Applied Mechanics and Materials | VOL. 340
Xiao Guang Yue, et. al.Xiao Guang Yue ... Guang Zhang
01 Jul 2013
Applied Mechanics and Materials | VOL. 340

Evaluation of Typing Efficiency Using Language Model for the Chinese Typewriter
Li Weigang ... Carlos A A Rocha
-
Li Weigang, et. al.Li Weigang ... Carlos A A Rocha
01 Mar 2022
01 Mar 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A weakly supervised method for named entity recognition of Chinese electronic medical records.

Abstract

Talk to us

Similar Papers

More From: Medical & biological engineering & computing