A Feature-Enhanced Entity Recognition Method for Chinese Electronic Medical Records

Beibei Zhang,Mingming Lu,Yu Fang

doi:10.1109/itme.2018.00014

Abstract

Electronic medical records (EMRs) contain rich medical information, which is of great significance to medical research. The amount of Chinese EMRs is growing, whereas the current named entity recognition methods based on machine learning do not consider the unique characteristics of Chinese EMR. In this paper, four types of entities for disease, symptom, inspection and treatment are trained and tested using the conditional random field model. Firstly, tag-of-words, part-of-speech and context are selected as the basic features. Secondly, by analyzing the characteristics of Chinese electronic medical record text, the chapter name feature, core word feature and word clustering feature are selected as the extended features. Among them, the core word feature is obtained by dividing the collected dictionary into characters and words and then counting the character frequency and word frequency. The word vector clustering feature is obtained by clustering word vectors. Then, by constructing a medical dictionary, a semi-automatic corpus annotation method is used to randomly extract and classify the corpora of a certain scale. Finally, using the conditional random field tool CRF++ to learn and predict, it achieves an accuracy of 93.03%, a recall rate of 90.69%, and an F value of 91.85%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Feature-Enhanced Entity Recognition Method for Chinese Electronic Medical Records

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records
An Fang ... Ming Feng
BMC Medical Informatics and Decision Making | VOL. 22
An Fang, et. al.An Fang ... Ming Feng
23 Mar 2022
BMC Medical Informatics and Decision Making | VOL. 22

Word Embedding Bootstrapped Deep Active Learning Method to Information Extraction on Chinese Electronic Medical Record
Qunsheng Ma ... Xingxing Cen
Journal of Shanghai Jiaotong University (Science) | VOL. 26
Qunsheng Ma, et. al.Qunsheng Ma ... Xingxing Cen
31 Mar 2021
Journal of Shanghai Jiaotong University (Science) | VOL. 26

Relation Extraction Based on Fusion Dependency Parsing from Chinese EMRs
Pengjun Zhai ... Xin Huang
Scientific Programming | VOL. 2020
Pengjun Zhai, et. al.Pengjun Zhai ... Xin Huang
08 Jun 2020
Scientific Programming | VOL. 2020

Time Recognition of Chinese Electronic Medical Record of Depression Based on Conditional Random Field
Shaofu Lin ... Yuanyuan Zhao
-
Shaofu Lin, et. al.Shaofu Lin ... Yuanyuan Zhao
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Feature-Enhanced Entity Recognition Method for Chinese Electronic Medical Records

Abstract

Talk to us

Similar Papers