Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records

An Fang, Xianlai Chen, Jiahui Hu, Wanqing Zhao, Pei Lou, Huiling Ren, Ji Fu, Shanshan Feng, Ming Feng

doi:10.1186/s12911-022-01810-z

Abstract

Objective: Pituitary adenomas are the most common type of pituitary disorder. They usually occur in young adults and often affect patients' physical development, labor capacity, and fertility. Clinical free text recorded in the electronic medical records (EMRs) of pituitary adenoma patients contains abundant diagnosis and treatment information. However, this information has not been well utilized because extracting information from unstructured clinical text is challenging. This study aims to enable machines to process clinical information intelligently and to automatically extract clinical named entities for pituitary adenomas from Chinese EMRs.

Methods: The clinical corpus used in this study came from a pituitary adenoma neurosurgery treatment center at a 3A hospital in China. Four types of fine-grained clinical record text were selected: notes on present illness, past medical history, case characteristics, and family history for 500 pituitary adenoma inpatients. Four methods were used to extract clinical entities from the Chinese EMR corpus: dictionary-based matching, conditional random fields (CRF), bidirectional long short-term memory with CRF (BiLSTM-CRF), and bidirectional encoder representations from transformers with BiLSTM-CRF (BERT-BiLSTM-CRF). For the dictionary-based matching method, a comprehensive dictionary was constructed from open-source vocabularies and a domain dictionary for pituitary adenomas. Features such as part of speech, radical, document type, and character position were selected to train the CRF-based model. Random character embeddings were used as input features for the BiLSTM-CRF model, and character embeddings pretrained by BERT were used as input features for the BERT-BiLSTM-CRF model. Both a strict metric and a relaxed metric were used to evaluate the performance of these methods.

Results: Experimental results demonstrated that the deep learning and other machine learning methods were able to automatically extract clinical named entities, including symptoms, body regions, diseases, family histories, surgeries, medications, and disease courses of pituitary adenomas, from Chinese EMRs. In overall performance, BERT-BiLSTM-CRF achieved the highest strict F1 value (91.27%) and the highest relaxed F1 value (95.57%). Additional evaluations showed that BERT-BiLSTM-CRF performed best on almost all entity types except surgery and disease course. BiLSTM-CRF performed best in disease course entity recognition, performing as well as the CRF model with part-of-speech, radical, and document type features, with both strict and relaxed F1 values reaching 96.48%. The CRF model with part-of-speech, radical, and document type features performed best in surgery entity recognition, with a relaxed F1 value of 95.29%.

Conclusions: In this study, we applied four entity recognition methods for pituitary adenomas to Chinese EMRs. The results demonstrate that deep learning methods can effectively extract various types of clinical entities with satisfactory performance. This study contributes to clinical named entity extraction from Chinese neurosurgical EMRs. The findings could also assist information extraction from other Chinese medical texts.
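The abstract reports strict and relaxed F1 values but does not define them. The sketch below illustrates one common convention, which is an assumption here and not necessarily the authors' definition: a strict match requires identical boundaries and entity type, while a relaxed match requires only the same type and overlapping character spans. The names `Entity`, `strict_f1`, and `relaxed_f1`, and the sample spans, are hypothetical.

```python
from typing import List, Tuple

# An entity is (start, end_exclusive, type). This layout is an assumption
# made for illustration; the paper does not specify its data format.
Entity = Tuple[int, int, str]


def _f1(pred_hits: int, n_pred: int, gold_hits: int, n_gold: int) -> float:
    """Harmonic mean of precision and recall, returning 0.0 when undefined."""
    precision = pred_hits / n_pred if n_pred else 0.0
    recall = gold_hits / n_gold if n_gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def _overlaps(a: Entity, b: Entity) -> bool:
    """True when two entities share a type and at least one character."""
    return a[2] == b[2] and a[0] < b[1] and b[0] < a[1]


def strict_f1(gold: List[Entity], pred: List[Entity]) -> float:
    """Assumed strict metric: boundaries and type must match exactly."""
    gold_set, pred_set = set(gold), set(pred)
    exact = len(gold_set & pred_set)
    return _f1(exact, len(pred_set), exact, len(gold_set))


def relaxed_f1(gold: List[Entity], pred: List[Entity]) -> float:
    """Assumed relaxed metric: same type and any span overlap counts."""
    pred_hits = sum(any(_overlaps(p, g) for g in gold) for p in pred)
    gold_hits = sum(any(_overlaps(g, p) for p in pred) for g in gold)
    return _f1(pred_hits, len(pred), gold_hits, len(gold))


# Hypothetical annotations: the second prediction has a shifted left boundary.
gold = [(0, 2, "symptom"), (5, 9, "disease"), (12, 14, "surgery")]
pred = [(0, 2, "symptom"), (6, 9, "disease"), (20, 22, "medication")]

strict = strict_f1(gold, pred)
relaxed = relaxed_f1(gold, pred)
```

In this toy example the boundary-shifted disease prediction counts only under the relaxed metric, which is why relaxed scores are typically at least as high as strict ones, consistent with the 91.27% versus 95.57% reported for BERT-BiLSTM-CRF.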

Journal: BMC Medical Informatics and Decision Making	Publication Date: Mar 23, 2022
Citations: 8	License type: open-access

Similar Papers

Medical Text Entity Recognition Based on CRF and Joint Entity
Yong Li ... Qixian Ma
14 Apr 2021

A Feature-Enhanced Entity Recognition Method for Chinese Electronic Medical Records
Beibei Zhang ... Yu Fang
01 Oct 2018

Combining External Medical Knowledge for Improving Obstetric Intelligent Diagnosis: Model Development and Validation
Kunli Zhang ... Tao Liu
JMIR Medical Informatics | VOL. 9
10 May 2021

Entity relationship extraction from Chinese electronic medical records based on feature augmentation and cascade binary tagging framework
Xiaoqing Lu ... Shudong Xia
Mathematical Biosciences and Engineering | VOL. 21
01 Jan 2023
