Family member information extraction via neural sequence labeling models with different tag schemes

Hong-Jie Dai

doi:10.1186/s12911-019-0996-4

Abstract

BackgroundFamily history information (FHI) described in unstructured electronic health records (EHRs) is a valuable information source for patient care and scientific researches. Since FHI is usually described in the format of free text, the entire process of FHI extraction consists of various steps including section segmentation, family member and clinical observation extraction, and relation discovery between the extracted members and their observations. The extraction step involves the recognition of FHI concepts along with their properties such as the family side attribute of the family member concept.MethodsThis study focuses on the extraction step and formulates it as a sequence labeling problem. We employed a neural sequence labeling model along with different tag schemes to distinguish family members and their observations. Corresponding to different tag schemes, the identified entities were aggregated and processed by different algorithms to determine the required properties.ResultsWe studied the effectiveness of encoding required properties in the tag schemes by evaluating their performance on the dataset released by the BioCreative/OHNLP challenge 2018. It was observed that the proposed side scheme along with the developed features and neural network architecture can achieve an overall F1-score of 0.849 on the test set, which ranked second in the FHI entity recognition subtask.ConclusionsBy comparing with the performance of conditional random fields models, the developed neural network-based models performed significantly better. However, our error analysis revealed two challenging issues of the current approach. One is that some properties required cross-sentence inferences. The other is that the current model is not able to distinguish between the narratives describing the family members of the patient and those specifying the relatives of the patient’s family members.

Highlights

History information (FHI) described in unstructured electronic health records (EHRs) is a valuable information source for patient care and scientific researches
The extraction of Family history information (FHI) from unstructured EHRs consists of various steps [4]: 1) Section segmentation: a preprocessing step to identify the sections containing FHI; 2) Family member and clinical observation extraction: a fundamental step to recognize family member mentions and their potential clinical observations described in the corresponding sections; 3) Family member-observation relation discovery: The final step associates the extracted observations with the correct family members
Results on the test sets We submitted three runs for the Family history information extraction (FHIE) entity recognition subtask with each corresponding to the model with side scheme (Run 1), the model with relation-side scheme (Run 2) and the baseline model (Run 3), respectively

Summary

Introduction

History information (FHI) described in unstructured electronic health records (EHRs) is a valuable information source for patient care and scientific researches. Since FHI is usually described in the format of free text, the entire process of FHI extraction consists of various steps including section segmentation, family member and clinical observation extraction, and relation discovery between the extracted members and their observations.

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Informatics and Decision Making	Publication Date: Dec 1, 2019
Citations: 12	License type: open-access

R Discovery Prime

R Discovery Prime

Family member information extraction via neural sequence labeling models with different tag schemes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making

Lead the way for us

Similar Papers

Long short-term memory RNN for biomedical named entity recognition
Chen Lyu ... Donghong Ji
BMC Bioinformatics | VOL. 18
Chen Lyu, et. al.Chen Lyu ... Donghong Ji
30 Oct 2017
BMC Bioinformatics | VOL. 18

Big Data, Predictive Analytics, and Quality Improvement in Kidney Transplantation: A Proof of Concept.
T.R Srinivas ... G Mour
American Journal of Transplantation | VOL. 17
T.R Srinivas, et. al.T.R Srinivas ... G Mour
04 Jan 2017
American Journal of Transplantation | VOL. 17

Towards improving the robustness of sequential labeling models against typographical adversarial examples using triplet loss
Can Udomcharoenchaikit ... Prachya Boonkwan
Natural Language Engineering | VOL. 29
Can Udomcharoenchaikit, et. al.Can Udomcharoenchaikit ... Prachya Boonkwan
04 Feb 2022
Natural Language Engineering | VOL. 29

A Sequence Labeling Based Approach for Character Segmentation of Historical Documents
Liangcai Gao ... Xiaode Zhang
-
Liangcai Gao, et. al.Liangcai Gao ... Xiaode Zhang
01 Apr 2018
01 Apr 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Family member information extraction via neural sequence labeling models with different tag schemes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Informatics and Decision Making