Improving textual medication extraction using combined conditional random fields and rule-based systems

Domonkos Tikk, Illés Solt

doi:10.1136/jamia.2010.004119

Abstract

In the i2b2 Medication Extraction Challenge, medication names and the details of their administration were to be extracted from medical discharge summaries. The task was decomposed into three pipelined components: named entity identification (NEI), context-aware filtering, and relation extraction. For NEI, two approaches were investigated. The first is a rule-based (RB) method, which was used in our solution that ranked fifth overall at the challenge. The second is a conditional random fields (CRF) approach developed after the completion of the challenge. The CRF models are trained on the 17 ground truth documents and on the output of the rule-based NEI component on all documents, which forms a larger but potentially inaccurate training dataset. For both NEI approaches, the effect on relation extraction performance was investigated. The filtering and relation extraction components are both rule-based. In addition to the official entry-level evaluation of the challenge, an entity-level analysis is provided. On the test data, the RB-NEI component achieved an entry-level F1-score of 80% for exact matching and 81% for inexact matching. The CRF on its own produces a significantly weaker result, but the CRF combined with the rule-based output outperforms the rule-based model, with F1-scores of 81% (exact) and 82% (inexact) (p<0.02). This study shows that a simple rule-based method is on a par with more complicated machine learners, and that CRF models can benefit from the addition of potentially inaccurate training data when only very few training documents are available. Such training data could be generated from the outputs of rule-based methods.
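The idea of generating extra training data from rule-based output can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy drug lexicon, the dosage pattern, the `B-MED`/`B-DOSE` labels, and the helper names are hypothetical and are not the rules or label set used in the paper. It shows only how a small gold-labeled set can be extended with machine-tagged ("silver") sentences before a sequence model such as a CRF is trained on the result.

```python
import re

# Hypothetical toy resources; the abstract does not describe the actual rules.
DRUG_LEXICON = {"aspirin", "lisinopril", "metformin"}
DOSAGE_RE = re.compile(r"^\d+(\.\d+)?(mg|mcg|g|ml)$", re.IGNORECASE)


def rule_based_tags(tokens):
    """Tag each token with a toy label using lexicon lookup and a dosage regex."""
    tags = []
    for tok in tokens:
        if tok.lower() in DRUG_LEXICON:
            tags.append("B-MED")
        elif DOSAGE_RE.match(tok):
            tags.append("B-DOSE")
        else:
            tags.append("O")
    return tags


def build_training_set(gold, unlabeled):
    """Return gold sentences followed by silver sentences tagged by the rules."""
    silver = [(tokens, rule_based_tags(tokens)) for tokens in unlabeled]
    return list(gold) + silver


# One hand-labeled sentence and two unlabeled ones (all invented examples).
gold = [(["Take", "aspirin", "81mg", "daily"], ["O", "B-MED", "B-DOSE", "O"])]
unlabeled = [
    ["Start", "metformin", "500mg", "today"],
    ["Continue", "Lasix", "40mg"],
]

training = build_training_set(gold, unlabeled)
for tokens, tags in training:
    print(list(zip(tokens, tags)))
```

In this sketch "Lasix" is missing from the toy lexicon, so the silver data tags it as `O`. That is the kind of label noise the abstract refers to when it calls the larger dataset potentially inaccurate.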


Journal: Journal of the American Medical Informatics Association
Publication Date: Sep 1, 2010
Citations: 33
