Parsing Clinical Text: How Good are the state-of-the-art Deep Learning Based Parsers?

Yaoyun Zhang,Hua Xu,Min Jiang,Firat Tiryaki

doi:10.1109/ichi-w.2018.00029

Abstract

A dependency parser generates both a syntactic structure and a shallow semantic structure of a sentence. It is a fundamental component of natural language processing (NLP) based pipelines, which are critical to facilitate research using the Electronic Health Records (EHR). However, current works mainly apply parsers developed in the general English domain to clinical text. There are no formal evaluations and comparisons of deep learning based dependency parsers in the medical domain. No state-of-the-art dependency parsing performance has been established on clinical text, either. In this study, we investigated the performance of four state-ofthe-art deep learning based dependency parsers, Stanford parser, Bist-parser, dependency_tf parser and jPTDP parser, respectively. Experiments for evaluation are conducted on two datasets: (1) The MiPACQ Treebank and (2) A Treebank of progress notes. Our results showed that the original parsers achieved lower performance in clinical text compared to general English text. After retraining on the clinical Treebank, all parsers obtained better performance. Besides, using word embeddings from Gigaword and MIMICIII yielded comparable performance. Interestingly, the transition-based parsers demonstrated stronger generalizability on different treebanks than the graph-based parsers. Overall, Bist-parser achieved the best performance on MiPACQ (88.95% UAS, 92.69% LS, 86.10% LAS). Stanford parser achieved the best performance on progress notes (84.01% UAS, 89/97% LS, 80.72% LAS).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parsing Clinical Text: How Good are the state-of-the-art Deep Learning Based Parsers?

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Parsing clinical text: how good are the state-of-the-art parsers?
Min Jiang ... Yang Huang
BMC Medical Informatics and Decision Making | VOL. Suppl 15 1
Min Jiang, et. al.Min Jiang ... Yang Huang
20 May 2015
BMC Medical Informatics and Decision Making | VOL. Suppl 15 1

An initial study of full parsing of clinical text using the Stanford Parser
Hua Xu ... Min Jiang
-
Hua Xu, et. al. Hua Xu ... Min Jiang
01 Nov 2011
01 Nov 2011

Combining Contextualized Embeddings and Prior Knowledge for Clinical Named Entity Recognition: Evaluation Study.
Min Jiang ... Todd Sanger
JMIR Medical Informatics | VOL. 7
Min Jiang, et. al.Min Jiang ... Todd Sanger
13 Nov 2019
JMIR Medical Informatics | VOL. 7

Carrell et al. Respond to "Observational Research and the EHR"
D S Carrell ... D S M Buist
American Journal of Epidemiology | VOL. 179
D S Carrell, et. al.D S Carrell ... D S M Buist
30 Jan 2014
American Journal of Epidemiology | VOL. 179

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parsing Clinical Text: How Good are the state-of-the-art Deep Learning Based Parsers?

Abstract

Talk to us

Similar Papers