A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing

Yang Hou

doi:10.48448/93ht-0d63

Abstract

The most straightforward approach to joint word segmentation (WS), part-of-speech (POS) tagging, and constituent parsing (PAR) is to convert a word-level tree into a char-level tree. This conversion, however, raises two severe challenges. First, a larger label set (e.g., >= 600 labels) and longer inputs both increase computational cost. Second, it is difficult to rule out illegal trees that contain conflicting production rules, which matters for reliable model evaluation. For example, if a POS tag (such as VV) appears above a phrase tag (such as VP) in the output tree, deciding word boundaries becomes quite complex. To address both challenges, this work proposes a two-stage coarse-to-fine labeling framework for joint WS-POS-PAR. In the coarse labeling stage, the joint model outputs a bracketed tree in which each node carries one of four labels (phrase, subphrase, word, or subword); constrained CKY decoding guarantees that the tree is legal. In the fine labeling stage, the model expands each coarse label into a final label (such as VP, VP*, VV, or VV*). Experiments on Chinese Penn Treebank 5.1 and 7.0 show that the joint model consistently outperforms the pipeline approach both without and with BERT, and achieves new state-of-the-art performance.
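
As a rough illustration of the label scheme described in the abstract, the sketch below maps fine labels (VP, VP*, VV, VV*) to the four coarse labels and checks one legality condition on a char-level tree: nothing other than subword nodes may sit below a word-level node. This is not the authors' implementation. The `Node` class, the `POS_TAGS` subset, the example trees, and the exact legality rule are assumptions made for illustration, and the check is a plain recursive test rather than constrained CKY decoding.

```python
from dataclasses import dataclass, field

# Assumed, illustrative subset of POS tags (not the full treebank tag set).
POS_TAGS = {"VV", "NN", "NR"}


@dataclass
class Node:
    label: str                                     # fine label, e.g. "VP", "VP*", "VV", "VV*"
    children: list = field(default_factory=list)   # Node objects or single characters (str)


def coarse(label: str) -> str:
    """Map a fine label to one of the four coarse labels."""
    base = label.rstrip("*")
    partial = label.endswith("*")
    if base in POS_TAGS:
        return "subword" if partial else "word"
    return "subphrase" if partial else "phrase"


def is_legal(node: Node, inside_word: bool = False) -> bool:
    """Assumed rule: below a word or subword node, only subword nodes may appear."""
    kind = coarse(node.label)
    if inside_word and kind != "subword":
        return False
    child_inside = inside_word or kind in {"word", "subword"}
    return all(
        is_legal(child, child_inside)
        for child in node.children
        if isinstance(child, Node)  # characters (str leaves) need no check
    )


# A legal char-level tree: a VP over a two-character verb and a three-character noun.
legal_tree = Node("VP", [
    Node("VV", ["喜", "欢"]),
    Node("NN", [Node("NN*", ["图", "书"]), "馆"]),
])

# An illegal tree: a POS tag (VV) above a phrase tag (VP).
illegal_tree = Node("VV", [Node("VP", ["喜", "欢"]), "乐"])

print([coarse(x) for x in ("VP", "VP*", "VV", "VV*")])
print(is_legal(legal_tree), is_legal(illegal_tree))
```

In the second tree it is unclear where the word boundaries fall, which is the kind of conflict the abstract says the coarse stage is designed to exclude.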

Similar Papers

A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing
Yang Hou ... Zhefeng Wang
01 Jan 2020

A Novel Part of Speech Tagging Framework for NLP Based Business Process Management
Xue Han ... Lijun Mei
01 Jul 2019

Part of speech tagging for Arabic
Sandra Kübler ... Emad Mohamed
Natural Language Engineering | VOL. 18
06 Dec 2011

An improved joint model: POS tagging and dependency parsing
Journal of Artificial Intelligence and Data Mining | VOL. 4
01 Jan 2015
