Encoding multi-granularity structural information for joint Chinese word segmentation and POS tagging

Ling Zhao, Ying Liu, Hao Fei, Ailian Zhang

doi:10.1016/j.patrec.2020.07.017

Abstract

Recent studies show that performing Chinese word segmentation and POS tagging jointly strengthens the interaction between the two tasks and improves performance on both. However, existing joint methods fail to take full advantage of information at multiple granularities, e.g., character, word and subword, which has proven highly useful. In this paper, we propose to improve the joint task by leveraging such multi-granularity information, exploiting lattice-LSTM and graph convolutional network (GCN) models to encode the graph information effectively. On five benchmark datasets, our proposed model is highly competitive and achieves new state-of-the-art results. Further analysis reveals that multi-granularity information can relieve the out-of-vocabulary and long-range dependency issues. In addition, the GCN structure is more effective than the lattice structure for encoding the multi-granularity graph information.
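
To make the idea of a multi-granularity graph concrete, here is a minimal sketch, not the authors' architecture. It assumes a toy lexicon, treats each character and each lexicon-matched word as a graph node, links every word node to the characters it covers, and applies one generic GCN propagation step, ReLU(D^-1/2 (A + I) D^-1/2 H W). The function names, the example sentence, the lexicon, and the weight values are all illustrative assumptions.

```python
import math

def build_graph(sentence, lexicon, max_len=4):
    """Return (nodes, edges): character nodes first, then matched word nodes."""
    nodes = list(sentence)  # one node per character
    edges = {(i, i + 1) for i in range(len(sentence) - 1)}  # adjacent characters
    for start in range(len(sentence)):
        for end in range(start + 2, min(start + max_len, len(sentence)) + 1):
            word = sentence[start:end]
            if word in lexicon:
                w = len(nodes)
                nodes.append(word)
                # link the word node to every character it covers
                edges.update((c, w) for c in range(start, end))
    return nodes, sorted(edges)

def normalized_adjacency(n, edges):
    """Dense D^-1/2 (A + I) D^-1/2 for an undirected graph with self-loops."""
    a = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(x, y):
    """Plain-Python matrix product of two lists of lists."""
    return [[sum(x_i[k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for x_i in x]

def gcn_layer(a_hat, h, w):
    """One propagation step: ReLU(a_hat @ h @ w)."""
    out = matmul(a_hat, matmul(h, w))
    return [[max(0.0, v) for v in row] for row in out]

# Hypothetical example sentence and lexicon (assumptions, not from the paper).
sentence = "南京市长江大桥"
lexicon = {"南京", "南京市", "市长", "长江", "大桥", "长江大桥"}

nodes, edges = build_graph(sentence, lexicon)
n = len(nodes)
a_hat = normalized_adjacency(n, edges)

# Toy inputs: one-hot node features and a fixed n x 2 weight matrix.
h0 = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
w0 = [[1.0, float(k % 2)] for k in range(n)]
h1 = gcn_layer(a_hat, h0, w0)

print(len(nodes), len(edges))  # 7 character nodes + 6 word nodes, 21 edges
```

In this sketch a character such as 长 is connected to the competing word nodes 市长, 长江 and 长江大桥, so a single propagation step already mixes evidence from overlapping segmentations into its representation. A trained model would learn the weights and stack several such layers.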

Journal: Pattern Recognition Letters
Publication Date: Jul 13, 2020
Citations: 12

Similar Papers

A Data-Driven Model for Automated Chinese Word Segmentation and POS Tagging
Qing Xu ... D Plewczynski
Computational Intelligence and Neuroscience | Vol. 2022
16 Sep 2022

Chinese Word POS Tagging with Markov Logic
Zhihua Liao ... Qixian Zeng
01 Jan 2015

Dual-chain Unequal-state CRF for Chinese new word detection and POS tagging
Xiao Sun ... Degen Huang
01 Oct 2008

An error-driven word-character hybrid model for joint Chinese word segmentation and POS tagging
Canasai Kruengkrai ... Kentaro Torisawa
01 Jan 2009
