Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging

Houquan Zhou, Yang Li, Min Zhang, Zhenghua Li

doi:10.18653/v1/2022.findings-acl.259

Abstract

In recent years, large-scale pre-trained language models (PLMs) have made extraordinary progress in most NLP tasks. Yet in unsupervised POS tagging, few works utilize PLMs, and those that do fail to achieve state-of-the-art (SOTA) performance. The recent SOTA performance is achieved by a Gaussian HMM variant proposed by He et al. (2018). However, as a generative model, the HMM makes very strong independence assumptions, which makes it very challenging to incorporate contextualized word representations from PLMs. In this work, we propose, for the first time, a neural conditional random field autoencoder (CRF-AE) model for unsupervised POS tagging. The discriminative encoder of CRF-AE can straightforwardly incorporate ELMo word representations. Moreover, inspired by feature-rich HMMs, we reintroduce hand-crafted features into the decoder of CRF-AE. Finally, experiments show that our model outperforms previous state-of-the-art models by a large margin on the Penn Treebank and the multilingual Universal Dependencies treebank v2.0.
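To make the CRF-AE idea concrete, the following is a minimal sketch of the generic CRF autoencoder objective: a CRF encoder scores tag sequences given the sentence, a decoder reconstructs each word from its tag, and training maximizes the reconstruction likelihood with the tags summed out. This is not the authors' implementation. The names `enc_scores`, `trans`, and `log_recon` are hypothetical, the encoder scores are random numbers standing in for scores computed from ELMo representations, and the decoder is a plain tag-to-word probability table rather than a feature-rich one.

```python
import numpy as np

def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))

def forward_log_sum(unary, trans):
    """Log of the sum, over all tag sequences, of exp(sequence score).

    unary: (n, K) per-position tag scores; trans: (K, K) tag-to-tag scores.
    """
    alpha = unary[0]
    for t in range(1, len(unary)):
        # alpha[j] = logsumexp_i(alpha[i] + trans[i, j]) + unary[t, j]
        alpha = logsumexp(alpha[:, None] + trans, axis=0) + unary[t]
    return logsumexp(alpha, axis=0)

def crf_ae_log_likelihood(enc_scores, trans, log_recon, words):
    """log sum_y p(y | x) * prod_i p(x_i | y_i), with tags y marginalized out.

    enc_scores: (n, K) encoder scores (assumed to come from a contextual encoder).
    log_recon:  (K, V) decoder table, log p(word | tag), each row normalized.
    words:      (n,) word ids of the sentence to reconstruct.
    """
    recon = log_recon[:, words].T  # (n, K): log p(word_i | tag)
    numerator = forward_log_sum(enc_scores + recon, trans)
    log_partition = forward_log_sum(enc_scores, trans)
    return numerator - log_partition

# Toy sentence: 4 words, 3 tags, vocabulary of 5 word types (all values random).
rng = np.random.default_rng(0)
n, K, V = 4, 3, 5
enc_scores = rng.normal(size=(n, K))
trans = rng.normal(size=(K, K))
logits = rng.normal(size=(K, V))
log_recon = logits - logsumexp(logits, axis=1)[:, None]
words = np.array([0, 3, 1, 3])

ll = crf_ae_log_likelihood(enc_scores, trans, log_recon, words)
print(ll)
```

Because the decoder factorizes over positions, its log-probabilities can be folded into the unary scores, so the numerator and the partition function share the same forward recursion. In a real model the negated value would serve as the training loss for both encoder and decoder parameters.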

Similar Papers

Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging
11 May 2022

A Multi-tasking and Multi-stage Chinese Minority Pre-trained Language Model
Bin Li ... Bin Sun
01 Jan 2021

Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI
Mohamad Ballout ... Kai-Uwe Kühnberger
Procedia Computer Science | VOL. 222
01 Jan 2023

JointMatcher: Numerically-aware entity matching using pre-trained language models with attention concentration
Chen Ye ... Guojun Dai
Knowledge-Based Systems | VOL. 251
16 May 2022
