Abstract

Cross-lingual transfer is an effective way to build syntactic analysis tools for low-resource languages. However, transfer is difficult for typologically distant languages, especially when neither annotated target data nor parallel corpora are available. In this paper, we focus on methods for cross-lingual transfer to distant languages and propose to learn a generative model with a structured prior that utilizes labeled source data and unlabeled target data jointly. The parameters of the source and target models are softly shared through a regularized log-likelihood objective. An invertible projection is employed to learn a new interlingual latent embedding space that compensates for imperfect cross-lingual word embedding input. We evaluate our method on two syntactic tasks: part-of-speech (POS) tagging and dependency parsing. On the Universal Dependency Treebanks, we use English as the only source corpus and transfer to a wide range of target languages. On the 10 languages in this dataset that are distant from English, our method yields an average absolute improvement of 5.2% on POS tagging and 8.3% on dependency parsing over a direct transfer method that uses state-of-the-art discriminative models.
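
As a rough sketch of what such a regularized objective can look like (our own notation; the exact form is not given in this summary), let θ_s and θ_t denote the source-model and target-model parameters. One plausible instantiation is

    \mathcal{L}(\theta_s, \theta_t) = \log p(x_s, y_s; \theta_s) + \log p(x_t; \theta_t) - \lambda \, \lVert \theta_s - \theta_t \rVert^2

where the first term is the supervised likelihood of labeled source sentences, the second is the marginal likelihood of unlabeled target sentences under the generative model, and the penalty softly ties the two parameter sets, with λ controlling how far the target parameters may drift from those learned on the source.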

Highlights

  • Current top-performing systems for syntactic analysis tasks such as part-of-speech (POS) tagging and dependency parsing rely heavily on large-scale annotated data (Huang et al., 2015; Dozat and Manning, 2017; Ma et al., 2018).

  • We describe how to apply this method to two syntactic analysis tasks: POS tagging with a hidden Markov model (HMM) prior and dependency parsing with a dependency model with valence (DMV) prior (a minimal sketch of the tagging case follows this list).

  • Unsupervised adaptation helps less when transferring to nearby languages (a 5.9% improvement over Flow-Fix, versus 11.3% on distant languages). We posit that this is because a large portion of linguistic knowledge is shared between similar languages and the cross-lingual word embeddings are of better quality in this case, so unsupervised adaptation becomes less necessary.

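The HMM case can be made concrete with a short sketch. The following is a minimal illustration under our own assumptions (PyTorch, fixed pretrained cross-lingual embeddings as emissions, an L2 penalty standing in for the soft parameter sharing); it is not the authors' code and it omits both the invertible projection and the DMV parsing model.

    # Minimal illustrative sketch (not the authors' implementation): a Gaussian-emission
    # HMM prior over fixed cross-lingual word embeddings, trained with a regularized joint
    # objective -- supervised likelihood on labeled source sentences, marginal likelihood
    # on unlabeled target sentences, and an L2 penalty that softly ties the parameter sets.
    import math
    import torch

    class GaussianHMM(torch.nn.Module):
        def __init__(self, num_tags, emb_dim):
            super().__init__()
            self.start = torch.nn.Parameter(torch.zeros(num_tags))            # initial-tag logits
            self.trans = torch.nn.Parameter(torch.zeros(num_tags, num_tags))  # transition logits
            self.means = torch.nn.Parameter(0.01 * torch.randn(num_tags, emb_dim))
            self.log_var = torch.nn.Parameter(torch.zeros(emb_dim))           # shared diagonal variance

        def emission_logp(self, embs):                  # embs: (seq_len, emb_dim)
            diff = embs.unsqueeze(1) - self.means       # (seq_len, num_tags, emb_dim)
            return -0.5 * (diff.pow(2) / self.log_var.exp()
                           + self.log_var + math.log(2 * math.pi)).sum(-1)

        def marginal_logp(self, embs):                  # forward algorithm, no tags observed
            emit = self.emission_logp(embs)
            trans = torch.log_softmax(self.trans, dim=-1)
            log_alpha = torch.log_softmax(self.start, dim=-1) + emit[0]
            for t in range(1, embs.size(0)):
                log_alpha = torch.logsumexp(log_alpha.unsqueeze(1) + trans, dim=0) + emit[t]
            return torch.logsumexp(log_alpha, dim=-1)

        def supervised_logp(self, embs, tags):          # joint likelihood of an observed tag sequence
            emit = self.emission_logp(embs)
            trans = torch.log_softmax(self.trans, dim=-1)
            logp = torch.log_softmax(self.start, dim=-1)[tags[0]] + emit[0, tags[0]]
            for t in range(1, embs.size(0)):
                logp = logp + trans[tags[t - 1], tags[t]] + emit[t, tags[t]]
            return logp

    def joint_loss(src_model, tgt_model, src_batch, tgt_batch, lam=1.0):
        """src_batch: (embeddings, tag ids) pairs; tgt_batch: embedding sequences only."""
        loss = -sum(src_model.supervised_logp(e, y) for e, y in src_batch)
        loss = loss - sum(tgt_model.marginal_logp(e) for e in tgt_batch)
        l2 = sum((p - q).pow(2).sum()
                 for p, q in zip(src_model.parameters(), tgt_model.parameters()))
        return loss + lam * l2

Maximizing the marginal likelihood on the target side pulls the emission parameters toward the target embedding distribution, while the penalty keeps the structure learned from labeled source data from being forgotten.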

Summary

Introduction

Current top-performing systems for syntactic analysis tasks such as part-of-speech (POS) tagging and dependency parsing rely heavily on large-scale annotated data (Huang et al., 2015; Dozat and Manning, 2017; Ma et al., 2018). For low-resource languages, cross-lingual transfer typically relies on representations shared across languages, such as cross-lingual word embeddings or universal POS tags. In the case of zero-shot transfer (i.e., with no target-side supervision), a common practice is to train a strong supervised system on the source language and directly apply it to the target language over these shared embedding or POS spaces. This method has demonstrated promising results for transfer to closely related target languages (Ahmad et al., 2019; Schuster et al., 2019).
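
To make the direct-transfer baseline concrete, here is a toy illustration under our own assumptions: token-level aligned cross-lingual embeddings and a simple linear classifier. The cited systems use much stronger neural taggers and parsers; the random vectors below merely stand in for real pretrained aligned embeddings.

    # Toy sketch of direct transfer for POS tagging (illustrative only, not the cited systems):
    # train a classifier on source-language tokens represented in a shared cross-lingual
    # embedding space, then apply it unchanged to target-language tokens.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    DIM = 300

    # Stand-in for real aligned cross-lingual embeddings, keyed by word form.
    vocab = ["the", "dog", "runs", "le", "chien", "court"]
    emb = {w: rng.normal(size=DIM) for w in vocab}

    def embed(tokens):
        return np.stack([emb.get(t, np.zeros(DIM)) for t in tokens])

    # Labeled source sentence (English) with universal POS tags.
    src_tokens, src_tags = ["the", "dog", "runs"], ["DET", "NOUN", "VERB"]
    clf = LogisticRegression(max_iter=1000).fit(embed(src_tokens), src_tags)

    # Zero-shot application to an unlabeled target sentence (French): no target labels are used.
    print(clf.predict(embed(["le", "chien", "court"])))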
