Choosing Transfer Languages for Cross-Lingual Learning

Yu-Hsiang Lin,Shruti Rijhwani,Graham Neubig,Antonios Anastasopoulos,Junxian He,Zirui Li,Patrick Littell,Xuezhe Ma,Yuyan Zhang,Chian-Yu Chen,Mengzhou Xia,Zhisong Zhang,Jean Lee

doi:10.18653/v1/p19-1301

Abstract

Cross-lingual transfer, where a high-resource transfer language is used to improve the accuracy of a low-resource task language, is now an invaluable tool for improving performance of natural language processing (NLP) on low-resource languages. However, given a particular task language, it is not clear which language to transfer from, and the standard strategy is to select languages based on ad hoc criteria, usually the intuition of the experimenter. Since a large number of features contribute to the success of cross-lingual transfer (including phylogenetic similarity, typological properties, lexical overlap, or size of available data), even the most enlightened experimenter rarely considers all these factors for the particular task at hand. In this paper, we consider this task of automatically selecting optimal transfer languages as a ranking problem, and build models that consider the aforementioned features to perform this prediction. In experiments on representative NLP tasks, we demonstrate that our model predicts good transfer languages much better than ad hoc baselines considering single features in isolation, and glean insights on what features are most informative for each different NLP tasks, which may inform future ad hoc selection even without use of our method. Code, data, and pre-trained models are available at https://github.com/neulab/langrank

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Choosing Transfer Languages for Cross-Lingual Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2019
Citations: 78	License type: cc-by

Similar Papers

Alternative Non-BERT Model Choices for the Textual Classification in Low-Resource Languages and Environments

-

09 Jul 2022
09 Jul 2022

Multi-Topic Categorization in a Low-Resource Ewe Language: A Modern Transformer Approach
Victor Kwaku Agbesi ... Chen Wenyu
-
Victor Kwaku Agbesi, et. al.Victor Kwaku Agbesi ... Chen Wenyu
22 Apr 2022
22 Apr 2022

Cross-Lingual Transfer for Distantly Supervised and Low-Resources Indonesian NER
Fariz Ikhwantri
-
Fariz IkhwantriFariz Ikhwantri
01 Jan 2023
01 Jan 2023

Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs
Federico Cassano ... Arjun Guha
Proceedings of the ACM on Programming Languages | VOL. 8
Federico Cassano, et. al.Federico Cassano ... Arjun Guha
08 Oct 2024
Proceedings of the ACM on Programming Languages | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Choosing Transfer Languages for Cross-Lingual Learning

Abstract

Talk to us

Similar Papers