Obtaining Better Word Representations via Language Transfer

Changliang Li, Bo Xu, Gaowei Wu, Wendong Ge, Xiuying Wang, Yan Li

doi:10.1007/978-3-642-54906-9_11

Abstract

Vector space word representations have recently had great success in improving performance across a variety of NLP tasks. However, existing methods for learning word embeddings use only a corpus in the same language (a homo-lingual corpus). Inspired by transfer learning, we propose a novel language transfer method for obtaining word embeddings. Under this method, to obtain word embeddings for one language (the target language), we instead train models on a corpus in a different language (the source language), and then use the resulting source-language word embeddings to represent the target-language words. We evaluate the word embeddings obtained by the proposed method on word similarity tasks across several benchmark datasets. The results show that our method is surprisingly effective, outperforming competitive baselines by a large margin. A further benefit of our method is that collecting a new corpus may be unnecessary.

Keywords (added by machine, not by the authors): Target Word, Language Model, Target Language, Benchmark Dataset, Transfer Learning
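The idea of representing target-language words with source-language embeddings can be illustrated with a minimal sketch. Everything below is an assumption made for illustration only: the abstract does not say how target words are mapped to source words, so the sketch uses a hypothetical bilingual dictionary (`target_to_source`), hypothetical toy vectors (`source_embeddings`), and an arbitrary French/English pair. It is not the authors' implementation.

```python
import math

# Hypothetical toy source-language embeddings. In practice these would be
# trained on a large source-language corpus.
source_embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.9, 0.4],
}

# Hypothetical bilingual dictionary (target word -> source word).
# Assumption: the abstract does not describe how this mapping is obtained.
target_to_source = {"chat": "cat", "chien": "dog", "voiture": "car"}


def transfer_embedding(target_word):
    """Represent a target-language word by its source-language embedding."""
    source_word = target_to_source.get(target_word)
    if source_word is None:
        return None  # no known translation for this target word
    return source_embeddings.get(source_word)


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def target_similarity(word_a, word_b):
    """Score a target-language word pair using the transferred embeddings."""
    vec_a = transfer_embedding(word_a)
    vec_b = transfer_embedding(word_b)
    if vec_a is None or vec_b is None:
        return None  # pair cannot be scored
    return cosine(vec_a, vec_b)
```

In a word similarity evaluation of the kind the abstract mentions, scores such as `target_similarity("chat", "chien")` would be compared against human ratings for the same word pairs, typically with a rank correlation.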

Similar Papers

Improving Word Embeddings via Combining with Complementary Languages
Changliang Li ... Xiuying Wang
01 Jan 2014

Using Word Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval
Paheli Bhattacharya ... Pawan Goyal
Computación y Sistemas | VOL. 20
30 Sep 2016

Can Network Embedding of Distributional Thesaurus Be Combined with Word Vectors for Better Representation?
Abhik Jana ... Pawan Goyal
01 Jan 2018

LSTMEmbed: Learning Word and Sense Representations from a Large Semantically Annotated Corpus with Long Short-Term Memories
Ignacio Iacobacci ... Roberto Navigli
01 Jan 2019
