Abstract

Compared with traditional methods, word embedding is an efficient language representation that can learn syntax and semantics using neural networks. As a result, a growing number of promising experiments in natural language processing (NLP) achieve state-of-the-art results by introducing word embeddings. In principle, embedding representation learning maps words to a low-dimensional vector space, and the resulting vectors can initialize NLP tasks such as text classification, sentiment analysis, and language understanding. However, polysemy is common in many languages; it causes word ambiguity and in turn degrades system accuracy. Additionally, language models based on the distributional hypothesis have mostly focused on word-level properties rather than morphology, which leads to uneven performance across different evaluations. At the same time, embedding learning and embedding evaluation are two vital components of word representation. In this paper, we overview many language models, including single-sense and multi-sense word embeddings, and many evaluation approaches, including intrinsic and extrinsic evaluation. We find that there are obvious gaps between vectors and manual annotations in word similarity evaluation, and that language models achieving good performance in intrinsic evaluations may not produce similar results in extrinsic evaluations. To the best of our knowledge, there is no universal language model or embedding learning method suited to most NLP tasks, and each evaluation also hides natural defects compared with human knowledge. We further investigate the datasets used in intrinsic and extrinsic evaluations. We believe this overview can inform the design of improved evaluation datasets and more rational evaluation methods.
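To make the intrinsic word-similarity evaluation mentioned above concrete, the following is a minimal sketch of the cosine-similarity measure typically compared against human annotations. The vectors here are hand-made toy values for illustration only; real embeddings (e.g. word2vec or GloVe) are learned from large corpora.

```python
import math

# Toy embedding table: illustrative 3-dimensional vectors only.
# Real embeddings are low-dimensional dense vectors learned by a
# neural language model.
embeddings = {
    "king":  [0.9, 0.8, 0.1],
    "queen": [0.85, 0.75, 0.2],
    "apple": [0.1, 0.2, 0.9],
}

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors, the standard score
    used in intrinsic word-similarity evaluation."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Intrinsic evaluation ranks word pairs by such model scores and
# correlates the ranking with human similarity ratings (e.g. via
# Spearman correlation on a dataset such as WordSim-353).
sim_related = cosine_similarity(embeddings["king"], embeddings["queen"])
sim_unrelated = cosine_similarity(embeddings["king"], embeddings["apple"])
print(sim_related > sim_unrelated)  # related pair should score higher
```

A large, systematic disagreement between these model scores and human ratings is exactly the kind of gap the survey reports for word similarity evaluation.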
