Fast PMI-Based Word Embedding with Efficient Use of Unobserved Patterns

Behrouz Haji Soleimani, Stan Matwin

doi:10.1609/aaai.v33i01.33017031

Abstract

Continuous word representations that capture the semantic information in a corpus are the building blocks of many natural language processing tasks. Pre-trained word embeddings are used for sentiment analysis, text classification, question answering and so on. In this paper, we propose a new word embedding algorithm that works on a smoothed Positive Pointwise Mutual Information (PPMI) matrix obtained from the word-word co-occurrence counts. One of our major contributions is an objective function and an optimization framework that exploit the full capacity of "negative examples", the unobserved or insignificant word-word co-occurrences, in order to push unrelated words away from each other, which improves the distribution of words in the latent space. We also propose a kernel similarity measure for the latent space that can effectively calculate similarities in high dimensions. Moreover, we propose an approximate alternative to our algorithm using a modified Vantage Point tree, which reduces the computational complexity of the algorithm to |V| log |V| with respect to the number of words in the vocabulary. We have trained various word embedding algorithms on Wikipedia articles with 2.1 billion tokens and show that our method outperforms the state of the art in most word similarity tasks by a good margin.
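As a rough illustration of the PPMI matrix the abstract refers to, the sketch below builds one from a small word-word co-occurrence count matrix. The abstract does not say how the smoothing is done, so the context-distribution smoothing used here (raising context counts to a power `alpha`) is an assumption borrowed from common practice, not necessarily the authors' method; the function name `ppmi_matrix` and the toy counts are hypothetical.

```python
import math


def ppmi_matrix(counts, alpha=0.75):
    """Return a PPMI matrix for a square co-occurrence count matrix.

    counts: list of lists, counts[i][j] = co-occurrences of word i with context j.
    alpha:  exponent for context-distribution smoothing (an assumption of this
            sketch; alpha=1.0 gives plain, unsmoothed PPMI).
    """
    n = len(counts)
    total = sum(sum(row) for row in counts)
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(counts[i][j] for i in range(n)) for j in range(n)]

    # Smoothed context distribution: P_alpha(c) = count(c)^alpha / sum_c' count(c')^alpha
    smoothed = [c ** alpha for c in col_sums]
    smoothed_total = sum(smoothed)

    ppmi = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if counts[i][j] == 0:
                continue  # unobserved pair: PMI would be -inf, so PPMI stays 0
            p_ij = counts[i][j] / total
            p_i = row_sums[i] / total
            p_j = smoothed[j] / smoothed_total
            # Positive PMI: clip negative associations to zero
            ppmi[i][j] = max(0.0, math.log(p_ij / (p_i * p_j)))
    return ppmi


# Toy symmetric counts for a three-word vocabulary (made-up numbers)
toy_counts = [
    [0, 2, 0],
    [2, 0, 1],
    [0, 1, 0],
]
toy_ppmi = ppmi_matrix(toy_counts)
```

Every zero entry of the result corresponds to an unobserved or insignificant pair; these are the "negative examples" that the abstract says the proposed objective exploits, although how the objective uses them is not shown here.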

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 3

Similar Papers

A knowledge-enriched ensemble method for word embedding and multi-sense embedding
Lanting Fang ... Kaiqi Zhao
IEEE Transactions on Knowledge and Data Engineering | VOL. -
01 Jan 2021

Learning class-specific word embeddings
Sicong Kuang ... Brian D Davison
The Journal of Supercomputing | VOL. 76
23 Oct 2019

Spectral Word Embedding with Negative Sampling
Behrouz Haji Soleimani ... Stan Matwin
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 32
27 Apr 2018

Pre-Trained Multi-View Word Embedding Using Two-Side Neural Network
Yong Luo ... Jun Yan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 28
21 Jun 2014