Abstract

Pre-trained word embeddings are used in several downstream applications as well as for constructing representations for sentences, paragraphs and documents. Recently, there has been an emphasis on improving pre-trained word vectors through post-processing algorithms. One improvement area is reducing the dimensionality of word embeddings. Reducing the size of word embeddings can improve their utility in memory-constrained devices, benefiting several real-world applications. In this work, we present a novel technique that efficiently combines PCA-based dimensionality reduction with a recently proposed post-processing algorithm (Mu and Viswanath, 2018) to construct effective word embeddings of lower dimensions. Empirical evaluations on several benchmarks show that our algorithm efficiently reduces the embedding size while achieving similar or (more often) better performance than the original embeddings. We have released the source code along with this paper.

Highlights

  • Word embeddings such as GloVe (Pennington et al., 2014) and word2vec Skip-Gram (Mikolov et al., 2013), obtained from unlabeled text corpora, can represent words as distributed, dense, real-valued, low-dimensional vectors that geometrically capture the semantic ‘meaning’ of a word

  • Spearman’s rank correlation coefficient (ρ × 100) between the rankings produced by the word vectors and the human rankings is used for the evaluation (a minimal sketch of this evaluation follows the highlights list)

  • Evaluation Results: First, we evaluate our algorithm against the 3 baselines on the same embeddings; then, we evaluate it across word embeddings of different dimensions and different types

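The following minimal sketch illustrates the word-similarity evaluation mentioned above: cosine similarities between embedding pairs are ranked against human judgements with Spearman's ρ. The names `evaluate_similarity`, `embeddings`, `word_pairs` and `human_scores` are illustrative assumptions and are not taken from the released source code.

    # Illustrative sketch of the word-similarity evaluation described above.
    # Names such as `embeddings`, `word_pairs` and `human_scores` are assumptions.
    import numpy as np
    from scipy.stats import spearmanr

    def cosine(u, v):
        return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

    def evaluate_similarity(embeddings, word_pairs, human_scores):
        """Spearman's rho (x 100) between model similarities and human rankings."""
        model_scores = [cosine(embeddings[a], embeddings[b]) for a, b in word_pairs]
        rho, _ = spearmanr(model_scores, human_scores)
        return rho * 100
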

Summary

Introduction

Word embeddings such as GloVe (Pennington et al., 2014) and word2vec Skip-Gram (Mikolov et al., 2013), obtained from unlabeled text corpora, can represent words as distributed, dense, real-valued, low-dimensional vectors that geometrically capture the semantic ‘meaning’ of a word. These embeddings capture several linguistic regularities such as analogy relationships. A major issue with word embeddings is their size (Ling et al., 2016); e.g., loading a word embedding matrix of 2.5 M tokens takes up to 6 GB of memory (for 300-dimensional vectors, on a 64-bit system). In this work, we combine a simple dimensionality reduction technique, PCA, with the post-processing technique of Mu and Viswanath (2018), as discussed above.
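The 6 GB figure is consistent with storing 2.5 M vectors of 300 double-precision (8-byte) values each: 2.5 × 10^6 × 300 × 8 B = 6 × 10^9 B. The sketch below illustrates one plausible way of combining PCA with such a post-processing step (subtracting the mean and removing projections onto the top principal components, in the spirit of Mu and Viswanath, 2018). The ordering of the steps, the number of removed components and all names are assumptions made for illustration; the released source code is authoritative.

    # Hedged sketch: post-process, reduce with PCA, post-process again.
    # The exact pipeline and defaults (e.g. `n_removed`, `target_dim`) are
    # illustrative assumptions, not a reproduction of the released code.
    import numpy as np
    from sklearn.decomposition import PCA

    def post_process(X, n_removed=7):
        """Subtract the mean and remove projections on the top principal components."""
        X = X - X.mean(axis=0)
        top = PCA(n_components=n_removed).fit(X).components_  # shape (n_removed, dim)
        return X - X @ top.T @ top

    def reduce_embeddings(X, target_dim=150, n_removed=7):
        """One plausible ordering: post-process, PCA-reduce, post-process."""
        X = post_process(X, n_removed)
        X = PCA(n_components=target_dim).fit_transform(X)
        return post_process(X, n_removed)

Applied to 300-dimensional embeddings, a pipeline of this shape would yield 150-dimensional vectors at roughly half the original memory footprint.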
