Boosted negative sampling by quadratically constrained entropy maximization

Taygun Kekeç,David Mimno,David M.J Tax

doi:10.1016/j.patrec.2019.04.027

Abstract

Learning probability densities for natural language representations is a difficult problem because language is inherently sparse and high-dimensional. Negative sampling is a popular and effective way to avoid intractable maximum likelihood problems, but it requires correct specification of the sampling distribution. Previous state of the art methods rely on heuristic distributions that appear to do well in practice. In this work, we define conditions for optimal sampling distributions and demonstrate how to approximate them using Quadratically Constrained Entropy Maximization(QCEM). Our analysis shows that state of the art heuristics are restrictive approximations to our proposed framework. To demonstrate the merits of our formulation, we apply QCEM to matching synthetic exponential family distributions and to finding high dimensional word embedding vectors for English. We are able to achieve faster inference on synthetic experiments and improve the correlation on semantic similarity evaluations on the Rare Words dataset by 4.8%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Boosted negative sampling by quadratically constrained entropy maximization

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: May 1, 2019
Citations: 2

Similar Papers

Leveraging High Dimensional Spatial Graph Embedding as a Heuristic for Graph Algorithms
Peter Oostema ... Franz Franchetti
-
Peter Oostema, et. al.Peter Oostema ... Franz Franchetti
01 Jun 2021
01 Jun 2021

Entity Type Recognition Using an Ensemble of Distributional Semantic Models to Enhance Query Understanding
Walid Shalaby ... Trey Grainger
-
Walid Shalaby, et. al.Walid Shalaby ... Trey Grainger
04 Apr 2016
04 Apr 2016

UPSNet: Universal Point Cloud Sampling Network Without Knowing Downstream Tasks
Fujing Tian ... Wenxu Tao
Information Technology and Control | VOL. 51
Fujing Tian, et. al.Fujing Tian ... Wenxu Tao
12 Dec 2022
Information Technology and Control | VOL. 51

Brain Inspired One Shot Learning Method for HD Computing
Devika R Nair ... A Purushothaman
-
Devika R Nair, et. al.Devika R Nair ... A Purushothaman
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Boosted negative sampling by quadratically constrained entropy maximization

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters