Abstract

Learning by contrasting positive and negative samples is a general strategy adopted by many methods. Noise contrastive estimation (NCE) for word embeddings and translating embeddings for knowledge graphs are examples in NLP employing this approach. In this work, we view contrastive learning as an abstraction of all such methods and augment the negative sampler into a mixture distribution containing an adversarially learned sampler. The resulting adaptive sampler finds harder negative examples, which forces the main model to learn a better representation of the data. We evaluate our proposal on learning word embeddings, order embeddings and knowledge graph embeddings and observe both faster convergence and improved results on multiple metrics.
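
As a concrete reading of the mixture sampler described above, the following minimal sketch draws a negative from the fixed NCE noise distribution with probability λ and from the learned conditional sampler g_θ(y|x) otherwise. This is not the authors' code; `unigram_probs`, `generator`, and the default λ = 0.5 are illustrative assumptions.

```python
import numpy as np

def sample_negative(x, unigram_probs, generator, lam=0.5, rng=np.random):
    """Draw one negative y for input x from the mixture
    lam * p_nce(y) + (1 - lam) * g_theta(y | x)."""
    if rng.random() < lam:
        # Fixed, unconditional NCE noise (e.g. a unigram distribution over the vocabulary).
        return rng.choice(len(unigram_probs), p=unigram_probs)
    # Learned conditional sampler g_theta(y | x); `generator` is assumed to
    # return a probability vector over the vocabulary for the input x.
    cond_probs = generator(x)
    return rng.choice(len(cond_probs), p=cond_probs)
```

Mixing in the fixed NCE noise keeps some coverage of the whole vocabulary even if the learned sampler concentrates on a few hard candidates (see "Improving exploration in gθ by leveraging NCE samples" below).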

Highlights

  • Many models learn by contrasting losses on observed positive examples with those on some fictitious negative examples, trying to decrease some score on positive ones while increasing it on negative ones

  • To remedy the above-mentioned problem of a fixed unconditional negative sampler, we propose to augment it into a mixture, λ p_nce(y) + (1 − λ) g_θ(y|x), where g_θ is a conditional distribution with a learnable parameter θ and λ is a hyperparameter (a sketch of how g_θ can be trained follows this list)

  • We evaluate models trained from scratch as well as fine-tuned GloVe models (Pennington et al., 2014) on word similarity tasks, which consist of computing the similarity between pairs of words and comparing it against human-annotated similarity scores
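
Because negatives sampled from g_θ are discrete, gradients cannot flow through the samples into θ directly; the sections listed below ("Learning the generator", "Entropy and training stability", "Variance Reduction") address how the generator is trained. The following is a minimal policy-gradient (REINFORCE-style) sketch in PyTorch under that reading; using the main model's loss on the sampled negative as the reward, the `baseline` term, and the entropy weight are illustrative assumptions rather than the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def generator_step(gen_logits, hardness, optimizer,
                   baseline=0.0, entropy_weight=0.01):
    """One REINFORCE-style update of the negative-sample generator.

    gen_logits: unnormalized scores of g_theta(. | x), shape (vocab_size,),
        produced by the generator network for the current positive input x.
    hardness:   callable mapping a sampled negative index to a scalar reward,
        e.g. the main model's loss on the corrupted example (assumption).
    baseline:   variance-reduction term, e.g. a running mean of past rewards.
    """
    probs = F.softmax(gen_logits, dim=-1)
    y_neg = torch.multinomial(probs, num_samples=1)        # discrete negative sample
    log_prob = torch.log(probs[y_neg] + 1e-8)
    reward = hardness(y_neg).detach()                      # no gradient through the main model
    entropy = -(probs * torch.log(probs + 1e-8)).sum()     # keeps g_theta from collapsing
    # Maximize expected reward plus entropy == minimize the negated objective.
    loss = (-(reward - baseline) * log_prob).sum() - entropy_weight * entropy

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return reward.item()
```

In practice, sampled negatives that are actually true pairs would also need to be filtered out, as the "Handling false negatives" section below discusses.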

Summary

Introduction

Many models learn by contrasting losses on observed positive examples with those on some fictitious negative examples, trying to decrease some score on positive ones while increasing it on negative ones. In noise contrastive estimation for word embeddings, a negative example is formed by replacing a component of a positive pair with a word randomly sampled from the vocabulary, resulting in a fictitious word-context pair that would be unlikely to appear in the dataset. This negative sampling by corruption approach is also used in learning knowledge graph embeddings (Bordes et al., 2013; Lin et al., 2015; Ji et al., 2015; Wang et al., 2014; Trouillon et al., 2016; Yang et al., 2014; Dettmers et al., 2017), order embeddings (Vendrov et al., 2016), caption generation (Dai and Lin, 2017), and other tasks. We demonstrate the efficacy and generality of the proposed method on three different learning tasks: word embeddings (Mikolov et al., 2013), order embeddings (Vendrov et al., 2016), and knowledge graph embeddings (Ji et al., 2015).
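
Negative sampling by corruption, as described above, can be sketched as follows. Function and variable names are illustrative; NCE for word embeddings typically draws the replacement from a unigram noise distribution rather than uniformly, and knowledge-graph methods corrupt either the head or the tail of a triple.

```python
import random

def corrupt_pair(word, context, vocab, rng=random):
    """Replace the context of a positive (word, context) pair with a word
    sampled from the vocabulary, giving a fictitious negative pair."""
    return word, rng.choice(vocab)

def corrupt_triple(head, relation, tail, num_entities, rng=random):
    """Replace the head or the tail of a knowledge-graph triple (h, r, t)
    with a randomly sampled entity, as in Bordes et al. (2013)."""
    if rng.random() < 0.5:
        return rng.randrange(num_entities), relation, tail   # corrupt head
    return head, relation, rng.randrange(num_entities)       # corrupt tail
```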

Background: contrastive learning
Adversarial mixture noise
Learning the generator
Entropy and training stability
Handling false negatives
Variance Reduction
Improving exploration in gθ by leveraging NCE samples
Related Work
Word Embeddings
Order Embeddings Hypernym Prediction
Knowledge Graph Embeddings
Experiments
Training Word Embeddings from scratch
Finetuning Word Embeddings
Hypernym Prediction
Ablation Study and Improving TransD
Limitations
Hard Negative Analysis
Findings
Conclusion
