Abstract

Analysis of word embedding properties to inform their use in downstream NLP tasks has largely been studied by assessing nearest neighbors. However, geometric properties of the continuous feature space contribute directly to the use of embedding features in downstream models, and are largely unexplored. We consider four properties of word embedding geometry, namely: position relative to the origin, distribution of features in the vector space, global pairwise distances, and local pairwise distances. We define a sequence of transformations to generate new embeddings that expose subsets of these properties to downstream models and evaluate change in task performance to understand the contribution of each property to NLP models. We transform publicly available pretrained embeddings from three popular toolkits (word2vec, GloVe, and FastText) and evaluate on a variety of intrinsic tasks, which model linguistic information in the vector space, and extrinsic tasks, which use vectors as input to machine learning models. We find that intrinsic evaluations are highly sensitive to absolute position, while extrinsic tasks rely primarily on local similarity. Our findings suggest that future embedding models and post-processing techniques should focus primarily on similarity to nearby points in vector space.

Highlights

  • Learned vector representations of words, known as word embeddings, have become ubiquitous throughout natural language processing (NLP) applications

  • The transformations are: affine transformation, which obfuscates the original position of the origin; cosine distance encoding, which obfuscates the original distribution of feature values in R^d; nearest neighbor encoding, which obfuscates global pairwise distances; and random encoding, which obfuscates the remaining local pairwise distances (a sketch of all four follows these highlights)

  • In order to measure the contributions of each geometric aspect described in Section 3 to the utility of word embeddings as input features, we evaluate embeddings transformed using our sequence of operations on a battery of standard intrinsic evaluations, which model linguistic information directly in the vector space, and extrinsic evaluations, which use the embeddings as input to learned models for downstream applications
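To make the sequence concrete, below is a minimal numpy sketch of the four transformations, written from the descriptions above. The function names, the toy embedding matrix, and the standalone (rather than composed) application of each transformation are illustrative choices, not the authors' code.

    import numpy as np

    rng = np.random.default_rng(0)

    def affine(E):
        # Random rotation (orthogonal Q from a QR decomposition) plus a random
        # translation: pairwise distances survive, the origin's position does not.
        d = E.shape[1]
        Q, _ = np.linalg.qr(rng.normal(size=(d, d)))
        return E @ Q + rng.normal(size=d)

    def cosine_distance_encoding(E):
        # Re-represent each word by its cosine distances to every vocabulary
        # word: the original distribution of feature values in R^d is discarded.
        U = E / np.linalg.norm(E, axis=1, keepdims=True)
        return 1.0 - U @ U.T

    def nearest_neighbor_encoding(E, k=2):
        # Binary indicators of each word's k nearest neighbors by cosine
        # distance: local neighborhoods are kept, global distances are not.
        D = cosine_distance_encoding(E)
        np.fill_diagonal(D, np.inf)              # a word is not its own neighbor
        nbrs = np.argsort(D, axis=1)[:, :k]
        out = np.zeros(D.shape)
        np.put_along_axis(out, nbrs, 1.0, axis=1)
        return out

    def random_encoding(E):
        # Random vectors of the same shape: every geometric property is discarded.
        return rng.normal(size=E.shape)

    E = rng.normal(size=(6, 4))                  # toy 6-word, 4-dimensional embedding
    for transform in (affine, cosine_distance_encoding,
                      nearest_neighbor_encoding, random_encoding):
        print(transform.__name__, transform(E).shape)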

Summary

Introduction

Learned vector representations of words, known as word embeddings, have become ubiquitous throughout natural language processing (NLP) applications. Analysis of these embeddings has typically proceeded by assessing nearest neighbors, an approach intended to evaluate the semantic content of embedding spaces, as opposed to characteristics of the feature space itself. Geometric analysis offers another recent angle from which to understand the properties of word embeddings, both in terms of their distribution (Mimno and Thompson, 2017) and their correlation with downstream performance (Chandrahas et al., 2018). Through such geometric investigations, neighborhood-based semantic characterizations are augmented with information about the continuous feature space of an embedding. Geometric features also offer a more direct connection to the assumptions neural models make about continuity in their input spaces (Szegedy et al., 2014), as well as to recent contextualized representation methods built on continuous language models (Peters et al., 2018; Devlin et al., 2018).
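As a concrete example of the intrinsic, vector-space-based evaluation discussed above, the sketch below scores a word-similarity test set by rank-correlating model cosine similarities with human ratings. The tiny vocabulary and ratings are invented for illustration, and scipy is assumed to be available; this is a generic word-similarity protocol, not the paper's exact benchmark suite.

    import numpy as np
    from scipy.stats import spearmanr

    def cosine(u, v):
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

    def word_similarity_eval(vectors, rated_pairs):
        # Intrinsic evaluation operates on the vector space directly:
        # rank-correlate cosine similarities with human similarity judgments.
        model, human = [], []
        for w1, w2, rating in rated_pairs:
            if w1 in vectors and w2 in vectors:  # skip out-of-vocabulary pairs
                model.append(cosine(vectors[w1], vectors[w2]))
                human.append(rating)
        return spearmanr(model, human).correlation

    rng = np.random.default_rng(1)
    vectors = {w: rng.normal(size=50) for w in ("cat", "dog", "truck", "car")}
    pairs = [("cat", "dog", 8.0), ("car", "truck", 7.5),
             ("cat", "car", 2.0), ("dog", "truck", 1.5)]
    print(word_similarity_eval(vectors, pairs))  # Spearman's rho on the toy set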

