Abstract
Neural word embeddings have become a fundamental resource for tackling many applications in artificial intelligence (AI) research. They have been shown to capture high-quality syntactic and semantic relationships in a vector space. Despite their significant impact, neural word embeddings have several disadvantages. In this paper, we focus on two issues affecting well-trained word embeddings: (i) their massive memory requirement and (ii) their inability to handle out-of-vocabulary (OOV) words. To overcome these two issues, we propose a method that reconstructs pre-trained word embeddings from subword information, representing a large number of subword embeddings in a considerably smaller fixed space while preventing quality degradation relative to the original word embeddings. The key techniques of our method are twofold: memory-shared embeddings and a variant of the key-value-query self-attention mechanism. Our experiments show that the reconstructed subword-based word embeddings successfully imitate well-trained word embeddings in a small fixed space without quality degradation across several linguistic benchmark datasets, and can simultaneously predict effective embeddings for OOV words. We also demonstrate the effectiveness of our reconstruction method when it is applied to downstream tasks, such as named entity recognition and natural language inference.
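As a rough illustration of the two key techniques named in the abstract, the sketch below combines a small shared embedding memory (many subwords hash onto the same rows) with a key-value-query attention step to produce a vector for any word, including OOV words, from its character n-grams. The memory size, n-gram range, hashing scheme, and random parameters are illustrative assumptions, not the paper's actual configuration.

```python
# Minimal sketch (not the paper's exact architecture): reconstruct a word vector
# from its subwords using (i) a small shared embedding memory indexed by hashing
# and (ii) a key-value-query attention step. All sizes below are assumptions.
import numpy as np

rng = np.random.default_rng(0)

MEMORY_SLOTS = 1000   # assumed size of the shared memory, much smaller than the vocabulary
DIM = 300             # embedding dimensionality, as in GloVe/fastText

# Memory-shared embeddings: many subwords map onto the same small tables of rows.
key_memory = rng.normal(size=(MEMORY_SLOTS, DIM))
value_memory = rng.normal(size=(MEMORY_SLOTS, DIM))
query_vector = rng.normal(size=DIM)  # would be learned in a real model

def char_ngrams(word, n_min=3, n_max=5):
    """Character n-grams with boundary markers, similar in spirit to fastText."""
    w = f"<{word}>"
    return [w[i:i + n] for n in range(n_min, n_max + 1)
            for i in range(len(w) - n + 1)]

def slot(subword):
    """Hash a subword into the shared memory (collisions are intended)."""
    return hash(subword) % MEMORY_SLOTS

def reconstruct(word):
    """Attention-weighted sum of shared value rows selected by the word's subwords."""
    idx = np.array([slot(s) for s in char_ngrams(word)])
    keys, values = key_memory[idx], value_memory[idx]
    scores = keys @ query_vector / np.sqrt(DIM)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ values  # a (DIM,)-dimensional vector

print(reconstruct("unobtainium").shape)  # works even for words never seen in training
```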
Highlights
Machine-readable representation of word meanings is one of the essential tools for tackling natural language understanding by computers. A recent trend is to embed word meanings into a vector space by using rapidly developing neural word embedding methods, such as Skip-gram [1], GloVe [2], and fastText [3].
The subword-based approach can greatly mitigate the OOV word issue. We extend this approach to simultaneously reduce the total number of embedding vectors by reconstructing word embeddings from subwords.
We experimentally show that our reconstructed subword-based embeddings can successfully imitate well-trained word embeddings, such as fastText.600B and GloVe.840B, in a small fixed space while preventing quality degradation across several linguistic benchmark datasets from word similarity and analogy tasks.
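To make the "small fixed space" claim concrete, here is some back-of-the-envelope arithmetic comparing a full word-embedding table with a fixed shared subword memory. The vocabulary size and slot count are made-up round numbers for illustration, not the settings used in the paper.

```python
# Rough memory arithmetic behind the fixed-space idea (illustrative numbers only).
DIM, BYTES = 300, 4                      # float32 vectors
full_vocab = 2_000_000                   # e.g., a crawl-scale vocabulary
shared_slots = 500_000                   # fixed subword memory, independent of vocabulary size

full_mb = full_vocab * DIM * BYTES / 1e6
shared_mb = shared_slots * DIM * BYTES / 1e6
print(f"full table: {full_mb:.0f} MB, shared memory: {shared_mb:.0f} MB")
```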
Summary
A recent trend is to embed word meanings into a vector space by using rapidly developing neural word embedding methods, such as Skip-gram [1], GloVe [2], and fastText [3]. The basic idea behind constructing such a vector space model derives from the intuition that similar words tend to appear in similar contexts [4]. These methods have been shown to capture high-quality syntactic and semantic relationships in a vector space. Studies in compositional semantics have revealed that simple calculations over embedding vectors, such as addition and the inner product, can be considered satisfactory approximations of composed word meaning and word similarity, respectively [1]. Pre-trained word embeddings, especially those trained on a vast amount of text data such as the Common Crawl corpus, have become a fundamental resource for many applications in AI research.
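A tiny, self-contained illustration of the compositionality point above: vector addition approximates composed meaning, and the normalized inner product (cosine) approximates similarity. The hand-crafted three-dimensional vectors below are purely illustrative stand-ins for real pre-trained embeddings such as GloVe or fastText.

```python
# Addition as composition, normalized inner product as similarity (toy vectors).
import numpy as np

emb = {
    "king":  np.array([0.8, 0.7, 0.1]),
    "queen": np.array([0.8, 0.1, 0.7]),
    "man":   np.array([0.1, 0.9, 0.1]),
    "woman": np.array([0.1, 0.1, 0.9]),
}

def cosine(a, b):
    """Normalized inner product, the usual similarity measure between word vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Additive composition: king - man + woman should land near queen.
composed = emb["king"] - emb["man"] + emb["woman"]
print(cosine(composed, emb["queen"]))   # high
print(cosine(composed, emb["man"]))     # much lower
```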