Abstract
In recent years, there has been increasing interest in learning distributed representations of word senses. Traditional context-clustering-based models usually require careful tuning of model parameters and typically perform worse on infrequent word senses. This paper presents a novel approach that addresses these limitations by first initializing the word sense embeddings through learning sentence-level embeddings from WordNet glosses using a convolutional neural network. The initialized word sense embeddings are then used by a context clustering model to generate the distributed representations of word senses. Our learned representations outperform the publicly available embeddings on half of the metrics in the word similarity task and on 6 of the 13 subtasks in the analogical reasoning task, and give the best overall accuracy in the word sense effect classification task, which shows the effectiveness of our proposed distributed representation learning model.
Highlights
The representation of knowledge has become a focal area in natural language processing
The main contributions of this work are as follows: (1) we propose to use a sentence composition model to capture word sense from a knowledge base, e.g., WordNet; (2) while previous approaches to sense vector clustering often adopted random initialization, we propose to initialize the sense vectors and the number of sense clusters with the word sense knowledge learned from WordNet for better clustering results; (3) we further verify our learned distributed word sense representations on three different tasks: word similarity measurement, analogical reasoning, and word sense effect classification
To deal with this problem, we propose to incorporate the sense vectors learned from WordNet glosses through convolutional neural network (CNN) composition as prior knowledge into a context clustering model such as the MSSG model proposed by Neelakantan et al. [4]
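The gloss composition step can be pictured as follows. This is a minimal sketch, not the authors' code: tokens of a WordNet gloss are embedded, a 1-D convolution slides over token windows, and max-pooling yields one fixed-size sense vector. All dimensions, names, and the random parameters are illustrative assumptions.

```python
# Sketch of CNN-based sentence composition over a WordNet gloss:
# embed tokens, convolve over a sliding window, max-pool into one
# sense vector. Sizes and weights are illustrative, not the paper's.
import numpy as np

rng = np.random.default_rng(0)
DIM, WINDOW, VOCAB = 50, 3, 1000

# Hypothetical word-embedding table and convolution filter bank.
word_emb = rng.standard_normal((VOCAB, DIM))
conv_w = rng.standard_normal((DIM, WINDOW * DIM)) * 0.01

def gloss_to_sense_vector(token_ids):
    """Compose a gloss (a list of token ids) into one sense vector."""
    x = word_emb[token_ids]                        # (n_tokens, DIM)
    windows = [x[i:i + WINDOW].ravel()             # width-3 sliding windows
               for i in range(len(x) - WINDOW + 1)]
    feats = np.tanh(np.stack(windows) @ conv_w.T)  # (n_windows, DIM)
    return feats.max(axis=0)                       # max-pool over windows

sense_vec = gloss_to_sense_vector([5, 17, 42, 7, 99])
print(sense_vec.shape)  # (50,)
```

The max-pooling step is what makes the output length-independent, so glosses of different lengths all map to vectors in the same space as the word embeddings.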
Summary
The representation of knowledge has become a focal area in natural language processing. A word sense is represented as a dense, real-valued vector. To this end, most existing approaches adopted a cluster-based paradigm, which produces different sense vectors for each polysemous or homonymous word by clustering the contexts of the target words. We modify the MSSG algorithm for context clustering by initializing the sense vectors with the representations learned by our CNN-based sentence composition model. The main contributions of this work are as follows: (1) we propose to use a sentence composition model to capture word sense from a knowledge base, e.g., WordNet; (2) while previous approaches to sense vector clustering often adopted random initialization, we propose to initialize the sense vectors and the number of sense clusters with the word sense knowledge learned from WordNet for better clustering results; (3) we further verify our learned distributed word sense representations on three different tasks: word similarity measurement, analogical reasoning, and word sense effect classification.
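The clustering modification can be illustrated with a toy sketch (assumptions throughout, not the MSSG implementation): the sense clusters of a target word start from gloss-derived vectors rather than random centers, each occurrence's context vector is assigned to the nearest cluster by cosine similarity, and that cluster's center is nudged toward the context.

```python
# Toy sketch of gloss-initialized context clustering: pick the sense
# cluster closest (by cosine similarity) to a context vector and
# update its center. Vectors, dimensions, and the learning rate are
# illustrative assumptions.
import numpy as np

def assign_and_update(sense_vectors, context_vec, lr=0.1):
    """Return the index of the nearest sense cluster and move its
    center slightly toward the observed context vector."""
    norms = np.linalg.norm(sense_vectors, axis=1) * np.linalg.norm(context_vec)
    sims = sense_vectors @ context_vec / np.maximum(norms, 1e-8)
    k = int(np.argmax(sims))
    sense_vectors[k] += lr * (context_vec - sense_vectors[k])
    return k

# Two gloss-initialized senses for a word like "bank" (toy 4-d vectors).
senses = np.array([[1.0, 0.0, 0.0, 0.0],    # e.g., river-bank sense
                   [0.0, 1.0, 0.0, 0.0]])   # e.g., financial sense
context = np.array([0.9, 0.1, 0.0, 0.0])    # context vector of one occurrence
print(assign_and_update(senses, context))   # → 0 (nearest sense)
```

Starting from gloss-derived centers also fixes the number of clusters to the number of WordNet senses, which is the point of contribution (2): the model no longer has to guess the cluster count or recover from a poor random start.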