Embedding Senses for Efficient Graph-based Word Sense Disambiguation

Luis Nieto Piña, Richard Johansson

doi:10.18653/v1/w16-1401

Abstract

We propose a simple graph-based method for word sense disambiguation (WSD) where sense and context embeddings are constructed by applying the Skip-gram method to random walks over the sense graph. We used this method to build a WSD system for Swedish using the SALDO lexicon, and evaluated it on six different annotated test sets. In all cases, our system was several orders of magnitude faster than a state-of-the-art PageRank-based system, while outperforming a random baseline soundly.
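As a rough illustration of the first step described above, the sketch below generates random walks over a small sense graph and collects them as a synthetic corpus. The toy graph, the sense identifiers, the function names, and the walk parameters are all assumptions made for this example; they are not taken from SALDO or from the authors' system.

```python
import random

# Hypothetical toy sense graph: each sense ID maps to its neighbouring senses.
# The identifiers are invented for illustration and are not SALDO entries.
TOY_GRAPH = {
    "rock..1": ["stone..1", "mountain..1"],
    "rock..2": ["music..1", "band..1"],
    "stone..1": ["rock..1", "mountain..1"],
    "mountain..1": ["rock..1", "stone..1"],
    "music..1": ["rock..2", "band..1"],
    "band..1": ["rock..2", "music..1"],
}


def random_walk(graph, start, length, rng):
    """Return one walk of at most `length` nodes, starting at `start`."""
    walk = [start]
    while len(walk) < length:
        neighbours = graph[walk[-1]]
        if not neighbours:
            break  # dead end: stop the walk early
        walk.append(rng.choice(neighbours))
    return walk


def build_corpus(graph, walks_per_node, length, seed=0):
    """Collect several walks per node; each walk acts as one 'sentence'."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    corpus = []
    for node in sorted(graph):
        for _ in range(walks_per_node):
            corpus.append(random_walk(graph, node, length, rng))
    return corpus


corpus = build_corpus(TOY_GRAPH, walks_per_node=3, length=5)
```

Each walk in `corpus` is a list of sense identifiers, so it could be passed as a sentence to any Skip-gram trainer to obtain sense embeddings. The training step itself is omitted here because this summary does not give its settings.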

Highlights

Word sense disambiguation (WSD) is a difficult task for automatic systems (Navigli, 2009).
We built a WSD system for Swedish by applying the random walk-based training described above to the SALDO lexicon (Borin et al., 2013). We evaluated this system on six different annotated corpora, in which the ambiguous words have been manually disambiguated according to SALDO, and compared it to random and first-sense baselines and to UKB (Agirre and Soroa, 2009), a state-of-the-art graph-based WSD system.
A model is trained on synthetic datasets compiled from random walks on SALDO.

Summary

Introduction

Word sense disambiguation (WSD) is a difficult task for automatic systems (Navigli, 2009). Several methods are available that use lexical knowledge bases (LKBs) for WSD (Navigli and Lapata, 2007; Agirre and Soroa, 2009). These approaches usually apply a relatively complex analysis of the underlying graph, based on the context of a target word, to disambiguate it; e.g., Agirre and Soroa (2009) use the Personalized PageRank algorithm to perform walks on the graph. These methods are computationally very costly, which makes them practically useless for large corpora.
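To make the cost argument concrete, here is a minimal sketch of textbook Personalized PageRank by power iteration, with the random-walk restart concentrated on the senses of the context words. It is not the UKB implementation. The toy graph, the damping factor, and the iteration count are assumptions chosen for illustration. Every disambiguation call repeats a full pass over the graph for each iteration, which suggests why such methods scale poorly to large corpora.

```python
# Hypothetical toy sense graph (invented identifiers, not SALDO entries).
TOY_GRAPH = {
    "rock..1": ["stone..1", "mountain..1"],
    "rock..2": ["music..1", "band..1"],
    "stone..1": ["rock..1", "mountain..1"],
    "mountain..1": ["rock..1", "stone..1"],
    "music..1": ["rock..2", "band..1"],
    "band..1": ["rock..2", "music..1"],
}


def personalized_pagerank(graph, seeds, damping=0.85, iterations=30):
    """Power iteration where the walk restarts only at the seed nodes."""
    seeds = set(seeds)
    nodes = list(graph)
    # Restart distribution: uniform over the context (seed) senses.
    teleport = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    rank = dict(teleport)
    for _ in range(iterations):
        # Mass that jumps back to the seed nodes on this step.
        new_rank = {n: (1.0 - damping) * teleport[n] for n in nodes}
        # Remaining mass is split evenly among each node's neighbours.
        for n in nodes:
            out = graph[n]
            if not out:
                continue  # dangling node: its mass is dropped in this sketch
            share = damping * rank[n] / len(out)
            for m in out:
                new_rank[m] += share
        rank = new_rank
    return rank


# Context senses related to music should favour the musical sense of "rock".
ranks = personalized_pagerank(TOY_GRAPH, seeds={"music..1", "band..1"})
best_sense = max(["rock..1", "rock..2"], key=ranks.get)
```

In this toy setting `best_sense` is the candidate sense with the higher rank given the context. The cost per target word grows with the number of edges times the number of iterations, whereas a lookup in precomputed embeddings does not touch the graph at all.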



Publication Date: Jan 1, 2016
Citations: 25
License type: cc-by
