Abstract

Word sense disambiguation is a fundamental task in Natural Language Processing that aims to identify the most appropriate sense of an ambiguous word in a given context. In this work, we propose a model that uses a stacked bidirectional Long Short-Term Memory (BiLSTM) neural network and an attention mechanism to determine the sense of ambiguous words. First, the stacked BiLSTM produces a deep embedding-based representation of the sentence containing the ambiguous word. Then, a self-attention mechanism highlights the contextual features of the ambiguous word and constructs the overall semantic representation of the sentence. Finally, the sentence representation is fed to a multilayer perceptron classifier to predict the appropriate sense category of the ambiguous word. The model is evaluated on the SemEval-2007 Task 17 English Lexical Sample dataset, with additional test data consisting of example sentences for ambiguous words drawn from the Oxford, Cambridge, and Collins dictionaries. The effectiveness of the proposed approach is demonstrated through comparison with existing word sense disambiguation methods. Our experimental results show that the proposed model outperforms these methods on the evaluation metrics (Average Accuracy, Micro F1-Score, Kappa, and Matthews Correlation Coefficient) and exhibits strong interpretability.
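To make the described pipeline concrete (stacked BiLSTM encoding, self-attention pooling, and an MLP sense classifier), the following is a minimal PyTorch sketch. All hyperparameters (embed_dim, hidden_dim, num_layers) and the additive attention form are illustrative assumptions, not the configuration reported in the paper.

```python
import torch
import torch.nn as nn

class StackedBiLSTMAttentionWSD(nn.Module):
    """Sketch of a stacked BiLSTM + self-attention word sense classifier.

    Dimensions and layer counts are hypothetical choices for illustration.
    """

    def __init__(self, vocab_size, num_senses,
                 embed_dim=300, hidden_dim=256, num_layers=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        # Stacked bidirectional LSTM: deep contextual representation of the
        # sentence containing the ambiguous word.
        self.bilstm = nn.LSTM(embed_dim, hidden_dim, num_layers=num_layers,
                              bidirectional=True, batch_first=True)
        # Additive self-attention scores each time step so that context words
        # relevant to the ambiguous word receive higher weight.
        self.attn = nn.Linear(2 * hidden_dim, 1)
        # Multilayer perceptron classifier over the attended sentence vector.
        self.mlp = nn.Sequential(
            nn.Linear(2 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_senses),
        )

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) token indices of the input sentence
        h, _ = self.bilstm(self.embedding(token_ids))   # (batch, seq, 2*hidden)
        weights = torch.softmax(self.attn(h), dim=1)    # (batch, seq, 1)
        sentence_vec = (weights * h).sum(dim=1)         # (batch, 2*hidden)
        return self.mlp(sentence_vec)                   # sense logits


# Usage: score a toy batch of two 10-token sentences over 4 candidate senses.
model = StackedBiLSTMAttentionWSD(vocab_size=5000, num_senses=4)
logits = model(torch.randint(0, 5000, (2, 10)))
print(logits.shape)  # torch.Size([2, 4])
```

The attention weights make the model interpretable in the sense the abstract mentions: they indicate which context words most influenced the predicted sense.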
