Word Sense Disambiguation Approach Research Articles

Word Sense Disambiguation (WSD) is one of the earliest problems in natural language processing which aims to determine the correct sense of words in context. The semantic information provided by WSD systems is highly beneficial to many tasks such as machine translation, information extraction, and semantic parsing. In this work, a new approach for WSD is proposed which uses a neural network as a surrogate fitness function in a metaheuristic algorithm. Also, a new method for simultaneous training of word and sense embeddings is proposed in this work. Accordingly, the node2vec algorithm is employed on the WordNet graph to generate sequences containing both words and senses. These sequences are then used along with paragraphs from Wikipedia in the word2vec algorithm to generate embeddings for words and senses at the same time. In order to address data imbalance in this task, sense probability distribution data extracted from the training corpus is used in the search process of the proposed simulated annealing algorithm. Furthermore, we introduce a new approach for clustering and mapping senses in the WordNet graph, which considerably improves the accuracy of the proposed method. In this approach, nodes in the WordNet graph are clustered on the condition that no two senses of the same word be present in one cluster. Then, repeatedly, all nodes in each cluster are mapped to a randomly selected node from that cluster, meaning that the representative node can take advantage of the training instances of all the other nodes in the cluster. Training the proposed method in this work is done using the SemCor dataset and the SemEval-2015 dataset has been used as the validation set. The final evaluation of the system is performed on SensEval-2, SensEval-3, SemEval-2007, SemEval-2013, SemEval-2015, and the concatenation of all five mentioned datasets. The performance of the system is also evaluated on the four content word categories, namely, nouns, verbs, adjectives, and adverbs. Experimental results show that the proposed method achieves accuracies in the range of 74.8 to 84.6 percent in the ten aforementioned evaluation categories which are close to and in some cases better than the state of the art in this task.

Read full abstract

Word sense disambiguation is a process to correctly identify the meanings of words in a given context. Being important in many natural language processing applications, this process is crucial in automatically understanding natural language expressions. Herein, we propose a variation of a well-known unsupervised graph-based word sense disambiguation method that utilizes all possible semantic information from a used lexical resource to increase graph-semantic connectivity for identifying the intended meanings of words in a given context. If the words have multiple potential meanings (senses) based on context, the proposed method builds an expanded graph representing most relevant semantic information of the words to be disambiguated. Nodes in the graph correspond to the context expansion set, which contains all associated information of each possible meaning of the word (word sense), and edges represent the semantic similarity between the expanded sets (nodes). Simultaneously, actual meaning is assigned to each target word using a locate graph centrality measure, which provides the degree of importance between graph nodes. Unlike most existing graph-based word sense disambiguation methods, wherein semantic relations (edges) between nodes are measured at the word level, the proposed method measures graph node semantic relations at the sentence level by expanding the words’ context, which contains all associated information for each possible word sense. Consequently, the proposed method can capture a higher degree of semantic information than existing approaches, thereby increasing semantic connectivity through a graph’s edges. Empirical results on benchmark datasets demonstrate that the proposed method outperforms all compared state-of-the-art graph-based word sense disambiguation approaches reported herein. We also report results obtained by applying the proposed method to a sentiment analysis task. These results demonstrate that the proposed method can determine the overall sentiment orientation of a given textual context.

Read full abstract

Word Sense Disambiguation Approach Research Articles

Articles published on Word Sense Disambiguation Approach

A Comprehensive Review of Word Sense Disambiguation Research in few Indian Languages: Implications for Educational Tools

EnhancedBERT: A feature-rich ensemble model for Arabic word sense disambiguation with statistical analysis and optimized data collection

Stacking of BERT and CNN Models for Arabic Word Sense Disambiguation

Sentence Semantic Similarity based Complex Network approach for Word Sense Disambiguation

Lexeme connexion measure of cohesive lexical ambiguity revealing factor: a robust approach for word sense disambiguation of Bengali text

Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation

A Thematic-Role-Based Approach for Word Sense Disambiguation

A Word Sense Disambiguatin Approach For Romanian Language

ULMFiT Embedding(s) for Context and Extended Gloss Intersection for Marathi Word Sense Disambiguation

Supervised, Unsupervised and Semi-Supervised Word Sense Disambiguation Approaches

A metaheuristic with a neural surrogate function for Word Sense Disambiguation

ADCSA-WSD: Adapted Discrete Crow Search Algorithm for Word Sense Disambiguation

A novel word sense disambiguation approach using WordNet knowledge graph

МОДИФИЦИРОВАННЫЙ МЕТОД УСТРАНЕНИЯ НЕОДНОЗНАЧНОСТИ СМЫСЛА СЛОВ, ОСНОВАННЫЙ НА МЕТОДАХ РАСПРЕДЕЛЕННОГО ПРЕДСТАВЛЕНИЯ

A REVIEW ON WORD SENSE DISAMBIGUATION EMPHASIZING THE DATA RESOURCES ON WORDNET AND CORPUS

An adaptive approach for word sense disambiguation for Hindi language

Hypernymy in WordNet, Its Role in WSD and Its Limitations

Context expansion approach for graph-based word sense disambiguation

Explicitly Modeling Word Translations in Neural Machine Translation

A Semantic Framework for Extracting Taxonomic Relations from Text Corpus

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Word Sense Disambiguation Approach Research Articles

Articles published on Word Sense Disambiguation Approach

A Comprehensive Review of Word Sense Disambiguation Research in few Indian Languages: Implications for Educational Tools

EnhancedBERT: A feature-rich ensemble model for Arabic word sense disambiguation with statistical analysis and optimized data collection

Stacking of BERT and CNN Models for Arabic Word Sense Disambiguation

Sentence Semantic Similarity based Complex Network approach for Word Sense Disambiguation

Lexeme connexion measure of cohesive lexical ambiguity revealing factor: a robust approach for word sense disambiguation of Bengali text

Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation

A Thematic-Role-Based Approach for Word Sense Disambiguation

A Word Sense Disambiguatin Approach For Romanian Language

ULMFiT Embedding(s) for Context and Extended Gloss Intersection for Marathi Word Sense Disambiguation

Supervised, Unsupervised and Semi-Supervised Word Sense Disambiguation Approaches

A metaheuristic with a neural surrogate function for Word Sense Disambiguation

ADCSA-WSD: Adapted Discrete Crow Search Algorithm for Word Sense Disambiguation

A novel word sense disambiguation approach using WordNet knowledge graph

МОДИФИЦИРОВАННЫЙ МЕТОД УСТРАНЕНИЯ НЕОДНОЗНАЧНОСТИ СМЫСЛА СЛОВ, ОСНОВАННЫЙ НА МЕТОДАХ РАСПРЕДЕЛЕННОГО ПРЕДСТАВЛЕНИЯ

A REVIEW ON WORD SENSE DISAMBIGUATION EMPHASIZING THE DATA RESOURCES ON WORDNET AND CORPUS

An adaptive approach for word sense disambiguation for Hindi language

Hypernymy in WordNet, Its Role in WSD and Its Limitations

Context expansion approach for graph-based word sense disambiguation

Explicitly Modeling Word Translations in Neural Machine Translation

A Semantic Framework for Extracting Taxonomic Relations from Text Corpus