A metaheuristic with a neural surrogate function for Word Sense Disambiguation

Azim Keshavarzian Nodehi,Nasrollah Moghadam Charkari

doi:10.1016/j.mlwa.2022.100369

Azim Keshavarzian Nodehi, Nasrollah Moghadam Charkari

Open Access

https://doi.org/10.1016/j.mlwa.2022.100369

Copy DOI

Journal: Machine Learning with Applications	Publication Date: Jun 17, 2022
Citations: 1	License type: cc-by-nc-nd

Affiliation: Tarbiat Modares University

Abstract

Word Sense Disambiguation (WSD) is one of the earliest problems in natural language processing which aims to determine the correct sense of words in context. The semantic information provided by WSD systems is highly beneficial to many tasks such as machine translation, information extraction, and semantic parsing. In this work, a new approach for WSD is proposed which uses a neural network as a surrogate fitness function in a metaheuristic algorithm. Also, a new method for simultaneous training of word and sense embeddings is proposed in this work. Accordingly, the node2vec algorithm is employed on the WordNet graph to generate sequences containing both words and senses. These sequences are then used along with paragraphs from Wikipedia in the word2vec algorithm to generate embeddings for words and senses at the same time. In order to address data imbalance in this task, sense probability distribution data extracted from the training corpus is used in the search process of the proposed simulated annealing algorithm. Furthermore, we introduce a new approach for clustering and mapping senses in the WordNet graph, which considerably improves the accuracy of the proposed method. In this approach, nodes in the WordNet graph are clustered on the condition that no two senses of the same word be present in one cluster. Then, repeatedly, all nodes in each cluster are mapped to a randomly selected node from that cluster, meaning that the representative node can take advantage of the training instances of all the other nodes in the cluster. Training the proposed method in this work is done using the SemCor dataset and the SemEval-2015 dataset has been used as the validation set. The final evaluation of the system is performed on SensEval-2, SensEval-3, SemEval-2007, SemEval-2013, SemEval-2015, and the concatenation of all five mentioned datasets. The performance of the system is also evaluated on the four content word categories, namely, nouns, verbs, adjectives, and adverbs. Experimental results show that the proposed method achieves accuracies in the range of 74.8 to 84.6 percent in the ten aforementioned evaluation categories which are close to and in some cases better than the state of the art in this task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A metaheuristic with a neural surrogate function for Word Sense Disambiguation

Abstract

Talk to us

Similar Papers

More From: Machine Learning with Applications

Lead the way for us

Similar Papers

A Genetic Algorithm Based Approach for Word Sense Disambiguation Using Fuzzy WordNet Graphs
Sonakshi Vij ... Amita Jain
-
Sonakshi Vij, et. al.Sonakshi Vij ... Amita Jain
01 Jan 2020
01 Jan 2020

An approach to reduce part of speech ambiguity using semantically annotated lexicon definitions
Andrei Minca ... Stefan Diaconescu
-
Andrei Minca, et. al.Andrei Minca ... Stefan Diaconescu
01 Sep 2012
01 Sep 2012

An Approach to Reduce Part of Speech Ambiguity Using Semantically Annotated Lexicon Definitions
Andrei Minc ... Tefan Diaconescu
-
Andrei Minc, et. al.Andrei Minc ... Tefan Diaconescu
01 Jan 2013
01 Jan 2013

Survey and Gap Analysis of Word Sense Disambiguation Approaches on Unstructured Texts
Krishnanjan Bhattacharjee ... Devika Verma
-
Krishnanjan Bhattacharjee, et. al.Krishnanjan Bhattacharjee ... Devika Verma
01 Jul 2020
01 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A metaheuristic with a neural surrogate function for Word Sense Disambiguation

Abstract

Talk to us

Similar Papers

More From: Machine Learning with Applications