Word Sense Disambiguation System Research Articles

There has been a great deal of recent research into word sense disambiguation, particularly since the inception of the Senseval evaluation exercises. Because a word often has more than one meaning, resolving word sense ambiguity could benefit applications that need some level of semantic interpretation of language input. A major problem is that the accuracy of word sense disambiguation systems is strongly dependent on the quantity of manually sense-tagged data available, and even the best systems, when tagging every word token in a document, perform little better than a simple heuristic that guesses the first, or predominant, sense of a word in all contexts. The success of this heuristic is due to the skewed nature of word sense distributions. Data for the heuristic can come from either dictionaries or a sample of sense-tagged data. However, there is a limited supply of the latter, and the sense distributions and predominant sense of a word can depend on the domain or source of a document. (The first sense of “star”, for example, would be different in the popular press and in scientific journals.) In this article, we expand on a previously proposed method for determining the predominant sense of a word automatically from raw text. We look at a number of different data sources and parameterizations of the method, using evaluation results and error analyses to identify where the method performs well and also where it does not. In particular, we find that the method does not work as well for verbs and adverbs as it does for nouns and adjectives, but that it produces more accurate predominant sense information than the widely used SemCor corpus for nouns with low coverage in that corpus. We further show that the method is able to adapt successfully to domains when using domain-specific corpora as input, where the input can either be hand-labeled for domain or automatically classified.
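
The first-sense heuristic mentioned in this abstract can be illustrated with a minimal sketch. It assumes a small sense-tagged sample is available and simply counts senses per word; it is not the unsupervised acquisition method the article proposes. The function names, sense labels, and toy data below are all hypothetical.

```python
from collections import Counter, defaultdict


def predominant_senses(tagged_tokens):
    """Map each word to its most frequent sense in a sense-tagged sample.

    tagged_tokens is an iterable of (word, sense) pairs.
    """
    counts = defaultdict(Counter)
    for word, sense in tagged_tokens:
        counts[word][sense] += 1
    # most_common(1) yields the single highest-count (sense, count) pair.
    return {word: c.most_common(1)[0][0] for word, c in counts.items()}


def tag_with_first_sense(words, predominant, default=None):
    """Assign every token its predominant sense, ignoring context."""
    return [predominant.get(w, default) for w in words]


# Hypothetical toy samples for two domains (labels are illustrative only).
popular_press = [
    ("star", "celebrity"),
    ("star", "celebrity"),
    ("star", "celestial_body"),
]
science = [
    ("star", "celestial_body"),
    ("star", "celestial_body"),
    ("star", "celebrity"),
]

press_senses = predominant_senses(popular_press)
science_senses = predominant_senses(science)
```

In this toy data the predominant sense of "star" differs between the two samples, which mirrors the domain dependence the abstract describes.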

Word Sense Disambiguation (WSD) is traditionally considered an AI-hard problem. A breakthrough in this field would have a significant impact on many relevant Web-based applications, such as Web information retrieval, improved access to Web services, and information extraction. Early approaches to WSD, based on knowledge representation techniques, have been replaced in the past few years by more robust machine learning and statistical techniques. The results of recent comparative evaluations of WSD systems, however, show that these methods have inherent limitations. On the other hand, the increasing availability of large-scale, rich lexical knowledge resources seems to provide new challenges to knowledge-based approaches. In this paper, we present a method, called structural semantic interconnections (SSI), which creates structural specifications of the possible senses for each word in a context and selects the best hypothesis according to a grammar G, describing relations between sense specifications. Sense specifications are created from several available lexical resources that we integrated in part manually, in part with the help of automatic procedures. The SSI algorithm has been applied to different semantic disambiguation problems, such as automatic ontology population, disambiguation of sentences in generic texts, and disambiguation of words in glossary definitions. Evaluation experiments have been performed on specific knowledge domains (e.g., tourism, computer networks, enterprise interoperability), as well as on standard disambiguation test sets.

Articles published on Word Sense Disambiguation System

Unsupervised Acquisition of Predominant Word Senses

503 POSTER Comparative analysis of microarray testing and immunohistochemistry in patients with carcinoma of unknown primary – CUP syndrome

Word Sense Disambiguation by Combining Classifiers with an Adaptive Selection of Context Representation

Making fine-grained and coarse-grained sense distinctions, both manually and automatically

A Word Sense Disambiguation System Using Modified Naive Bayesian Algorithms for Indonesian Language

A detailed comparison of WSD systems: an analysis of the system answers for the SENSEVAL-2 English all words task

Structural semantic interconnections: a knowledge-based approach to word sense disambiguation

Búsqueda de Colocaciones en la Web para Sinónimos de Wordnet

A Word Sense Disambiguation System Using Modified Naive Bayesian Algorithms for Indonesian Language

Probabilistic word sense disambiguation

Disambiguating Nouns, Verbs, and Adjectives Using Automatically Acquired Selectional Preferences

SENSEVAL, a computer-based approximation to meaning

Parameter optimization for machine-learning of word sense disambiguation

Introduction to the special issue on evaluating word sense disambiguation systems

Semantic Encoding of Electronic Documents

Senseval/Romanseval: The Framework for Italian

Large scale WSD using learning applied to SENSEVAL

ROMANSEVAL: Results for Italian by SENSE

SENSE: an analogy-based Word Sense Disambiguation system

Selective sampling for example-based word sense disambiguation

