Abstract

Top-k word selection is a technique for detecting and returning the k words most similar to a given word from a candidate set. It is a crucial and widely used tool in many tasks. The key issue in top-k word selection is how to measure the similarity between words. One popular and effective solution is to use a word embedding-based similarity measure, which represents words as low-dimensional vectors and measures word similarity by the similarity of those vectors under a chosen metric. However, most word embedding methods consider only the local proximity properties of two words in a corpus. To mitigate this issue, in this article we propose to use association rules to measure word similarity at a global level, together with a fuzzy similarity measure for top-k word selection that jointly encodes the local and global similarities. Experiments on a real-world query task with three benchmark datasets, i.e., TREC-disk 4&5, WT10G, and RCV1, demonstrate the efficiency of the proposed method compared to several state-of-the-art baselines.
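For illustration, the embedding-based top-k selection mentioned above (not the fuzzy, association-rule-based measure proposed in the article) can be sketched as follows. This is a minimal example assuming pre-trained embedding vectors are available; the function and variable names (top_k_words, vocab, embeddings) are hypothetical and not taken from the paper.

import numpy as np

def top_k_words(query_vec, vocab, embeddings, k=5):
    # Return the k words whose embedding vectors are most similar
    # (by cosine similarity) to the query vector.
    q = query_vec / np.linalg.norm(query_vec)
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = (embeddings / norms) @ q          # cosine similarities
    top_idx = np.argsort(-sims)[:k]          # indices of k largest
    return [(vocab[i], float(sims[i])) for i in top_idx]

# Toy usage with random vectors standing in for trained embeddings.
rng = np.random.default_rng(0)
vocab = ["car", "automobile", "vehicle", "banana", "river"]
embeddings = rng.normal(size=(len(vocab), 50))
print(top_k_words(embeddings[0], vocab, embeddings, k=3))

The paper's contribution replaces the purely local, metric-based comparison in this sketch with a similarity that also incorporates corpus-level association rules.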
