Leveraging semantic resources in diversified query expansion

Adit Krishnan, Sayan Ranu, Deepak P, Sameep Mehta

doi:10.1007/s11280-017-0468-7

Abstract

A search query, being a very concise grounding of user intent, could potentially have many possible interpretations. Search engines hedge their bets by diversifying top results to cover multiple such possibilities so that the user is likely to be satisfied, whatever her intended interpretation. Diversified Query Expansion is the problem of diversifying query expansion suggestions, so that the user can specialize the query to better suit her intent, even before perusing search results. In this paper, we consider the usage of semantic resources and tools to arrive at improved methods for diversified query expansion. In particular, we develop two methods that leverage Wikipedia and pre-learnt distributional word embeddings, respectively. Both approaches operate on a common three-phase framework: first selecting a set of informative terms from the search results of the initial query, then building a graph, and finally using a diversity-conscious node ranking to prioritize candidate terms for diversified query expansion. Our methods differ in the second phase: the first method, Select-Link-Rank (SLR), links terms with Wikipedia entities to accomplish graph construction, whereas the second method, Select-Embed-Rank (SER), constructs the graph using similarities between distributional word embeddings. Through an empirical analysis and user study, we show that SLR outperforms state-of-the-art diversified query expansion methods, thus establishing that Wikipedia is an effective resource to aid diversified query expansion. Our empirical analysis also illustrates that SER outperforms the baselines convincingly, asserting that it is the best available method for those cases where SLR is not applicable; these include narrow-focus search systems where a relevant knowledge base is unavailable. Our SLR method is also seen to outperform a state-of-the-art method in the task of diversified entity ranking.

Highlights

Users of a search system may choose the same initial search query for varying information needs.
Another work [19] proposes scoring candidate query expansion terms using the similarity of their word embeddings to those of the terms in the query. While neither of these methods incorporates a mechanism for diversification, we extend the latter model, called RM-CombSum, with a Maximum Marginal Relevance (MMR) [5] based diversification, leading to a word-embedding based diversified query expansion method that we use as a baseline in our empirical evaluation.
We considered the task of leveraging external semantic resources for the Diversified Query Expansion task.
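
The embedding-plus-MMR baseline described in the highlights can be illustrated with a small sketch. This is not the authors' implementation and not the exact RM-CombSum formulation from [19]: it assumes relevance is the sum of cosine similarities between a candidate term's embedding and each query term's embedding, and it applies the standard greedy MMR selection on top. The function names, the `lam` trade-off parameter and the toy two-dimensional vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def relevance(term_vec, query_vecs):
    """Assumed relevance: sum of similarities to every query term."""
    return sum(cosine(term_vec, q) for q in query_vecs)


def mmr_expand(query_vecs, candidates, k, lam=0.5):
    """Greedily pick up to k expansion terms by Maximum Marginal Relevance.

    candidates: dict mapping term -> embedding (toy vectors here).
    lam: 1.0 ranks by relevance only; lower values penalise terms
         that are similar to terms already selected.
    """
    rel = {t: relevance(v, query_vecs) for t, v in candidates.items()}
    selected = []
    remaining = set(candidates)
    while remaining and len(selected) < k:

        def mmr_score(term):
            # Redundancy is the highest similarity to any chosen term.
            redundancy = max(
                (cosine(candidates[term], candidates[s]) for s in selected),
                default=0.0,
            )
            return lam * rel[term] - (1.0 - lam) * redundancy

        # Iterate in sorted order so ties break deterministically.
        best = max(sorted(remaining), key=mmr_score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy embeddings: axis 0 ~ "software", axis 1 ~ "animal".
query_vecs = [[1.0, 1.0]]
candidates = {
    "programming": [1.0, 0.3],
    "coding": [1.0, 0.25],
    "snake": [0.2, 1.0],
}
```

With these toy vectors, `mmr_expand(query_vecs, candidates, 2, lam=1.0)` selects the two near-synonymous software terms, while `lam=0.5` keeps the top term and replaces the second with `snake`, which covers a different interpretation of the query.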

Summary

Introduction

Users of a search system may choose the same initial search query for varying information needs. Such difficulties in covering long-tail aspects, as noted in [2], led to research interest in a slightly different task attacking the same larger goal, that of Diversified Query Expansion (DQE). Even for a seemingly unambiguous query such as python programming, there are many aspects, depending on whether the user is interested in books, software or courses. For another seemingly unambiguous query, india, the aspects of interest could include railways, maps, news and cricket.

Journal: World Wide Web
Publication Date: Jun 5, 2017
Citations: 23
License type: open-access

Similar Papers

A Query Expansion Method Based on Evolving Source Code
Huan Jin ... Lei Xiong
Wuhan University Journal of Natural Sciences | VOL. 24
12 Sep 2019

An Optimal Ranking Approach for Cluster based of Clicked URLs using Firefly Algorithm for Efficient Personalized Web Search
International Journal of Advanced Research in Computer Science | VOL. 8
20 Jun 2017

Variation in Results of Three Biology‐Focused Search Engines: A Case Study Using North American Tree Species
Pete Bettinger ... Jacek Siry
The Bulletin of the Ecological Society of America | VOL. 102
09 Nov 2020

Select, Link and Rank: Diversified Query Expansion and Entity Ranking Using Wikipedia
Adit Krishnan ... Sayan Ranu
01 Jan 2015
