Abstract

Effective query expansion terms selection methods are really very important for improving the accuracy and efficiency of Pseudo-Relevance Feedback (PRF) based automatic query expansion techniques in information retrieval system. These methods remove irrelevant and redundant terms from the top retrieved feedback documents with respect to a user query. Individual terms selection methods have been widely investigated for improving its performance. However, it is always a challenging task to find an individual expansion terms selection method that would outperform other individual methods in most cases. In this paper, first we explore the possibility of improving the overall performance using individual terms selection methods. Second, we propose a model for combining multiple expansion terms selection methods by using a variety of ranks combining approaches. Third, semantic filtering used to filter out semantically irrelevant terms obtained after combining multiple terms selection methods. Fourth, the Genetic Algorithm used to make an optimal combination of query terms and candidate expansion terms obtained by applying ranks combination and semantic filtering approach. Our experimental results demonstrated that our proposed approaches achieved a significant improvement over each individual terms selection methods and related state-of-the-arts approaches.

Highlights

  • IntroductionOne of the most feasible and successful technique to handle this problem is Automatic Query Expansion (AQE) that automatically expand the original user query with some additional terms/words that are related to user query in some way

  • The term mismatch is one of the major problems in Information Retrieval (IR) system

  • We found that the performance of our proposed Query Expansion (QE) term selection approaches Chi-square Based Query Expansion (CHIBQE), Co-occurrence Based Query Expansion (CBQE), Binary Independent Model Based Query Expansion (BIMBQE) and Robertson Selection Value Based Query Expansion (RSVBQE) achieved a significant improvement over basic retrieval model Okapi-BM25

Read more

Summary

Introduction

One of the most feasible and successful technique to handle this problem is Automatic Query Expansion (AQE) that automatically expand the original user query with some additional terms/words that are related to user query in some way. In order to consider the above problem, there is a need for AQE techniques that can automatically reformulate the original user query. While there has been a slight increase in the number of long queries (of five or more words), the most prevalent queries are still those of one, two, and three words. In this situation, the need and the scope of AQE have increased, but it has some problems

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call