Word-embedding-based pseudo-relevance feedback for Arabic information retrieval

Abdelkader El Mahdaouy,Eric Gaussier,Saïd Ouatik El Alaoui

doi:10.1177/0165551518792210

Abstract

Pseudo-relevance feedback (PRF) is a very effective query expansion approach, which reformulates queries by selecting expansion terms from top k pseudo-relevant documents. Although standard PRF models have been proven effective to deal with vocabulary mismatch between users’ queries and relevant documents, expansion terms are selected without considering their similarity to the original query terms. In this article, we propose a method to incorporate word embedding (WE) similarity into PRF models for Arabic information retrieval (IR). The main idea is to select expansion terms using their distribution in the set of top pseudo-relevant documents along with their similarity to the original query terms. Experiments are conducted on the standard Arabic TREC 2001/2002 collection using three neural WE models. The obtained results show that our PRF extensions significantly outperform their baseline PRF models. Moreover, they enhanced the baseline IR model by 22% and 68% for the mean average precision (MAP) and the robustness index (RI), respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Word-embedding-based pseudo-relevance feedback for Arabic information retrieval

Abstract

Talk to us

Similar Papers

More From: Journal of Information Science

Lead the way for us

Journal: Journal of Information Science	Publication Date: Aug 9, 2018
Citations: 21

Similar Papers

A Theoretical Analysis of Pseudo-Relevance Feedback Models
Stéphane Clinchant ... Eric Gaussier
-
Stéphane Clinchant, et. al.Stéphane Clinchant ... Eric Gaussier
29 Sep 2013
29 Sep 2013

Improving Pseudo Relevance Feedback in the Divergence from Randomness Model
Dipasree Pal ... Samar Bhattacharya
-
Dipasree Pal, et. al.Dipasree Pal ... Samar Bhattacharya
27 Sep 2015
27 Sep 2015

Theoretical Analysis of Interdependent Constraints in Pseudo-Relevance Feedback
Ali Montazeralghaem ... Hamed Zamani
-
Ali Montazeralghaem, et. al.Ali Montazeralghaem ... Hamed Zamani
27 Jun 2018
27 Jun 2018

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art
Juan J Lastra-Díaz ... Eneko Agirre
Engineering Applications of Artificial Intelligence | VOL. 85
Juan J Lastra-Díaz, et. al.Juan J Lastra-Díaz ... Eneko Agirre
01 Aug 2019
Engineering Applications of Artificial Intelligence | VOL. 85

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Word-embedding-based pseudo-relevance feedback for Arabic information retrieval

Abstract

Talk to us

Similar Papers

More From: Journal of Information Science