Using learning to rank approach for parallel corpora based cross language information retrieval

Hosein Azarbonyad, Azadeh Shakery, Heshaam Faili

doi:10.3233/978-1-61499-098-7-79

Abstract

Learning to Rank (LTR) refers to machine learning techniques for training a model on a ranking task. LTR has been shown to be useful in many information retrieval (IR) applications. Cross-language information retrieval (CLIR) is one of the major IR tasks that can potentially benefit from LTR to improve ranking accuracy. CLIR deals with the problem of expressing a query in one language and retrieving the related documents in another language. One of the most important issues in CLIR is how to apply monolingual IR methods in cross-lingual environments. In this paper, we propose a new method to exploit LTR for CLIR in which documents are represented as feature vectors. This method provides a mapping, based on IR heuristics, that makes monolingual IR features usable in parallel-corpus-based CLIR. The mapped features serve as training data for LTR. We show that LTR trained on the mapped features can improve CLIR performance. A comprehensive evaluation on English-Persian CLIR suggests that our method yields significant improvements over both parallel-corpora-based and dictionary-based methods.
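
The abstract does not specify the mapping or the ranker, so the following is only a minimal sketch of the general idea, under stated assumptions: a monolingual feature such as term frequency is carried across languages by weighting target-language counts with translation probabilities (which could be estimated from a parallel corpus), and a simple pairwise linear ranker is trained on the resulting feature vectors. The function names, the three features, the perceptron-style update, and the toy transliterated Persian tokens are all hypothetical and are not taken from the paper.

```python
from collections import Counter
import math


def mapped_tf(query_term, doc_tokens, translation_probs):
    """Expected frequency of a source-language term in a target-language document.

    ASSUMPTION: the mapping weights each target-term count by p(target | source).
    """
    counts = Counter(doc_tokens)
    candidates = translation_probs.get(query_term, {})
    return sum(p * counts[t] for t, p in candidates.items())


def feature_vector(query_terms, doc_tokens, translation_probs):
    """Three illustrative query-document features built on the mapped frequency."""
    tfs = [mapped_tf(q, doc_tokens, translation_probs) for q in query_terms]
    doc_len = len(doc_tokens)
    return [
        sum(tfs),                                        # raw mapped tf
        sum(math.log1p(tf) for tf in tfs),               # dampened mapped tf
        sum(tf / doc_len for tf in tfs) if doc_len else 0.0,  # length-normalised tf
    ]


def train_pairwise(pairs, n_features, epochs=50, lr=0.1):
    """Perceptron-style pairwise ranker over (better, worse) feature-vector pairs."""
    w = [0.0] * n_features
    for _ in range(epochs):
        for better, worse in pairs:
            diff = [b - c for b, c in zip(better, worse)]
            # Update only when the pair is currently mis-ordered (or tied).
            if sum(wi * di for wi, di in zip(w, diff)) <= 0:
                w = [wi + lr * di for wi, di in zip(w, diff)]
    return w


def score(w, features):
    """Linear ranking score."""
    return sum(wi * fi for wi, fi in zip(w, features))


# Toy data (hypothetical): an English query term and transliterated Persian documents.
translation_probs = {"book": {"ketab": 0.9, "daftar": 0.1}}
relevant_doc = ["ketab", "ketab", "khaneh"]
irrelevant_doc = ["khaneh", "khaneh", "khaneh"]

f_rel = feature_vector(["book"], relevant_doc, translation_probs)
f_irr = feature_vector(["book"], irrelevant_doc, translation_probs)
w = train_pairwise([(f_rel, f_irr)], n_features=3)
```

With these toy inputs the trained weights score the relevant document above the irrelevant one. A real system would use many more features and an established LTR algorithm.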

