Unbiased Learning to Rank: Counterfactual and Online Approaches

Harrie Oosterhuis,Maarten De Rijke,Rolf Jagerman

doi:10.1145/3366424.3383107

Abstract

This tutorial is about Unbiased Learning to Rank, a recent research field that aims to learn unbiased user preferences from biased user interactions. We will provide an overview of the two main families of methods in Unbiased Learning to Rank: Counterfactual Learning to Rank (CLTR) and Online Learning to Rank (OLTR) and their underlying theory. First, the tutorial will start with a brief introduction to the general Learning to Rank (LTR) field and the difficulties user interactions pose for traditional supervised LTR methods. The second part will cover Counterfactual Learning to Rank (CLTR), a LTR field that sprung out of click models. Using an explicit model of user biases, CLTR methods correct for them in their learning process and can learn from historical data. Besides these methods, we will also cover practical considerations, such as how certain biases can be estimated. In the third part of the tutorial we focus on Online Learning to Rank (OLTR), methods that learn by directly interacting with users and dealing with biases by adding stochasticity to displayed results. We will cover cascading bandits, dueling bandit techniques and the most recent pairwise differentiable approach. Finally, in the concluding part of the tutorial, both approaches are contrasted, highlighting their relative strengths and weaknesses, and presenting future directions of research. For LTR practitioners our comparison gives guidance on how the choice between methods should be made. For the field of Information Retrieval (IR) we aim to provide an essential guide on unbiased LTR to understanding and choosing between methodologies.

Highlights

Learning to Rank (LTR) has long been a core task in Information Retrieval (IR), as ranking models form the basis of most search and recommendation systems
The first approach to unbiased LTR that we discuss in the tutorial is Counterfactual Learning to Rank (CLTR); it has its roots in user modeling [5]
We provide an overview of the two main families of approaches to unbiased LTR and their underlying theory

Summary

Introduction

Learning to Rank (LTR) has long been a core task in Information Retrieval (IR), as ranking models form the basis of most search and recommendation systems. Ignoring these biases during the learning process will result in biased ranking models that are not fully optimized for user preferences [11]. The field of LTR from user interactions is mainly focussed on methods that remove biases from the learning process, resulting in unbiased LTR.

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unbiased Learning to Rank: Counterfactual and Online Approaches

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Apr 20, 2020
Citations: 17	License type: other-oa

Similar Papers

Using learning to rank approach for parallel corpora based cross language information retrieval
...
-
, et. al. ...
13 Aug 2012
13 Aug 2012

Query-dependent learning to rank for cross-lingual information retrieval
Elham Ghanbari ... Azadeh Shakery
Knowledge and Information Systems | VOL. 59
Elham Ghanbari, et. al.Elham Ghanbari ... Azadeh Shakery
04 Jul 2018
Knowledge and Information Systems | VOL. 59

Learning To Rank for E commerce Cart Optimization
Murali Mohana Krishna Dandu ... Om Goel
Universal Research Reports | VOL. 10
Murali Mohana Krishna Dandu, et. al. Murali Mohana Krishna Dandu ... Om Goel
30 Jun 2023
Universal Research Reports | VOL. 10

To Model or to Intervene
Rolf Jagerman ... Harrie Oosterhuis
-
Rolf Jagerman, et. al.Rolf Jagerman ... Harrie Oosterhuis
18 Jul 2019
18 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unbiased Learning to Rank: Counterfactual and Online Approaches

Abstract

Highlights

Summary

Talk to us

Similar Papers