Counterfactual Online Learning to Rank

Shengyao Zhuang,Guido Zuccon

doi:10.1007/978-3-030-45439-5_28

Abstract

Exploiting users’ implicit feedback, such as clicks, to learn rankers is attractive as it does not require editorial labelling effort, and adapts to users’ changing preferences, among other benefits. However, directly learning a ranker from implicit data is challenging, as users’ implicit feedback usually contains bias (e.g., position bias, selection bias) and noise (e.g., clicking on irrelevant but attractive snippets, adversarial clicks). Two main methods have arisen for optimizing rankers based on implicit feedback: counterfactual learning to rank (CLTR), which learns a ranker from the historical click-through data collected from a deployed, logging ranker; and online learning to rank (OLTR), where a ranker is updated by recording user interaction with a result list produced by multiple rankers (usually via interleaving).In this paper, we propose a counterfactual online learning to rank algorithm (COLTR) that combines the key components of both CLTR and OLTR. It does so by replacing the online evaluation required by traditional OLTR methods with the counterfactual evaluation common in CLTR. Compared to traditional OLTR approaches based on interleaving, COLTR can evaluate a large number of candidate rankers in a more efficient manner. Our empirical results show that COLTR significantly outperforms traditional OLTR methods. Furthermore, COLTR can reach the same effectiveness of the current state-of-the-art, under noisy click settings, and has room for future extensions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Counterfactual Online Learning to Rank

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 58	License type: NO-CC CODE

Similar Papers

Effects of Position Bias on Click-Based Recommender Evaluation
Katja Hofmann ... Alejandro Bellogín
-
Katja Hofmann, et. al.Katja Hofmann ... Alejandro Bellogín
01 Jan 2014
01 Jan 2014

The Impact of Implicit and Explicit Feedback on Performance and Experience during VR-Supported Motor Rehabilitation
Negin Hamzeheinejad ... Anuschka Rodenberg
-
Negin Hamzeheinejad, et. al.Negin Hamzeheinejad ... Anuschka Rodenberg
01 Mar 2021
01 Mar 2021

Investigating the Use of Deep Learning and Implicit Feedback in K12 Educational Recommender Systems
Sohum M Bhatt ... Katrien Verbert
IEEE Transactions on Learning Technologies | VOL. 17
Sohum M Bhatt, et. al.Sohum M Bhatt ... Katrien Verbert
01 Jan 2024
IEEE Transactions on Learning Technologies | VOL. 17

Effective Latent Models for Binary Feedback in Recommender Systems
Maksims Volkovs ... Guang Wei Yu
-
Maksims Volkovs, et. al.Maksims Volkovs ... Guang Wei Yu
09 Aug 2015
09 Aug 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Counterfactual Online Learning to Rank

Abstract

Talk to us

Similar Papers