Policy-Aware Unbiased Learning to Rank for Top-k Rankings

Harrie Oosterhuis,Maarten De Rijke

doi:10.1145/3397271.3401102

Abstract

Counterfactual Learning to Rank (LTR) methods optimize ranking systems using logged user interactions that contain interaction biases. Existing methods are only unbiased if users are presented with all relevant items in every ranking. There is currently no existing counterfactual unbiased LTR method for top-k rankings. We introduce a novel policy-aware counterfactual estimator for LTR metrics that can account for the effect of a stochastic logging policy. We prove that the policy-aware estimator is unbiased if every relevant item has a non-zero probability to appear in the top-k ranking. Our experimental results show that the performance of our estimator is not affected by the size of k: for any k, the policy-aware estimator reaches the same retrieval performance while learning from top-k feedback as when learning from feedback on the full ranking. Lastly, we introduce novel extensions of traditional LTR methods to perform counterfactual LTR and to optimize top-k metrics. Together, our contributions introduce the first policy-aware unbiased LTR approach that learns from top-k feedback and optimizes top-k metrics. As a result, counterfactual LTR is now applicable to the very prevalent top-k ranking setting in search and recommendation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Policy-Aware Unbiased Learning to Rank for Top-k Rankings

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An evolutionary strategy with machine learning for learning to rank in information retrieval
Osman Ali Sadek Ibrahim ... D Landa-Silva
Soft Computing | VOL. 22
Osman Ali Sadek Ibrahim, et. al.Osman Ali Sadek Ibrahim ... D Landa-Silva
03 Jan 2018
Soft Computing | VOL. 22

On Application of Learning to Rank for E-Commerce Search
Shubhra Kanti Karmaker Santu ... Parikshit Sondhi
-
Shubhra Kanti Karmaker Santu, et. al.Shubhra Kanti Karmaker Santu ... Parikshit Sondhi
07 Aug 2017
07 Aug 2017

A SURVEY ON LEARNING TO RANK ALGORITHMS
L Lakshmi
International Journal of Advanced Research in Computer Science | VOL. 9
L LakshmiL Lakshmi
20 Feb 2018
International Journal of Advanced Research in Computer Science | VOL. 9

A graph-based feature selection method for learning to rank using spectral clustering for redundancy minimization and biased PageRank for relevance analysis
Jen-Yuan Yeh ... Cheng-Jung Tsai
Computer Science and Information Systems | VOL. 19
Jen-Yuan Yeh, et. al.Jen-Yuan Yeh ... Cheng-Jung Tsai
01 Jan 2021
Computer Science and Information Systems | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Policy-Aware Unbiased Learning to Rank for Top-k Rankings

Abstract

Talk to us

Similar Papers