Listwise Generative Retrieval Models via a Sequential Learning Process

Yubao Tang, Wei Chen, Xueqi Cheng, Ruqing Zhang, Maarten De Rijke, Jiafeng Guo

doi:10.1145/3653712

Abstract

Recently, a novel generative retrieval (GR) paradigm has been proposed, where a single sequence-to-sequence model is learned to directly generate a list of relevant document identifiers (docids) given a query. Existing GR models commonly employ maximum likelihood estimation (MLE) for optimization: this involves maximizing the likelihood of a single relevant docid given an input query, with the assumption that the likelihood for each docid is independent of the other docids in the list. We refer to these models as the pointwise approach in this article. While the pointwise approach has been shown to be effective in the context of GR, it is considered sub-optimal due to its disregard for the fundamental principle that ranking involves making predictions about lists. In this article, we address this limitation by introducing an alternative listwise approach, which empowers the GR model to optimize the relevance at the docid list level. Specifically, we view the generation of a ranked docid list as a sequence learning process: at each step, we learn a subset of parameters that maximizes the corresponding generation likelihood of the i-th docid given the (preceding) top i-1 docids. To formalize the sequence learning process, we design a positional conditional probability for GR. To alleviate the potential impact of beam search on the generation quality during inference, we perform relevance calibration on the generation likelihood of model-generated docids according to relevance grades. We conduct extensive experiments on representative binary and multi-graded relevance datasets. Our empirical results demonstrate that our method outperforms state-of-the-art GR baselines in terms of retrieval performance.
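The contrast between the pointwise and listwise objectives described above can be illustrated with a small sketch. This is not the article's positional conditional probability, whose exact form is not given in the abstract; a Plackett-Luce-style factorization is swapped in as a stand-in, and the per-docid scores, docid names, and function names are all hypothetical. The pointwise likelihood treats each relevant docid independently, while the listwise likelihood conditions the i-th docid on the i-1 docids already placed.

```python
import math


def pointwise_log_likelihood(scores, relevant):
    """Independent MLE: one log-softmax term per relevant docid.

    `scores` maps hypothetical docids to model scores for one query.
    """
    log_z = math.log(sum(math.exp(s) for s in scores.values()))
    return sum(scores[d] - log_z for d in relevant)


def listwise_log_likelihood(scores, ranked):
    """Plackett-Luce-style stand-in (an assumption, not the article's formula).

    At step i the i-th docid is normalised only against docids that have
    not already been placed, so each term depends on the preceding i-1.
    """
    remaining = dict(scores)
    total = 0.0
    for d in ranked:
        log_z = math.log(sum(math.exp(s) for s in remaining.values()))
        total += remaining[d] - log_z
        del remaining[d]  # condition later steps on this choice
    return total


# Toy scores for three hypothetical docids.
scores = {"d1": 2.0, "d2": 1.0, "d3": 0.0}

good_order = listwise_log_likelihood(scores, ["d1", "d2", "d3"])
bad_order = listwise_log_likelihood(scores, ["d3", "d2", "d1"])
pointwise_a = pointwise_log_likelihood(scores, ["d1", "d2"])
pointwise_b = pointwise_log_likelihood(scores, ["d2", "d1"])
```

In this toy setting the listwise value changes with the order of the list, whereas the pointwise value is the same for any ordering of the same relevant docids, which is the limitation the abstract attributes to the pointwise approach.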

More From: ACM Transactions on Information Systems
