Abstract

Learning to rank (LTR) is a method of ranking search results using machine learning techniques. Reinforcement-learning-based ranking models have recently achieved some success on the LTR task. However, these models suffer from drawbacks such as high-variance gradient estimates and training inefficiency, which hinder the convergence and accuracy of the ranking model. Combining short- and long-term returns, this paper proposes AMRank, an adversarial Markov ranking model that is based on reinforcement learning and formalizes the ranking task as a Markov decision process. To address the aforementioned weaknesses, AMRank introduces a sequence discriminator that outputs a lower-variance long-term return and enables single-step updates, together with a document discriminator that yields a short-term return. The two discriminators are trained simultaneously before each decision is made. During training, the policy network serves as a generator that samples candidate documents to produce negative samples. At each decision step, the discriminators output returns based on the environment state and the policy, and the parameters of the policy network are then updated with the policy gradient method. Experimental results on three LETOR benchmark datasets, OHSUMED, MQ2007 and MQ2008, demonstrate that the proposed AMRank outperforms the baseline models on the document ranking task.
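To make the interplay of the generator and the two discriminators concrete, the following is a minimal PyTorch sketch of one AMRank-style training step. It is illustrative only: the abstract does not specify the network architectures, state encoding, or reward shaping, so the MLP discriminators, the assumed 46-dimensional LETOR-style feature vectors, and the mean-pooled summary of already-ranked documents used as the sequence state are all hypothetical choices.

```python
import torch
import torch.nn as nn

FEAT = 46  # hypothetical LETOR-style feature dimension


def disc(in_dim):
    """Small MLP discriminator emitting a probability in [0, 1]."""
    return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(),
                         nn.Linear(64, 1), nn.Sigmoid())


# Policy network (generator): scores candidate documents for sampling.
policy = nn.Sequential(nn.Linear(FEAT, 64), nn.ReLU(), nn.Linear(64, 1))
doc_disc = disc(FEAT)       # short-term return for a single document
seq_disc = disc(2 * FEAT)   # long-term return given the ranked-so-far summary

opt_pi = torch.optim.Adam(policy.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(list(doc_disc.parameters())
                         + list(seq_disc.parameters()), lr=1e-3)
bce = nn.BCELoss()


def train_step(cands, rel_mask, ranked_summary):
    """One MDP step: cands (n, FEAT), rel_mask (n,) bool with >= 1 True,
    ranked_summary (FEAT,) = mean features of documents ranked so far."""
    # Generator (policy) samples the next document; the sample acts as
    # the negative example for both discriminators.
    dist = torch.distributions.Categorical(logits=policy(cands).squeeze(-1))
    a = dist.sample()
    fake = cands[a].detach().unsqueeze(0)                       # (1, FEAT)
    real = cands[rel_mask]                                      # (k, FEAT)
    seq_fake = torch.cat([ranked_summary.expand(1, -1), fake], dim=-1)
    seq_real = torch.cat([ranked_summary.expand(len(real), -1), real], dim=-1)

    # Train both discriminators simultaneously, before the decision is
    # finalized: relevant documents -> 1, generator samples -> 0.
    d_loss = (bce(doc_disc(real).squeeze(-1), torch.ones(len(real)))
              + bce(doc_disc(fake).squeeze(-1), torch.zeros(1))
              + bce(seq_disc(seq_real).squeeze(-1), torch.ones(len(real)))
              + bce(seq_disc(seq_fake).squeeze(-1), torch.zeros(1)))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Combined short- and long-term return drives a REINFORCE-style
    # single-step policy-gradient update of the generator.
    with torch.no_grad():
        r = doc_disc(fake).squeeze() + seq_disc(seq_fake).squeeze()
    pg_loss = -dist.log_prob(a) * r
    opt_pi.zero_grad(); pg_loss.backward(); opt_pi.step()
    return a.item()  # index of the document placed at the current position
```

In full training, this step would repeat over the positions of the ranked list for each query, with the chosen document removed from the candidate pool and folded into the running summary; the paper's actual state representation and update schedule may differ.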
