Reinforcement Learning based Recommender Systems: A Survey

M Mehdi Afsar,Behrouz Far,Trafford Crump

doi:10.1145/3543846

Abstract

Recommender systems (RSs) have become an inseparable part of our everyday lives. They help us find our favorite items to purchase, our friends on social networks, and our favorite movies to watch. Traditionally, the recommendation problem was considered to be a classification or prediction problem, but it is now widely agreed that formulating it as a sequential decision problem can better reflect the user-system interaction. Therefore, it can be formulated as a Markov decision process (MDP) and be solved by reinforcement learning (RL) algorithms. Unlike traditional recommendation methods, including collaborative filtering and content-based filtering, RL is able to handle the sequential, dynamic user-system interaction and to take into account the long-term user engagement. Although the idea of using RL for recommendation is not new and has been around for about two decades, it was not very practical, mainly because of scalability problems of traditional RL algorithms. However, a new trend has emerged in the field since the introduction of deep reinforcement learning (DRL) , which made it possible to apply RL to the recommendation problem with large state and action spaces. In this paper, a survey on reinforcement learning based recommender systems (RLRSs) is presented. Our aim is to present an outlook on the field and to provide the reader with a fairly complete knowledge of key concepts of the field. We first recognize and illustrate that RLRSs can be generally classified into RL- and DRL-based methods. Then, we propose an RLRS framework with four components, i.e., state representation, policy optimization, reward formulation, and environment building, and survey RLRS algorithms accordingly. We highlight emerging topics and depict important trends using various graphs and tables. Finally, we discuss important aspects and challenges that can be addressed in the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforcement Learning based Recommender Systems: A Survey

Abstract

Talk to us

Similar Papers

More From: ACM Computing Surveys

Lead the way for us

Journal: ACM Computing Surveys	Publication Date: Dec 15, 2022
Citations: 147

Similar Papers

Hierarchical reinforcement learning for transportation infrastructure maintenance planning
Zachary Hamida ... James-A Goulet
Reliability Engineering & System Safety | VOL. 235
Zachary Hamida, et. al.Zachary Hamida ... James-A Goulet
08 Mar 2023
Reliability Engineering & System Safety | VOL. 235

User Response Models to Improve a REINFORCE Recommender System
Minmin Chen ... Can Xu
-
Minmin Chen, et. al.Minmin Chen ... Can Xu
08 Mar 2021
08 Mar 2021

Improving the Performance of Batch-Constrained Reinforcement Learning in Continuous Action Domains via Generative Adversarial Networks
Baturay Saglam ... Suleyman S Kozat
-
Baturay Saglam, et. al.Baturay Saglam ... Suleyman S Kozat
15 May 2022
15 May 2022

Differentially Private Reinforcement Learning with Linear Function Approximation
Xingyu Zhou
ACM SIGMETRICS Performance Evaluation Review | VOL. 50
Xingyu ZhouXingyu Zhou
20 Jun 2022
ACM SIGMETRICS Performance Evaluation Review | VOL. 50

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning based Recommender Systems: A Survey

Abstract

Talk to us

Similar Papers

More From: ACM Computing Surveys