Self-Supervised Reinforcement Learning for Recommender Systems

Xin Xin, Joemon M Jose, Ioannis Arapakis, Alexandros Karatzoglou

doi:10.1145/3397271.3401147

Abstract

In session-based or sequential recommendation, it is important to consider factors such as long-term user engagement and multiple types of user-item interactions (e.g., clicks and purchases). Current state-of-the-art supervised approaches fail to model them appropriately. Casting the sequential recommendation task as a reinforcement learning (RL) problem is a promising direction. A major component of RL approaches is training the agent through interactions with the environment. However, it is often problematic to train a recommender in an online fashion, because doing so requires exposing users to irrelevant recommendations. As a result, learning the policy from logged implicit feedback is of vital importance, but it is challenging due to the pure off-policy setting and the lack of negative rewards (feedback). In this paper, we propose self-supervised reinforcement learning for sequential recommendation tasks. Our approach augments standard recommendation models with two output layers: one for self-supervised learning and the other for RL. The RL part acts as a regularizer that drives the supervised layer to focus on specific rewards (e.g., recommending items that may lead to purchases rather than clicks), while the self-supervised layer with cross-entropy loss provides strong gradient signals for parameter updates. Based on this approach, we propose two frameworks, namely Self-Supervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC). We integrate the proposed frameworks with four state-of-the-art recommendation models. Experimental results on two real-world datasets demonstrate the effectiveness of our approach.
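
The two-output-layer idea described in the abstract can be illustrated with a minimal sketch. It assumes a shared state vector feeding two linear heads, a plain one-step Q-learning target, and a simple sum of the two losses; the function names, the weight matrices, the reward value, and the discount factor are all hypothetical choices made for illustration, not details taken from the paper.

```python
import math


def linear_head(state, weights):
    """Apply one linear output layer: one score per item (row of `weights`)."""
    return [sum(w * s for w, s in zip(row, state)) for row in weights]


def softmax_cross_entropy(logits, target):
    """Cross-entropy of softmax(logits) against the observed item index."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def td_loss(q_values, action, reward, next_q_values, gamma, done):
    """Squared one-step TD error for the logged action (assumed Q-learning target)."""
    bootstrap = 0.0 if done else gamma * max(next_q_values)
    return (reward + bootstrap - q_values[action]) ** 2


def combined_loss(state, next_state, action, reward, w_sup, w_q,
                  gamma=0.5, done=False):
    """Sum of the self-supervised loss and the RL loss on one logged transition.

    Both heads read the same state representation, so in a trained model
    both losses would shape the shared parameters.
    """
    logits = linear_head(state, w_sup)    # self-supervised head
    q = linear_head(state, w_q)           # RL (Q-value) head
    next_q = linear_head(next_state, w_q)
    ce = softmax_cross_entropy(logits, action)
    td = td_loss(q, action, reward, next_q, gamma, done)
    return ce + td, ce, td


# Hypothetical toy transition: 2-dimensional state, 3 candidate items.
state, next_state = [1.0, 0.0], [0.0, 1.0]
w_sup = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
w_q = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
total, ce, td = combined_loss(state, next_state, action=1, reward=1.0,
                              w_sup=w_sup, w_q=w_q)
print(total, ce, td)
```

In this sketch the reward only enters through the TD term, so a larger reward for a purchase than for a click (an assumed reward scheme) would shift the Q-head toward purchase-leading items, while the cross-entropy term depends only on which item was logged.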

