Leveraging Long Short-Term User Preference in Conversational Recommendation via Multi-agent Reinforcement Learning

Yang Deng,Wai Lam,Yaliang Li,Bolin Ding

doi:10.1109/tkde.2022.3225109

Abstract

Conversational recommender systems (CRS) endow traditional recommender systems with the capability of dynamically obtaining users' short-term preferences for items and attributes through interactive dialogues. There are three core challenges for CRS, including the intelligent decisions for what attributes to ask, which items to recommend, and when to ask or recommend, at each conversation turn. Previous methods mainly leverage reinforcement learning (RL) to learn conversational recommendation policies for solving one or two of these three decision-making problems in CRS with separated conversation and recommendation components. These approaches restrict the scalability and generality of CRS and fall short of preserving a stable training procedure. In the light of these challenges, we tackle these three decision-making problems in CRS as a unified policy learning task. In order to leverage different features that are important to each sub-problem and facilitate better unified policy learning in CRS, we propose two novel multi-agent RL-based frameworks, namely Independent and Hierarchical Multi-Agent UNIfied COnversational RecommeNders (IMA-UNICORN and HMA-UNICORN), respectively. In specific, two low-level agents enrich the state representations for attribute prediction and item recommendation, by combining the long-term user preference information from the historical interaction data and the short-term user preference information from the conversation history. A high-level meta agent is responsible for coordinating the low-level agents to adaptively make the final decision. Experimental results on four benchmark CRS datasets and a real-world E-Commerce application show that the proposed frameworks significantly outperform state-of-the-art methods. Extensive analyses further demonstrate the superior scalability of the MARL frameworks on the multi-round conversational recommendation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Leveraging Long Short-Term User Preference in Conversational Recommendation via Multi-agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Nov 1, 2023
Citations: 3

Similar Papers

Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning
Yang Deng ... Yaliang Li
-
Yang Deng, et. al.Yang Deng ... Yaliang Li
11 Jul 2021
11 Jul 2021

Learning to Infer User Implicit Preference in Conversational Recommendation
Chenhao Hu ... Shuhua Huang
-
Chenhao Hu, et. al.Chenhao Hu ... Shuhua Huang
06 Jul 2022
06 Jul 2022

Critique Generation to Increase Diversity in Conversational Recipe Recommender System
David Wilson ... Nadia Najjar
The International FLAIRS Conference Proceedings | VOL. 34
David Wilson, et. al.David Wilson ... Nadia Najjar
18 Apr 2021
The International FLAIRS Conference Proceedings | VOL. 34

CRSAL
Xuhui Ren ... Hongzhi Yin
ACM Transactions on Information Systems | VOL. 38
Xuhui Ren, et. al.Xuhui Ren ... Hongzhi Yin
13 Jun 2020
ACM Transactions on Information Systems | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging Long Short-Term User Preference in Conversational Recommendation via Multi-agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering