Abstract

The major application areas of reinforcement learning (RL) have traditionally been game playing and continuous control. In recent years, however, RL has been increasingly applied in systems that interact with humans. RL can personalize digital systems to make them more relevant to individual users. Challenges in personalization settings may be different from challenges found in traditional application areas of RL. An overview of work that uses RL for personalization, however, is lacking. In this work, we introduce a framework of personalization settings and use it in a systematic literature review. Besides setting, we review solutions and evaluation strategies. Results show that RL has been increasingly applied to personalization problems and realistic evaluations have become more prevalent. RL has become sufficiently robust to apply in contexts that involve humans and the field as a whole is growing. However, it seems not to be maturing: the ratios of studies that include a comparison or a realistic evaluation are not showing upward trends and the vast majority of algorithms are used only once. This review can be used to find related work across domains, provides insights into the state of the field and identifies opportunities for future work.

Highlights

  • For several decades, both academia and commerce have sought to develop tailored products and services at low cost in various application domains

  • Challenges in personalization settings may be different from challenges found in traditional application areas of reinforcement learning (RL)

  • We introduce a framework of personalization settings and use it in a systematic literature review

Read more

Summary

Introduction

Both academia and commerce have sought to develop tailored products and services at low cost in various application domains. Netflix’s1 digital video delivery mechanism includes tracking of views and ratings These ease the gratification of diverse entertainment needs as they enable Netflix to offer instantaneous personalized content recommendations. As an example from the health domain, [234] achieve optimal per-patient treatment plans to address advanced metastatic stage IIIB/IV non-small cell lung cancer in simulation They state that ‘there is significant potential of the proposed methodology for developing personalized treatment strategies in other cancers, in cystic fibrosis, and in other life-threatening diseases’. This paper provides an overview and categorization of RL applications for personalization across a variety of application domains It aids researchers and practictioners in identifying related work relevant to a specific personalization setting, promotes the understanding of how RL is used for personalization and identifies challenges across domains.

Reinforcement learning for personalization
Algorithms
Multi-armed bandits
Incremental implementation
UCB: Upper-confidence bound
Function approximation
Value-based RL
Q-learning
Policy-gradient RL
Actor-critic
A classification of personalization settings
A systematic literature review
Inclusion criteria
Search strategy
Screening process
Data items
Synthesis and analysis
Results
19 Comparison with ‘no personalization’
Setting
Solution
Evaluation
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call