Bandit algorithms: A comprehensive review and their dynamic selection from a portfolio for multicriteria top-k recommendation

Alexandre Letard, Nicolas Gutowski, Olivier Camp, Tassadit Amghar

doi:10.1016/j.eswa.2024.123151

Abstract

This paper discusses the use of portfolio approaches based on bandit algorithms to optimize multicriteria decision-making in recommender systems, where the criteria are accuracy and diversity. While previous research has primarily focused on single-item recommendations, this study extends that work to the recommendation of several items per iteration. Two methods, Multiple-play Gorthaur and Budgeted-Gorthaur, are proposed to solve the algorithm selection problem, and their performance on real-world datasets is compared. Both methods generalize the Gorthaur method, enabling it to operate with any Multi-Armed Bandit (MAB) or Contextual Multi-Armed Bandit (CMAB) algorithm as the meta-algorithm in a multi-item recommendation scenario. For Multiple-play Gorthaur, an empirical evaluation shows that using Thompson Sampling for algorithm selection (Gorthaur-TS) yields better results than the original EXP3 method (Gorthaur-EXP3) and the exclusive use of the optimal algorithm in the portfolio in contextual recommendation problems. Additionally, the paper includes a theoretical regret analysis, based on the TS sketch proof, applied to this variant of the method. Concerning Budgeted-Gorthaur, experiments show that it allows more flexibility in reaching a suitable trade-off between criteria and gives broader coverage of the Pareto set of solutions, overcoming a natural limit of “a-priori” methods. Finally, this paper provides a detailed review, including pseudocode and theoretical bounds, of all the fundamental MAB and CMAB algorithms used in this study.
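
To make the idea of using Thompson Sampling as a meta-algorithm over a portfolio more concrete, the following is a minimal sketch and not the authors' Gorthaur-TS. It assumes binary rewards, a Beta(1, 1) prior per portfolio algorithm, and one independent selection per recommendation slot; the names `ThompsonSelector` and `simulate`, the parameter `k`, and the success rates are all hypothetical and chosen only for illustration.

```python
import random


class ThompsonSelector:
    """Beta-Bernoulli Thompson Sampling over a portfolio of algorithms.

    Illustrative only: binary rewards and a uniform Beta(1, 1) prior
    are assumptions, not details taken from the paper.
    """

    def __init__(self, n_algorithms, seed=0):
        self.successes = [1.0] * n_algorithms  # Beta alpha parameters
        self.failures = [1.0] * n_algorithms   # Beta beta parameters
        self.rng = random.Random(seed)

    def select(self):
        # Draw one sample from each algorithm's posterior and pick the largest.
        samples = [
            self.rng.betavariate(a, b)
            for a, b in zip(self.successes, self.failures)
        ]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, algorithm, reward):
        # A reward of True counts as a success, False as a failure.
        if reward:
            self.successes[algorithm] += 1.0
        else:
            self.failures[algorithm] += 1.0


def simulate(success_rates, rounds=2000, k=3, seed=0):
    """Simulate `rounds` iterations with `k` recommendation slots each.

    `success_rates` are hypothetical per-algorithm reward probabilities
    standing in for real portfolio algorithms and real user feedback.
    Returns how many times each algorithm was selected.
    """
    selector = ThompsonSelector(len(success_rates), seed=seed)
    env = random.Random(seed + 1)
    pulls = [0] * len(success_rates)
    for _ in range(rounds):
        for _ in range(k):  # assumption: one selection per slot
            chosen = selector.select()
            reward = env.random() < success_rates[chosen]
            selector.update(chosen, reward)
            pulls[chosen] += 1
    return pulls


pulls = simulate([0.05, 0.10, 0.30])
```

In this toy setting the selector should concentrate most of its selections on the algorithm with the highest simulated success rate, which is the behaviour a meta-bandit is meant to exhibit. Handling several criteria at once, such as accuracy and diversity, would require a reward definition that this sketch does not attempt.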


Journal: Expert Systems With Applications
Publication Date: Jan 9, 2024
Citations: 1

Similar Papers

Using Individual Accuracy to Create Context for Non-Contextual Multi-Armed Bandit Problems
Nicolas Gutowski ... Fabien Chhel
01 Mar 2019

Context Enhancement for Linear Contextual Multi-Armed Bandits
Nicolas Gutowski ... Fabien Chhel
01 Nov 2018

Investigation of selection and application of Multi-Armed Bandit algorithms in recommendation system
Panyangjie Chen
Applied and Computational Engineering | VOL. 34
04 Feb 2024

Online learning for self-optimization in heterogeneous networks
José Antonio Ayala Romero
01 Jan 2019
