A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Samarth Gupta,Osman Yagan,Subhojyoti Mukherjee,Shreyas Chaudhari,Gauri Joshi

doi:10.1109/jsait.2020.3041246

Samarth Gupta, Osman Yagan + Show 3 more

Open Access

https://doi.org/10.1109/jsait.2020.3041246

Copy DOI

Abstract

We consider a finite-armed structured bandit problem in which mean rewards of different arms are known functions of a common hidden parameter 8*. Since we do not place any restrictions on these functions, the problem setting subsumes several previously studied frameworks that assume linear or invertible reward functions. We propose a novel approach to gradually estimate the hidden 8* and use the estimate together with the mean reward functions to substantially reduce exploration of sub-optimal arms. This approach enables us to fundamentally generalize any classical bandit algorithm including UCB and Thompson Sampling to the structured bandit setting. We prove via regret analysis that our proposed UCB-C algorithm (structured bandit versions of UCB) pulls only a subset of the suboptimal arms O(log T) times while the other sub-optimal arms (referred to as non-competitive arms) are pulled O(1) times. As a result, in cases where all sub-optimal arms are non-competitive, which can happen in many practical scenarios, the proposed algorithm achieves bounded regret. We also conduct simulations on the MOVIELENS recommendations dataset to demonstrate the improvement of the proposed algorithms over existing structured bandit algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Journal on Selected Areas in Information Theory	Publication Date: Nov 1, 2020
Citations: 50	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Information Theory

Lead the way for us

Similar Papers

A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits
Samarth Gupta ... Subhojyoti Mukherjee
-
Samarth Gupta, et. al.Samarth Gupta ... Subhojyoti Mukherjee
06 Jun 2021
06 Jun 2021

On multi-armed bandits theory and applications
Maryam Aziz
-
Maryam AzizMaryam Aziz
10 May 2021
10 May 2021

Comparative analysis and applications of classic multi-armed bandit algorithms and their variants
Bo Fei
Applied and Computational Engineering | VOL. 68
Bo FeiBo Fei
06 Jun 2024
Applied and Computational Engineering | VOL. 68

Regulation of exploration for simple regret minimization in Monte-Carlo tree search
Yun-Ching Liu ... Yoshimasa Tsuruoka
-
Yun-Ching Liu, et. al.Yun-Ching Liu ... Yoshimasa Tsuruoka
01 Aug 2015
01 Aug 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Information Theory