Adapting bandit algorithms for settings with sequentially available arms

Marco Gabrielli, Manuela Antonelli, Francesco Trovò

doi:10.1016/j.engappai.2023.107815

Abstract

Many real-world applications involve a sequential decision-making process in which the options are presented simultaneously. However, in other applications, such as Internet campaign management and environmental monitoring, the available options are presented sequentially to the decision-maker, who, at each time, is asked whether or not to select the proposed option. This scenario is defined as the Sequential Pull/No-Pull setting. The present study aims at developing a meta-algorithm, namely Sequential Pull/No-pull for MAB (Seq), to adapt any classical MAB (Multi-Armed Bandit) policy to this setting, both in the case of regret minimization (RM) and best-arm identification (BAI) problems. This is achieved by exploiting the sequential nature of these settings, which allows the selection of multiple arms and the gathering of more information than classical policies. The proposed Seq meta-algorithm provides the same theoretical guarantees as the MAB policy employed, but was shown to provide improved performance compared to several classical MAB policies in RM and BAI problems employing real-world data. In particular, in the RM scenario regarding Internet advertising optimization, Seq-adapted algorithms resulted, on average, in ≈10% lower regret over the whole time horizon than classical MAB policies. When tested in a BAI problem involving the identification of the time of day characterized by the highest concentration of pollutants in a water monitoring scenario, Seq identified the correct time in less than 4 days and 28 measurements.
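
The pull/no-pull interaction described in the abstract can be illustrated with a small simulation. The sketch below is not the Seq meta-algorithm from the paper, whose decision rule is not given here. It assumes a hypothetical rule: arms are presented one at a time, and the presented arm is pulled only if a classical base policy (UCB1 is used as an example) currently ranks it first. The names `UCB1` and `run_pull_no_pull`, the Bernoulli rewards, and the arm probabilities are all assumptions made for illustration.

```python
import math
import random


class UCB1:
    """Classical UCB1 statistics: pull counts and empirical mean rewards."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms
        self.means = [0.0] * n_arms

    def update(self, arm, reward):
        # Incremental update of the empirical mean of the pulled arm.
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]

    def index(self, arm, t):
        # Unpulled arms get an infinite index so they are tried first.
        if self.counts[arm] == 0:
            return math.inf
        return self.means[arm] + math.sqrt(2.0 * math.log(t) / self.counts[arm])

    def select(self, t):
        # Arm with the largest UCB1 index (ties go to the lowest arm id).
        return max(range(len(self.counts)), key=lambda a: self.index(a, t))


def run_pull_no_pull(policy, arm_probs, n_rounds, rng):
    """Hypothetical pull/no-pull loop (an assumption, not the paper's rule).

    In every round the arms are presented one by one. The presented arm is
    pulled only if the base policy would choose it at that moment. Because
    statistics are updated after each pull, several arms may be pulled
    within the same round.
    """
    t = 0
    for _ in range(n_rounds):
        for arm, p in enumerate(arm_probs):
            t += 1
            if policy.select(t) == arm:  # decision: pull
                reward = 1.0 if rng.random() < p else 0.0  # Bernoulli reward
                policy.update(arm, reward)
            # otherwise: no-pull, wait for the next presented arm
    return policy.counts


if __name__ == "__main__":
    counts = run_pull_no_pull(UCB1(3), [0.1, 0.3, 0.9], 2000, random.Random(0))
    print("pulls per arm:", counts)
```

Under this assumed rule, every arm is pulled once in the first round, and later pulls concentrate on the arm with the highest empirical index. Any other base policy exposing `select` and `update` could be substituted for `UCB1` in the same loop.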

Journal: Engineering Applications of Artificial Intelligence
Publication Date: Jan 10, 2024
Citations: 1
License type: cc-by

Similar Papers

On Sequential Elimination Algorithms for Best-Arm Identification in Multi-Armed Bandits
Shahin Shahrampour ... Vahid Tarokh
IEEE Transactions on Signal Processing | VOL. 65
08 Sep 2016

Achieving complete learning in Multi-Armed Bandit problems
Sattar Vakili ... Qing Zhao
01 Nov 2013

Adversarial Bandits with Knapsacks
Nicole Immorlica ... Robert Schapire
01 Nov 2019

A day at the races
David E Losada ... David Elsweiler
Applied Intelligence | VOL. 52
17 Aug 2021
