Incentivized Exploration for Multi-Armed Bandits under Reward Drift

Zhiyuan Liu, Kai Liu, Huazheng Wang, Fan Shen, Lijun Chen

doi:10.1609/aaai.v34i04.5937

Open Access

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 5

Affiliation: University of Colorado System, University of Colorado Boulder, University of Virginia

#Greedy Choice #Thompson Sampling

Abstract

We study incentivized exploration for the multi-armed bandit (MAB) problem where the players receive compensation for exploring arms other than the greedy choice and may provide biased feedback on reward. We seek to understand the impact of this drifted reward feedback by analyzing the performance of three instantiations of the incentivized MAB algorithm: UCB, ε-Greedy, and Thompson Sampling. Our results show that they all achieve O(log T) regret and compensation under the drifted reward, and are therefore effective in incentivizing exploration. Numerical examples are provided to complement the theoretical analysis.
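To make the setting concrete, the following is a minimal simulation sketch of an incentivized UCB loop with drifted feedback. It is not the authors' algorithm or experimental code. Several details are assumptions made only for illustration: the player's greedy choice is the arm with the highest empirical mean, the compensation equals the gap in empirical means between the greedy arm and the recommended arm, rewards are Bernoulli, and the drift is a linear upward bias `drift_coef * pay` added to the reported reward. The function name `incentivized_ucb` and all of its parameters are hypothetical.

```python
import math
import random


def incentivized_ucb(means, horizon, drift_coef=0.5, seed=0):
    """Simulate an illustrative incentivized UCB loop under biased feedback.

    Returns (cumulative regret, cumulative compensation).
    All modelling choices here are assumptions, not taken from the paper.
    """
    rng = random.Random(seed)
    k = len(means)
    counts = [0] * k      # number of pulls per arm
    sums = [0.0] * k      # sum of (possibly drifted) feedback per arm
    best = max(means)
    regret = 0.0
    compensation = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            # Initialization: pull each arm once, with no payment.
            arm, pay = t - 1, 0.0
        else:
            avg = [sums[i] / counts[i] for i in range(k)]
            # The myopic player would pick the empirically best arm.
            greedy = max(range(k), key=lambda i: avg[i])
            # The principal recommends the arm with the highest UCB index.
            arm = max(
                range(k),
                key=lambda i: avg[i] + math.sqrt(2 * math.log(t) / counts[i]),
            )
            # Assumed compensation: the empirical-mean gap (zero if arm == greedy).
            pay = avg[greedy] - avg[arm]

        reward = 1.0 if rng.random() < means[arm] else 0.0
        # Assumed drift model: reported feedback is biased upward by the payment.
        feedback = reward + drift_coef * pay

        counts[arm] += 1
        sums[arm] += feedback
        compensation += pay
        regret += best - means[arm]

    return regret, compensation


if __name__ == "__main__":
    for horizon in (1_000, 10_000):
        r, c = incentivized_ucb([0.2, 0.5, 0.8], horizon)
        print(horizon, round(r, 2), round(c, 2))
```

Running the script for increasing horizons gives a rough empirical view of how regret and compensation grow with T in this toy model; setting `drift_coef=0` recovers unbiased feedback for comparison.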
