Combination of Auction Theory and Multi-Armed Bandits: Model, Algorithm, and Application

Guoju Gao,Sheng Zhang,Yu-E Sun,He Huang,Jie Wu,Mingjun Xiao,Sijie Huang

doi:10.1109/tmc.2022.3197459

Abstract

The multi-armed bandit (MAB) models have always received lots of attention from multiple research communities due to their broad application domains. The optimal selection problem with unknown rewards in advance, such as ad recommendation in social networks, spectrum access in the cognitive radio field, etc., can be efficiently solved by using MAB models. In an MAB model, given <inline-formula><tex-math notation="LaTeX">$N$</tex-math></inline-formula> arms whose rewards are unknown in advance, the player selects exactly one arm in each round, and his goal is to maximize the cumulative rewards over a fixed horizon. Further, a more general model called combinatorial MAB (i.e., CMAB), where <inline-formula><tex-math notation="LaTeX">$K$</tex-math></inline-formula> arms can be played simultaneously in each round, is put forward. However, the existing CMAB models neglect the strategic behaviors of the <inline-formula><tex-math notation="LaTeX">$N$</tex-math></inline-formula> arms, which indicates that one arm might report false information to increase its own profits. In fact, in many applications such as user selection in crowdsensing, the arms are not the feelingless machines but the rational individuals. To this end, we combine the upper confidence bound (UCB) with auction theory to develop a new algorithm called auction-based UCB (AUCB). We divide the auction-based CMAB problem into two sub-problems: winning arm selection and payment computation problems. For AUCB, we derive an upper bound on regret and prove the truthfulness in one round, individual rationality, and computational efficiency. In addition, we consider an extended situation that some arms may be unavailable in some rounds and the arms will bid inconsistently in different rounds. We devise another algorithm called eAUCB to solve this problem. Extensive simulations are conducted to show the significant performance of the proposed algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combination of Auction Theory and Multi-Armed Bandits: Model, Algorithm, and Application

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Mobile Computing

Lead the way for us

Journal: IEEE Transactions on Mobile Computing	Publication Date: Jan 1, 2022
Citations: 4

Similar Papers

Auction-Based Combinatorial Multi-Armed Bandit Mechanisms with Strategic Arms
Guoju Gao ... Sheng Zhang
-
Guoju Gao, et. al.Guoju Gao ... Sheng Zhang
10 May 2021
10 May 2021

Enhancing UCB-tuned and Asymptotically Optimal UCB Algorithms through Weighted Average Techniques in Multi-Armed Bandit Scenarios
Chang Qu
Highlights in Science, Engineering and Technology | VOL. 94
Chang QuChang Qu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

In-depth Exploration and Implementation of Multi-Armed Bandit Models Across Diverse Fields
Jiazhen Wu
Highlights in Science, Engineering and Technology | VOL. 94
Jiazhen WuJiazhen Wu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

Multi-armed bandits in the presence of side observations in social networks
Swapna Buccapatnam ... Atilla Eryilmaz
-
Swapna Buccapatnam, et. al.Swapna Buccapatnam ... Atilla Eryilmaz
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combination of Auction Theory and Multi-Armed Bandits: Model, Algorithm, and Application

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Mobile Computing