Multi-armed bandits in the presence of side observations in social networks

Swapna Buccapatnam,Ness B Shroff,Atilla Eryilmaz

doi:10.1109/cdc.2013.6761049

Abstract

We consider the decision problem of an external agent choosing to execute one of M actions for each user in a social network. We assume that observing a user's actions provides valuable information for a larger set of users since each user's preferences are interrelated with those of her social peers. This falls into the well-known setting of the multi-armed bandit (MAB) problems, but with the critical new component of side observations resulting from interactions between users. Our contributions in this work are as follows: 1) We model the MAB problem in the presence of side observations and obtain an asymptotic lower bound (as a function of the network structure) on the regret (loss) of any uniformly good policy that achieves the maximum long term average reward. 2) We propose a randomized policy that explores actions for each user at a rate that is a function of her network position. We show that this policy achieves the asymptotic lower bound on regret associated with actions that are unpopular for all the users. 3) We derive an upper bound on the regret of existing Upper Confidence Bound (UCB) policies for MAB problems modified for our setting of side observations. We present case studies to show that these UCB policies are agnostic of the network structure and this causes their regret to suffer in a network setting. Our investigations in this work reveal the significant gains that can be obtained even through static network-aware policies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-armed bandits in the presence of side observations in social networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Stochastic bandits with side observations on networks
Swapna Buccapatnam ... Atilla Eryilmaz
ACM SIGMETRICS Performance Evaluation Review | VOL. 42
Swapna Buccapatnam, et. al.Swapna Buccapatnam ... Atilla Eryilmaz
16 Jun 2014
ACM SIGMETRICS Performance Evaluation Review | VOL. 42

Stochastic bandits with side observations on networks
Swapna Buccapatnam ... Atilla Eryilmaz
-
Swapna Buccapatnam, et. al.Swapna Buccapatnam ... Atilla Eryilmaz
16 Jun 2014
16 Jun 2014

Enhancing UCB-tuned and Asymptotically Optimal UCB Algorithms through Weighted Average Techniques in Multi-Armed Bandit Scenarios
Chang Qu
Highlights in Science, Engineering and Technology | VOL. 94
Chang QuChang Qu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

In-depth Exploration and Implementation of Multi-Armed Bandit Models Across Diverse Fields
Jiazhen Wu
Highlights in Science, Engineering and Technology | VOL. 94
Jiazhen WuJiazhen Wu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-armed bandits in the presence of side observations in social networks

Abstract

Talk to us

Similar Papers