Thompson Sampling for Factored Multi-Agent Bandits

Timothy Verstraeten ,Eugenio Bargiacchi ,Diederik M Roijers ,Pieter Libin ,Ann Nowé

doi:10.48448/bbnc-n826

Abstract

Multi-agent coordination is prevalent in many real-world applications. However, such coordination is challenging due to its combinatorial nature. An important observation in this regard is that agents in the real world often only directly affect a limited set of neighbouring agents. Leveraging such loose couplings among agents is key to making coordination in multi-agent systems feasible. In this work, we focus on learning to coordinate. Specifically, we consider the multi-agent multi-armed bandit framework, in which fully cooperative loosely-coupled agents must learn to coordinate their decisions to optimize a common objective. We propose multi-agent Thompson sampling (MATS), a new Bayesian exploration-exploitation algorithm that leverages loose couplings. We empirically show that MATS outperforms the state-of-the-art algorithm, MAUCE, on two synthetic benchmarks with Bernoulli-distributed rewards. Our results show that MATS improves significantly upon state-of-the-art coordination methods in terms of performance, demonstrating the value of using MATS in applications with sparse neighbourhood structures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Thompson Sampling for Factored Multi-Agent Bandits

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Thompson Sampling for Factored Multi-Agent Bandits
...
-
, et. al. ...
06 May 2020
06 May 2020

Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures
Timothy Verstraeten ... Eugenio Bargiacchi
Scientific Reports | VOL. 10
Timothy Verstraeten, et. al.Timothy Verstraeten ... Eugenio Bargiacchi
21 Apr 2020
Scientific Reports | VOL. 10

A Fast Machine Learning for 5G Beam Selection for Unmanned Aerial Vehicle Applications
...
-
, et. al. ...
06 Jun 2020
06 Jun 2020

Bayesian Algorithms for Decentralized Stochastic Bandits
Anusha Lalitha ... Andrea Goldsmith
IEEE Journal on Selected Areas in Information Theory | VOL. 2
Anusha Lalitha, et. al.Anusha Lalitha ... Andrea Goldsmith
01 Jun 2021
IEEE Journal on Selected Areas in Information Theory | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Thompson Sampling for Factored Multi-Agent Bandits

Abstract

Talk to us

Similar Papers