Factored Online Planning in Many-Agent POMDPs

Maris F.L Galesloot,Nils Jansen,Sebastian Junges,Thiago D Simão

doi:10.1609/aaai.v38i16.29689

Abstract

In centralized multi-agent systems, often modeled as multi-agent partially observable Markov decision processes (MPOMDPs), the action and observation spaces grow exponentially with the number of agents, making the value and belief estimation of single-agent online planning ineffective. Prior work partially tackles value estimation by exploiting the inherent structure of multi-agent settings via so-called coordination graphs. Additionally, belief estimation methods have been improved by incorporating the likelihood of observations into the approximation. However, the challenges of value estimation and belief estimation have only been tackled individually, which prevents existing methods from scaling to settings with many agents. Therefore, we address these challenges simultaneously. First, we introduce weighted particle filtering to a sample-based online planner for MPOMDPs. Second, we present a scalable approximation of the belief. Third, we bring an approach that exploits the typical locality of agent interactions to novel online planning algorithms for MPOMDPs operating on a so-called sparse particle filter tree. Our experimental evaluation against several state-of-the-art baselines shows that our methods (1) are competitive in settings with only a few agents and (2) improve over the baselines in the presence of many agents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Factored Online Planning in Many-Agent POMDPs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

A Long-Term Earthquake Rate Model for the Central and Eastern United States from Smoothed Seismicity
Morgan P Moschetti
Bulletin of the Seismological Society of America | VOL. 105
Morgan P MoschettiMorgan P Moschetti
10 Nov 2015
Bulletin of the Seismological Society of America | VOL. 105

Multiagent Bayesian Deep Reinforcement Learning for Microgrid Energy Management Under Communication Failures
Hao Zhou ... Ivona Brandic
IEEE Internet of Things Journal | VOL. 9
Hao Zhou, et. al.Hao Zhou ... Ivona Brandic
15 Jul 2022
IEEE Internet of Things Journal | VOL. 9

Experimental Investigation on SI Engine Emissions via EGR and Catalytic Converter with Air Injection Mechanism
Mvs Murali Krishna ... S Narasimha Kumar
Journal of Mechanical Engineering | VOL. 16
Mvs Murali Krishna, et. al.Mvs Murali Krishna ... S Narasimha Kumar
01 Apr 2019
Journal of Mechanical Engineering | VOL. 16

Performance of line start permanent magnet synchronous motor with single-phase supply system
B.N Chaudhari ... B.G Fernandes
IEE Proceedings - Electric Power Applications | VOL. 151
B.N Chaudhari, et. al.B.N Chaudhari ... B.G Fernandes
01 Jan 2004
IEE Proceedings - Electric Power Applications | VOL. 151

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Factored Online Planning in Many-Agent POMDPs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence