Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

Frans Oliehoek,Matthijs Spaan

doi:10.1609/aaai.v26i1.8257

Abstract

Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful framework for optimal decision making under the assumption of instantaneous communication. We focus on a delayed communication setting (MPOMDP-DC), in which broadcasted information is delayed by at most one time step. This model allows agents to act on their most recent (private) observation. Such an assumption is a strict generalization over having agents wait until the global information is available and is more appropriate for applications in which response time is critical. In this setting, however, value function backups are significantly more costly, and naive application of incremental pruning, the core of many state-of-the-art optimal POMDP techniques, is intractable. In this paper, we overcome this problem by demonstrating that computation of the MPOMDP-DC backup can be structured as a tree and introducing two novel tree-based pruning techniques that exploit this structure in an effective way. We experimentally show that these methods have the potential to outperform naive incremental pruning by orders of magnitude, allowing for the solution of larger problems.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Sep 20, 2021
Citations: 12

Similar Papers

Unsupervised anomaly detection in time series exploiting local and global information
Emanuele La Malfa ... Gabriele La Malfa
-
Emanuele La Malfa, et. al.Emanuele La Malfa ... Gabriele La Malfa
01 Jan 2019
01 Jan 2019

Anticipating by pigeons depends on local statistical information in a serial response time task.
Alyson L Froehlich ... Walter T Herbranson
Journal of experimental psychology. General | VOL. 133
Alyson L Froehlich, et. al.Alyson L Froehlich ... Walter T Herbranson
01 Jan 2004
Journal of experimental psychology. General | VOL. 133

In This Issue
-
Operations Research | VOL. 60
--
01 Oct 2012
Operations Research | VOL. 60

Use of fuzzy logic to increase stability in control volume-based facility modeling
Joseph Sheeley
-
Joseph SheeleyJoseph Sheeley
08 Jan 2001
08 Jan 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence