Abstract

We introduce the Bayes-Adaptive Interactive Partially Observable Markov Decision Process (BA-IPOMDP), the first multiagent decision model that explicitly incorporates model learning. As in I-POMDPs, the BA-IPOMDP agent maintains beliefs over interactive states, which include the physical states as well as the other agents’ models. The BA-IPOMDP assumes that the state transition and observation probabilities are unknown, and augments the interactive states to include these parameters. Beliefs are maintained over this augmented interactive state space. This (necessary) state expansion exacerbates the curse of dimensionality, especially since each I-POMDP belief update is already a recursive procedure: an agent invokes belief updates from other agents’ perspectives as part of its own belief update, in order to anticipate other agents’ actions. We extend the interactive particle filter to perform approximate belief updates on BA-IPOMDPs. We present our findings on the multiagent Tiger problem.
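To make the augmented-state idea concrete, the sketch below shows a Bayes-adaptive particle filter on a tiger-style domain, assuming a simplified setting: each particle carries the physical state, a flattened stand-in for the other agent's model, and the unknown transition/observation parameters. All names (`p_stay`, `p_correct`, `other_policy`, etc.) are hypothetical, and the recursive nesting of I-POMDP belief updates is omitted for brevity; this is an illustration of the augmentation, not the paper's implementation.

```python
# Minimal sketch (NOT the authors' implementation) of a Bayes-adaptive
# particle filter over augmented interactive states. The other agent's model
# is flattened to a fixed policy label, and the recursive I-POMDP belief
# update is omitted; only the parameter-learning mechanism is illustrated.
import random

random.seed(0)

STATES = ["tiger-left", "tiger-right"]

def sample_particle():
    """One particle: physical state, other agent's (flattened) model, and
    unknown transition/observation parameters drawn from a uniform prior."""
    return {
        "state": random.choice(STATES),
        "other_policy": random.choice(["listen-mostly", "open-mostly"]),
        # p_stay: hypothetical prob. the tiger stays put between steps
        "p_stay": random.random(),
        # p_correct: hypothetical prob. a growl indicates the correct side
        "p_correct": random.random(),
    }

def transition(p):
    """Sample the next physical state under the particle's own parameters."""
    if random.random() > p["p_stay"]:
        p["state"] = "tiger-left" if p["state"] == "tiger-right" else "tiger-right"

def obs_likelihood(p, obs):
    """Likelihood of the observation given the particle's state/parameters."""
    correct = (p["state"] == "tiger-left") == (obs == "growl-left")
    return p["p_correct"] if correct else 1.0 - p["p_correct"]

def belief_update(particles, obs):
    """Propagate, weight by observation likelihood, and resample.

    Because the unknown parameters ride along inside each particle,
    resampling concentrates mass on parameter values consistent with the
    observations: model learning falls out of the ordinary filtering step."""
    for p in particles:
        transition(p)
    weights = [obs_likelihood(p, obs) for p in particles]
    total = sum(weights)
    if total == 0:  # degenerate case: all particles inconsistent, restart
        return [sample_particle() for _ in particles]
    return [dict(random.choices(particles, weights)[0]) for _ in particles]

# Usage: filter a short observation stream and inspect the posterior.
particles = [sample_particle() for _ in range(1000)]
for obs in ["growl-left", "growl-left", "growl-right", "growl-left"]:
    particles = belief_update(particles, obs)

p_left = sum(p["state"] == "tiger-left" for p in particles) / len(particles)
mean_correct = sum(p["p_correct"] for p in particles) / len(particles)
print(f"P(tiger-left) ~ {p_left:.2f}, E[p_correct] ~ {mean_correct:.2f}")
```

In the full BA-IPOMDP setting, the `other_policy` field would instead be a nested model whose own belief update is invoked recursively, which is what the interactive particle filter extension approximates.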
