Bayesian Learning of Other Agents' Finite Controllers for Interactive POMDPs

Alessandro Panella, Piotr Gmytrasiewicz

doi:10.1609/aaai.v30i1.10136

Abstract

We consider an autonomous agent operating in a stochastic, partially observable, multiagent environment that explicitly models the other agents as probabilistic deterministic finite-state controllers (PDFCs) in order to predict their actions. We assume that such models are not given to the agent but must instead be learned from (possibly imperfect) observations of the other agents' behavior. The agent maintains a belief over the other agents' models, which is updated via Bayesian inference. To represent this belief, we place a flexible stick-breaking distribution over PDFCs, which allows the posterior to concentrate around controllers whose size is not bounded and scales with the complexity of the observed data. Since this Bayesian inference task is not analytically tractable, we devise a Markov chain Monte Carlo algorithm to approximate the posterior distribution. The agent then embeds the result of this inference into its own decision-making process using the interactive POMDP framework. We show that our learning algorithm can learn agent models that are behaviorally accurate for problems of varying complexity, and that the agent's performance increases as a result.
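
To make the idea of a stick-breaking prior over controllers more concrete, here is a minimal Python sketch. It is not the construction used in the paper, whose details the abstract does not give. It assumes a generic truncated stick-breaking scheme in which next-node choices are drawn from shared stick-breaking weights, so that a few nodes tend to be reused, and it assumes that each node emits actions from its own distribution while node transitions are deterministic given the observation. The names `stick_breaking_weights`, `sample_controller`, `alpha`, and `truncation` are all hypothetical.

```python
import random


def stick_breaking_weights(alpha, truncation, rng):
    """Draw truncated stick-breaking weights with concentration `alpha`."""
    weights = []
    remaining = 1.0
    for _ in range(truncation - 1):
        fraction = rng.betavariate(1.0, alpha)  # portion broken off the stick
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    weights.append(remaining)  # leftover mass, so the weights sum to 1
    return weights


def sample_controller(num_observations, num_actions, alpha, truncation, rng):
    """Sample one finite controller from the illustrative prior (an assumption).

    Returns (transitions, actions):
      transitions[(node, obs)] -> next node (deterministic given the observation)
      actions[node]            -> list of action probabilities for that node
    """
    weights = stick_breaking_weights(alpha, truncation, rng)
    node_ids = range(truncation)
    transitions = {}
    reachable = [0]  # node 0 is taken as the start node
    seen = {0}
    index = 0
    # Instantiate only the nodes reachable from the start node.
    while index < len(reachable):
        node = reachable[index]
        index += 1
        for obs in range(num_observations):
            next_node = rng.choices(node_ids, weights=weights)[0]
            transitions[(node, obs)] = next_node
            if next_node not in seen:
                seen.add(next_node)
                reachable.append(next_node)
    actions = {}
    for node in reachable:
        # Normalized Gamma(1, 1) draws give a uniform Dirichlet sample.
        draws = [rng.gammavariate(1.0, 1.0) for _ in range(num_actions)]
        total = sum(draws)
        actions[node] = [d / total for d in draws]
    return transitions, actions


if __name__ == "__main__":
    rng = random.Random(0)
    transitions, actions = sample_controller(2, 3, alpha=2.0, truncation=20, rng=rng)
    print("number of reachable nodes:", len(actions))
```

In this sketch a smaller `alpha` puts most of the mass on the first few sticks, so sampled controllers tend to have fewer reachable nodes; a larger `alpha` spreads the mass and tends to produce larger controllers. The truncation is only a computational convenience here and says nothing about how the paper handles unbounded controller size.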

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 3, 2016
Citations: 4

Similar Papers

Interactive POMDPs with finite-state models of other agents
Alessandro Panella ... Piotr Gmytrasiewicz
Autonomous Agents and Multi-Agent Systems | VOL. 31
25 Jan 2017

Bayesian data analysis for agricultural experiments
X Che ... S Xu
Canadian Journal of Plant Science | VOL. 90
01 Sep 2010

Sequential Monte-Carlo algorithms for Bayesian model calibration – A review and method comparison
Matthias Speich ... Florian Hartig
Ecological Modelling | VOL. 455
05 Jun 2021

Markov chain Monte Carlo methods: Theory and practice
David A Spade
01 Jan 2020
