Incremental Policy Generation for Finite-Horizon DEC-POMDPs

Christopher Amato,Shlomo Zilberstein,Jilles Dibangoye

doi:10.1609/icaps.v19i1.13355

Abstract

Solving multiagent planning problems modeled as DEC-POMDPs is an important challenge. These models are often solved by using dynamic programming, but the high resource usage of current approaches results in limited scalability. To improve the efficiency of dynamic programming algorithms, we propose a new backup algorithm that is based on a reachability analysis of the state space. This method, which we call incremental policy generation, can be used to produce an optimal solution for any possible initial state or further scalability can be achieved by making use of a known start state. When incorporated into the optimal dynamic programming algorithm, our experiments show that planning horizon can be increased due to a marked reduction in resource consumption. This approach also fits nicely with approximate dynamic programming algorithms. To demonstrate this, we incorporate it into the state-of-the-art PBIP algorithm and show significant performance gains. The results suggest that the performance of other dynamic programming algorithms for DEC-POMDPs could be similarly improved by integrating the incremental policy generation approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Incremental Policy Generation for Finite-Horizon DEC-POMDPs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling

Lead the way for us

Journal: Proceedings of the International Conference on Automated Planning and Scheduling	Publication Date: Oct 16, 2009
Citations: 43

Similar Papers

Transmission Optimization for Hybrid Half/Full-Duplex Relay With Energy Harvesting
Jie Gong ... Xiang Chen
IEEE Transactions on Wireless Communications | VOL. 17
Jie Gong, et. al.Jie Gong ... Xiang Chen
01 May 2018
IEEE Transactions on Wireless Communications | VOL. 17

Use of Approximate Dynamic Programming for Production Optimization
Benjamin Van Roy ... Zheng Wen
-
Benjamin Van Roy, et. al.Benjamin Van Roy ... Zheng Wen
21 Feb 2011
21 Feb 2011

A Fast Algorithm in Exponential Change-Points Model with Comparison
Kuo-Ching Chang ... Chung-Bow Lee
-
Kuo-Ching Chang, et. al.Kuo-Ching Chang ... Chung-Bow Lee
01 Dec 2010
01 Dec 2010

Finite‐Horizon ε‐Optimal Tracking Control of Discrete‐Time Linear Systems Using Iterative Approximate Dynamic Programming
Fuxiao Tan ... Xinping Guan
Asian Journal of Control | VOL. 17
Fuxiao Tan, et. al.Fuxiao Tan ... Xinping Guan
23 Jan 2014
Asian Journal of Control | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Incremental Policy Generation for Finite-Horizon DEC-POMDPs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling