Decomposing Large-Scale POMDP Via Belief State Analysis

X Li,J Liu,W.K Cheung

doi:10.1109/iat.2005.63

Abstract

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing the optimal policy for a large-scale POMDP is known to be intractable. Belief compression, being an approximate solution, has recently been proposed to reduce the dimension of POMDP's belief state space and shown to be effective in improving the problem tractability. In this paper, with the conjecture that temporally close belief states could be characterized by a lower intrinsic dimension, we propose a spatio-temporal brief clustering that considers both the belief states' spatial (in the belief space) and temporal similarities, as well as incorporate it into the belief compression algorithm. The proposed clustering results in belief state clusters as sub-POMDPs of much lower dimension so as to be distributed to a set of distributed agents for collaborative problem solving. The proposed method has been tested using a synthesized navigation problem (Hallway2) and empirically shown to be able to result in policies of superior long-term rewards when compared with those based on solely belief compression. Some future research directions for extending this belief state analysis approach are also included.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decomposing Large-Scale POMDP Via Belief State Analysis

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Algorithms for partially observable Markov decision processes
Weihong Zhang
-
Weihong ZhangWeihong Zhang
23 Dec 2014
23 Dec 2014

Partially observed Markov decision processes (POMDPs)
Vikram Krishnamurthy
-
Vikram KrishnamurthyVikram Krishnamurthy
01 Jan 2015
01 Jan 2015

Integrating Value-Directed Compression and Belief Space Analysis for POMDP Decomposition
Xin Li ... William Cheung
-
Xin Li, et. al.Xin Li ... William Cheung
01 Jan 2006
01 Jan 2006

A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems
Lei Zheng ... Siu-Yeung Cho
Neural Processing Letters | VOL. 33
Lei Zheng, et. al.Lei Zheng ... Siu-Yeung Cho
19 Feb 2011
Neural Processing Letters | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decomposing Large-Scale POMDP Via Belief State Analysis

Abstract

Talk to us

Similar Papers