Abstract
Decision making is one of the central problems in artificial intelligence, and specifically in robotics. In most cases this problem comes with uncertainty, both in the data received by the decision maker/agent and in the actions performed in the environment. One effective approach is to model the environment and the agent as a Partially Observable Markov Decision Process (POMDP). POMDPs have a wide range of applications, including machine vision, marketing, network troubleshooting, and medical diagnosis. In recent years, there has been significant interest in developing techniques for finding policies for POMDPs. We consider two new techniques, called Recursive Point Filter (RPF) and Scan Line Filter (SCF), based on the Incremental Pruning (IP) POMDP solver, to provide an alternative to the Linear Programming (LP) filter used by IP. Both RPF and SCF found solutions for several POMDP problems on which LP could not converge within 24 hours. Experiments are run on problems from the POMDP literature, and an Average Discounted Reward (ADR) is computed by testing each policy in a simulated environment.
Highlights
One of the most challenging tasks of an intelligent decision maker or agent is planning, or choosing how to act in the course of its interactions with the environment
There has been significant interest in developing techniques for finding policies for POMDPs. We consider two new techniques, called Recursive Point Filter (RPF) and Scan Line Filter (SCF), based on the Incremental Pruning (IP) Partially Observable Markov Decision Process (POMDP) solver, to provide an alternative to the Linear Programming (LP) filter for IP
We considered two new filtering techniques, called Recursive Point Filter (RPF) and Scan Line Filter (SCF), for the Incremental Pruning (IP) POMDP solver, to provide an alternative to the Linear Programming (LP) filter
Summary
One of the most challenging tasks of an intelligent decision maker or agent is planning, or choosing how to act in the course of its interactions with the environment. Such agent/environment interactions can often be effectively modelled as a Partially Observable Markov Decision Process (POMDP). Operations research [1,2] and stochastic control [3] are two domains where this model is applied to balance competing objectives, action costs, uncertainty in action effects, and observations that provide incomplete knowledge about the world. Planning, in the context of a POMDP, corresponds to finding an optimal policy for the agent to follow.
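For context, the LP filter that RPF and SCF are proposed to replace is, in standard Incremental Pruning, a dominance test: each candidate alpha-vector is kept only if a small linear program can find a belief point where it beats every other vector. The sketch below is a minimal illustration of that standard test, not the paper's implementation; the function name lp_dominates, the tolerance, and the example vectors are our own assumptions.

# Minimal sketch (assumed, not from the paper) of the standard LP
# dominance test used to filter alpha-vectors in Incremental Pruning.
import numpy as np
from scipy.optimize import linprog

def lp_dominates(w, U):
    """Return a witness belief where vector w beats every u in U, or None.

    Solves: max d  s.t.  b.(w - u) >= d  for all u in U,
                          b in the probability simplex.
    """
    n = len(w)
    # Variables: [b_1..b_n, d]; linprog minimizes, so the objective is -d.
    c = np.zeros(n + 1)
    c[-1] = -1.0
    # Constraint rows encode b.(u - w) + d <= 0 for every u in U.
    A_ub = np.array([np.append(u - w, 1.0) for u in U])
    b_ub = np.zeros(len(U))
    # Belief probabilities must sum to 1; d is unconstrained.
    A_eq = np.append(np.ones(n), 0.0).reshape(1, -1)
    b_eq = np.array([1.0])
    bounds = [(0.0, 1.0)] * n + [(None, None)]
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
    if res.success and -res.fun > 1e-9:  # d > 0: w is useful somewhere
        return res.x[:n]
    return None

# Hypothetical example: filter a set of 2-state alpha-vectors.
vectors = [np.array(v) for v in ([1.0, 0.0], [0.0, 1.0], [0.4, 0.4])]
useful = [w for i, w in enumerate(vectors)
          if lp_dominates(w, vectors[:i] + vectors[i+1:]) is not None]
print(len(useful))  # the dominated [0.4, 0.4] vector is pruned -> 2

Because this test solves one LP per candidate vector at every pruning step, it can dominate the running time of IP on hard problems, which is the motivation for the point-based RPF and SCF alternatives the paper introduces.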