Value-Function Approximations for Partially Observable Markov Decision Processes

M Hauskrecht

doi:10.1613/jair.678

Abstract

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a set of imperfect or noisy observations. The modeling advantage of POMDPs, however, comes at a price -- exact methods for solving them are computationally very expensive and thus applicable in practice only to very simple problems. We focus on efficient approximation (heuristic) methods that attempt to alleviate the computational problem and trade off accuracy for speed. We have two objectives here. First, we survey various approximation methods, analyze their properties and relations and provide some new insights into their differences. Second, we present a number of new approximation methods and novel refinements of existing techniques. The theoretical results are supported by experiments on a problem from the agent navigation domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Artificial Intelligence Research	Publication Date: Aug 1, 2000
Citations: 524	License type: cc-by

R Discovery Prime

R Discovery Prime

Value-Function Approximations for Partially Observable Markov Decision Processes

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research

Lead the way for us

Similar Papers

Online Planning Algorithms for POMDPs
S Ross ... B Chaib-Draa
Journal of Artificial Intelligence Research | VOL. 32
S Ross, et. al.S Ross ... B Chaib-Draa
29 Jul 2008
Journal of Artificial Intelligence Research | VOL. 32

Author response: Alternation emerges as a multi-modal strategy for turbulent odor navigation
Gautam Reddy ... Agnese Seminara
-
Gautam Reddy, et. al.Gautam Reddy ... Agnese Seminara
12 Jul 2022
12 Jul 2022

Approximability and efficient algorithms for constrained fixed-horizon POMDPs with durative actions
Majid Khonji
Artificial Intelligence | VOL. 323
Majid KhonjiMajid Khonji
18 Jul 2023
Artificial Intelligence | VOL. 323

Online Partial Conditional Plan Synthesis for POMDPs With Safe-Reachability Objectives: Methods and Experiments
Yue Wang ... Juan David Hernandez
IEEE Transactions on Automation Science and Engineering | VOL. 18
Yue Wang, et. al.Yue Wang ... Juan David Hernandez
01 Jul 2021
IEEE Transactions on Automation Science and Engineering | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Value-Function Approximations for Partially Observable Markov Decision Processes

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research