Online Planning Algorithms for POMDPs

S Ross, S Paquet, J Pineau, B Chaib-Draa

doi:10.1613/jair.2567

Open Access

Journal: Journal of Artificial Intelligence Research
Publication Date: Jul 29, 2008
Citations: 479
License type: publisher-specific-oa

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, because of their complexity, solving POMDPs is often intractable except for small problems. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during execution. Online algorithms generally consist of a lookahead search to find the best action to execute at each time step in an environment. Our objectives here are to survey the various existing online POMDP methods, analyze their properties, and discuss their advantages and disadvantages; and to thoroughly evaluate these online approaches in different environments under various metrics (return, error bound reduction, lower bound improvement). Our experimental results indicate that state-of-the-art online heuristic search methods can handle large POMDP domains efficiently.
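
The lookahead search mentioned in the abstract can be illustrated with a minimal sketch. It is not any of the surveyed algorithms: it is a plain exhaustive, depth-limited expansion over actions and observations, without the bounds or heuristics that the state-of-the-art methods rely on. The two-state "tiger"-style model, its probabilities and rewards, and all function names below are assumptions chosen only for illustration.

```python
from typing import List, Tuple

# Hypothetical toy POMDP (tiger-style); every number here is an assumption.
STATES = ["tiger-left", "tiger-right"]
ACTIONS = ["listen", "open-left", "open-right"]
OBSERVATIONS = ["hear-left", "hear-right"]
GAMMA = 0.95  # discount factor


def transition(s: str, a: str, s2: str) -> float:
    """P(s' | s, a): listening keeps the state; opening a door resets it uniformly."""
    if a == "listen":
        return 1.0 if s == s2 else 0.0
    return 0.5


def observation(a: str, s2: str, o: str) -> float:
    """P(o | a, s'): listening is 85% accurate; other actions are uninformative."""
    if a == "listen":
        correct = (s2 == "tiger-left") == (o == "hear-left")
        return 0.85 if correct else 0.15
    return 0.5


def reward(s: str, a: str) -> float:
    """R(s, a): small cost to listen, large penalty for opening the tiger's door."""
    if a == "listen":
        return -1.0
    opened_tiger = (a == "open-left") == (s == "tiger-left")
    return -100.0 if opened_tiger else 10.0


def belief_update(b: List[float], a: str, o: str) -> Tuple[List[float], float]:
    """Bayes filter: return the posterior belief and P(o | b, a)."""
    unnorm = []
    for s2 in STATES:
        predicted = sum(b[i] * transition(s, a, s2) for i, s in enumerate(STATES))
        unnorm.append(observation(a, s2, o) * predicted)
    p_o = sum(unnorm)
    if p_o == 0.0:
        return b, 0.0  # observation impossible under (b, a)
    return [x / p_o for x in unnorm], p_o


def q_value(b: List[float], a: str, depth: int) -> float:
    """Expected discounted return of taking a in belief b, then acting greedily."""
    immediate = sum(b[i] * reward(s, a) for i, s in enumerate(STATES))
    if depth == 1:
        return immediate
    future = 0.0
    for o in OBSERVATIONS:
        b2, p_o = belief_update(b, a, o)
        if p_o > 0.0:
            future += p_o * lookahead(b2, depth - 1)[1]
    return immediate + GAMMA * future


def lookahead(b: List[float], depth: int) -> Tuple[str, float]:
    """Return the best action at belief b and its value for a depth-limited search."""
    best = max(ACTIONS, key=lambda a: q_value(b, a, depth))
    return best, q_value(b, best, depth)
```

An online agent would call `lookahead` on its current belief at every time step, execute the returned action, and then call `belief_update` with the observation it receives. The cost of this naive expansion grows exponentially with the depth, which is the motivation for pruning and heuristic search in the online setting.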

Similar Papers

Point-based online value iteration algorithm in large POMDP
Bo Wu ... Hong-Yan Zheng
Applied Intelligence | VOL. 40
13 Oct 2013

A Novel Point-Based Incremental Pruning Algorithm for POMDP
Bo Wu ... Hong Yan Zheng
Applied Mechanics and Materials | VOL. 513-517
06 Feb 2014

A POMDP Approximation Algorithm That Anticipates the Need to Observe
Valentina Bayer Zubek ... Thomas Dietterich
01 Jan 1999

Approximability and efficient algorithms for constrained fixed-horizon POMDPs with durative actions
Majid Khonji
Artificial Intelligence | VOL. 323
18 Jul 2023
