Importance sampling for online planning under uncertainty

Yuanfu Luo, Wee Sun Lee, Haoyu Bai, David Hsu

doi:10.1177/0278364918780322

Abstract

The partially observable Markov decision process (POMDP) provides a principled general framework for robot planning under uncertainty. Leveraging the idea of Monte Carlo sampling, recent POMDP planning algorithms have scaled up to various challenging robotic tasks, including real-time online planning for autonomous vehicles. To further improve online planning performance, this paper presents IS-DESPOT, which introduces importance sampling to DESPOT, a state-of-the-art sampling-based POMDP algorithm for planning under uncertainty. Importance sampling improves DESPOT’s performance when there are critical but rare events that are difficult to sample. We prove that IS-DESPOT retains the theoretical guarantee of DESPOT. We demonstrate empirically that importance sampling significantly improves the performance of online POMDP planning for suitable tasks. We also present a general method for learning the importance sampling distribution.
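The role of importance sampling with rare but critical events can be illustrated with a small, generic Monte Carlo sketch. This is not IS-DESPOT or any code from the paper; it only shows the underlying idea of sampling the rare event more often and correcting with the weight p/q. All names and numbers here (`P_RARE`, `Q_RARE`, `PENALTY`, the two estimator functions) are assumptions chosen for illustration.

```python
import random

# Illustrative assumptions, not values from the paper.
P_RARE = 0.001     # true probability of the rare critical event
Q_RARE = 0.5       # probability of the event under the sampling (proposal) distribution
PENALTY = -1000.0  # value received when the rare event occurs, 0 otherwise

TRUE_VALUE = P_RARE * PENALTY  # exact expected value, for comparison


def value(event_occurs):
    """Value of one sampled scenario."""
    return PENALTY if event_occurs else 0.0


def naive_estimate(n, rng):
    """Plain Monte Carlo: sample the event with its true probability."""
    total = 0.0
    for _ in range(n):
        occurs = rng.random() < P_RARE
        total += value(occurs)
    return total / n


def importance_estimate(n, rng):
    """Importance sampling: sample with Q_RARE, reweight each sample by p/q."""
    total = 0.0
    for _ in range(n):
        occurs = rng.random() < Q_RARE
        if occurs:
            weight = P_RARE / Q_RARE
        else:
            weight = (1.0 - P_RARE) / (1.0 - Q_RARE)
        total += weight * value(occurs)
    return total / n


if __name__ == "__main__":
    rng = random.Random(0)
    print("true value:         ", TRUE_VALUE)
    print("naive estimate:     ", naive_estimate(10_000, rng))
    print("importance estimate:", importance_estimate(10_000, rng))
```

With the same number of samples, the naive estimator sees the rare event only a handful of times, so its estimate moves in coarse steps, while the reweighted estimator observes the event in roughly half of its samples and stays unbiased because of the p/q correction.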

Journal: The International Journal of Robotics Research
Publication Date: Jun 19, 2018
Citations: 38

Similar Papers

DESPOT: Online POMDP Planning with Regularization
Nan Ye ... Adhiraj Somani
Journal of Artificial Intelligence Research | VOL. 58
26 Jan 2017

Hybrid Heuristic Online Planning for POMDPs
Zong-Zhang Zhang ... Xiao-Ping Chen
Journal of Software | VOL. 24
16 Jan 2014

POMDPs for robotic tasks with mixed observability
S C W Ong ... S W Png
28 Jun 2009

Robotic manipulation of multiple objects as a POMDP
Joni Pajarinen ... Ville Kyrki
Artificial Intelligence | VOL. 247
03 Apr 2015
