Information Gathering and Reward Exploitation of Subgoals for POMDPs

Hang Ma, Joelle Pineau

doi:10.1609/aaai.v29i1.9659

Abstract

Planning in large partially observable Markov decision processes (POMDPs) is challenging, especially when a long planning horizon is required. A few recent algorithms successfully tackle this case, but at the expense of a weaker information-gathering capacity. In this paper, we propose Information Gathering and Reward Exploitation of Subgoals (IGRES), a randomized POMDP planning algorithm that leverages information in the state space to automatically generate "macro-actions" for tasks with long planning horizons, while locally exploring the belief space to allow effective information gathering. Experimental results show that IGRES is an effective multi-purpose POMDP solver, providing state-of-the-art performance on benchmark domains for both long-horizon planning tasks and information-gathering tasks. Additional experiments with an ecological adaptive management problem indicate that IGRES is a promising tool for POMDP planning in real-world settings.
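
For readers unfamiliar with the belief space the abstract refers to: a POMDP belief is a probability distribution over states, updated by Bayes' rule after each action and observation. The following is a minimal sketch of that standard update only, not of IGRES or its macro-action generation. The two-state model, the array layout, and all names and numbers are hypothetical and chosen purely for illustration.

```python
import numpy as np


def belief_update(belief, T, Z, action, observation):
    """Standard discrete POMDP belief update (Bayes filter).

    Assumed (hypothetical) array layout:
      T[a, s, s2] = P(s2 | s, a)   transition model
      Z[a, s2, o] = P(o | s2, a)   observation model
    """
    # Prediction step: sum over s of b(s) * T[a, s, s2]
    predicted = belief @ T[action]
    # Correction step: weight each next state by the observation likelihood
    unnormalized = Z[action, :, observation] * predicted
    total = unnormalized.sum()
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return unnormalized / total


# Hypothetical toy model: 2 states, 1 action, 2 observations.
# The action leaves the state unchanged and yields a noisy reading of it.
T = np.array([[[1.0, 0.0],
               [0.0, 1.0]]])
Z = np.array([[[0.85, 0.15],
               [0.15, 0.85]]])

b0 = np.array([0.5, 0.5])
b1 = belief_update(b0, T, Z, action=0, observation=0)
```

In this toy model the action gathers information without changing the state, so the belief shifts toward the state that best explains the observation. Information-gathering tasks of the kind the abstract mentions reward plans that take such actions when the belief is too uncertain.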


Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence
Publication Date: Mar 4, 2015
Citations: 6
