The factored policy-gradient planner

Olivier Buffet, Douglas Aberdeen

doi:10.1016/j.artint.2008.11.008

Abstract

We present an any-time concurrent probabilistic temporal planner (CPTP) that includes continuous and discrete uncertainties and metric functions. Rather than relying on dynamic programming, our approach builds on methods from stochastic local policy search; that is, we optimise a parameterised policy using gradient ascent. The flexibility of this policy-gradient approach, combined with its low memory use, its use of function approximation methods and the factorisation of the policy, allows us to tackle complex domains. This factored policy-gradient (FPG) planner can optimise steps to goal, the probability of success, or a combination of both. We compare the FPG planner to other planners on CPTP domains, and on simpler but better-studied non-concurrent, non-temporal probabilistic planning (PP) domains. We also present FPG-ipc, the PP version of the planner, which was successful in the probabilistic track of the fifth international planning competition.
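
As a rough illustration of what optimising a factored, parameterised policy by gradient ascent can look like, here is a minimal sketch. It is not the authors' implementation. It assumes a one-step toy problem with two hypothetical actions, each controlled by its own independent Bernoulli policy with a single parameter, and an invented reward of 1 when the first action is started and the second is not. The function name `train`, the step size and the running-average baseline are all assumptions made for the example.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def train(num_episodes=5000, step_size=0.1, seed=0):
    """Likelihood-ratio (REINFORCE-style) gradient ascent on a toy problem.

    Hypothetical setup: two actions, each with its own one-parameter
    Bernoulli policy (the "factored" part). Reward is 1 only when
    action 0 is started and action 1 is not.
    """
    rng = random.Random(seed)
    theta = [0.0, 0.0]   # one parameter per action policy
    baseline = 0.0       # running average of reward, reduces variance

    for _ in range(num_episodes):
        # Each factor decides independently whether to start its action.
        probs = [sigmoid(t) for t in theta]
        choices = [1 if rng.random() < p else 0 for p in probs]

        # Invented toy reward, shared by all factors.
        reward = 1.0 if choices == [1, 0] else 0.0
        advantage = reward - baseline

        # For a Bernoulli policy with p = sigmoid(theta),
        # d/dtheta log pi(choice) = choice - p.
        for i in range(len(theta)):
            theta[i] += step_size * advantage * (choices[i] - probs[i])

        baseline += 0.05 * (reward - baseline)

    return [sigmoid(t) for t in theta]


if __name__ == "__main__":
    p_start_first, p_start_second = train()
    print(p_start_first, p_start_second)
```

Each factor updates only its own parameter from the shared reward signal, so memory grows with the number of actions rather than with the number of joint action combinations. That is the general appeal of a factored policy in this toy setting.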

Journal: Artificial Intelligence
Publication Date: Nov 27, 2008
Citations: 84
License type: publisher-specific-oa

Similar Papers

Deterministic planning in the fifth international planning competition: PDDL3 and experimental evaluation of the planners
Alfonso E Gerevini ... Yannis Dimopoulos
Artificial Intelligence | VOL. 173
21 Nov 2008

Review on Certain Group Planning Domain of IPC
Nurbol Luktarhan ... Nan Nan Xie
Applied Mechanics and Materials | VOL. 275-277
01 Jan 2013

The first learning track of the international planning competition
Alan Fern ... Prasad Tadepalli
Machine Learning | VOL. 84
31 Jan 2011

Planning Through Stochastic Local Search and Temporal Action Graphs in LPG
A Gerevini ... I Serina
Journal of Artificial Intelligence Research | VOL. 20
01 Dec 2003
