A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

Warren B Powell,Jun Ma

doi:10.1007/s11768-011-0313-y

Abstract

We review the literature on approximate dynamic programming, with the goal of better understanding the theory behind practical algorithms for solving dynamic programs with continuous and vector-valued states and actions and complex information processes. We build on the literature that has addressed the well-known problem of multidimensional (and possibly continuous) states, and the extensive literature on model-free dynamic programming, which also assumes that the expectation in Bellman’s equation cannot be computed. However, we point out complications that arise when the actions/controls are vector-valued and possibly continuous. We then describe some recent research by the authors on approximate policy iteration algorithms that offer convergence guarantees (with technical assumptions) for both parametric and nonparametric architectures for the value function.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

Abstract

Talk to us

Similar Papers

More From: Journal of Control Theory and Applications

Lead the way for us

Journal: Journal of Control Theory and Applications	Publication Date: Jul 19, 2011
Citations: 121

Similar Papers

A convergent recursive least squares approximate policy iteration algorithm for multi-dimensional Markov decision process with continuous state and action spaces
Jun Ma ... Warren B Powell
-
Jun Ma, et. al.Jun Ma ... Warren B Powell
01 Mar 2009
01 Mar 2009

Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem
Naiming Qi ... Feng Wu
-
Naiming Qi, et. al.Naiming Qi ... Feng Wu
01 Jul 2014
01 Jul 2014

Heuristic Dynamic Programming Nonlinear Optimal Controller
...
-
, et. al. ...
01 Jan 2009
01 Jan 2009

Dynamic Programming, Numerical
John Rust
-
John RustJohn Rust
15 Feb 2017
15 Feb 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications

Abstract

Talk to us

Similar Papers

More From: Journal of Control Theory and Applications