Abstract

In this paper, a forward method is introduced for solving Bellman's dynamic programming equations. This is in contrast to most existing dynamic programming methods, which solve the problem backward in time. A key advantage is that the forward dynamic programming approach can be systematically simplified to provide computation/optimality trade-offs. Such trade-offs are lacking in backward iterative methods, which tend to be “all or nothing” propositions. A second advantage is that the computation is independent of the state dimension. Together, these properties offer some promise for circumventing the “curse of dimensionality” on many problems of practical interest. Because of a strong connection with Bellman's work on policy iteration, the method is denoted the Iteration in Policy Space (IPS) algorithm. Several examples are given to demonstrate the general usefulness of the method.
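
For readers unfamiliar with the policy-iteration machinery the abstract refers to, the sketch below shows classical policy iteration on a small finite Markov decision process. It is only an illustration of the Bellman equations and the evaluate/improve cycle, not the paper's forward IPS algorithm; the transition tensor P, reward table R, and discount factor are hypothetical values invented for the example.

```python
# Minimal sketch of classical policy iteration on a hypothetical finite MDP.
# This illustrates the Bellman equations referenced in the abstract; it is
# NOT the paper's forward IPS method. All data below is invented.
import numpy as np

n_states, n_actions, gamma = 4, 2, 0.9
rng = np.random.default_rng(0)

# P[a, s, s'] = probability of moving from s to s' under action a (hypothetical)
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))
# R[a, s] = expected immediate reward for taking action a in state s (hypothetical)
R = rng.uniform(0.0, 1.0, size=(n_actions, n_states))

def policy_evaluation(policy):
    """Solve the linear Bellman equations V = R_pi + gamma * P_pi V exactly."""
    P_pi = P[policy, np.arange(n_states)]   # (n_states, n_states)
    R_pi = R[policy, np.arange(n_states)]   # (n_states,)
    return np.linalg.solve(np.eye(n_states) - gamma * P_pi, R_pi)

def policy_iteration():
    policy = np.zeros(n_states, dtype=int)  # start from an arbitrary policy
    while True:
        V = policy_evaluation(policy)
        # Greedy improvement: Q[a, s] = R[a, s] + gamma * sum_s' P[a, s, s'] V[s']
        Q = R + gamma * (P @ V)
        new_policy = Q.argmax(axis=0)
        if np.array_equal(new_policy, policy):  # fixed point reached
            return policy, V
        policy = new_policy

pi_star, V_star = policy_iteration()
print("optimal policy:", pi_star)
print("optimal values:", V_star)
```

Classical policy iteration of this kind proceeds backward in the sense criticized in the abstract: the value function is computed globally over the whole state space at every step, which is what the forward IPS approach aims to relax.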
