Lower bounding aggregation and direct computation for an infinite horizon one-reservoir model

Bernard F Lamond,Pascal Lang

doi:10.1016/0377-2217(96)00262-7

Abstract

We present a specialized policy iteration method for the computation of optimal and approximately optimal policies for a discrete-time model of a single reservoir whose discharges generate hydroelectric power. The model is described in (Lamond et al., 1995) and (Drouin et al., 1996), where the special structure of optimal policies is given and an approximate value iteration method is presented, using piecewise affine approximations of the optimal return functions. Here, we present a finite method for computing an optimal policy in O( n 3) arithmetic operations, where n is the number of states in the associated Markov decision process, and a finite method for computing a lower bound on the optimal value function in O( m 2 n) where m is the number of nodes of the piecewise affine approximation.

Full Text