Abstract

This paper demonstrates how a Markov decision process (MDP) can be approximated to generate a policy bound, i.e., a function that bounds the optimal policy from below or from above for all states. We present sufficient conditions for several computationally attractive approximations to generate rigorous policy bounds. These approximations include approximating the optimal value function, replacing the original MDP with a separable approximate MDP, and approximating a stochastic MDP with its deterministic counterpart. An example from the field of fisheries management demonstrates the practical applicability of the results.
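Here a lower policy bound is a function pi_L with pi_L(s) <= pi*(s) for every state s, where pi* is the optimal policy, and an upper bound is defined symmetrically. As a rough illustration of the third approximation, the sketch below builds a hypothetical discretized fisheries MDP (the stock grid, growth rule, shock distribution, and harvest reward are assumptions for illustration, not the paper's model), solves both the stochastic model and its deterministic counterpart by value iteration, and compares the resulting harvest policies. Only under sufficient conditions of the kind the paper establishes would the deterministic policy bound the stochastic optimum state by state; the sketch itself merely computes and prints the two policies.

import numpy as np

# Hypothetical toy fisheries model (illustrative assumptions, not the paper's example).
# State: fish stock on a discrete grid; action: harvest amount <= stock.
# Escapement e = stock - harvest grows by a logistic-style rule, then is scaled
# by a multiplicative shock z (stochastic case) or by E[z] (deterministic case).

GRID = np.linspace(0.0, 10.0, 51)       # discretized stock / harvest levels
SHOCKS = np.array([0.8, 1.0, 1.2])      # multiplicative growth shocks
PROBS = np.array([0.25, 0.5, 0.25])     # shock probabilities (mean shock = 1.0)
BETA = 0.95                             # discount factor

def growth(e):
    # Deterministic stock-recruitment rule applied to escapement e.
    return np.clip(e + 0.5 * e * (1.0 - e / 10.0), GRID[0], GRID[-1])

def nearest(x):
    # Index of the grid point closest to each stock level in x.
    return np.abs(GRID[:, None] - np.atleast_1d(x)[None, :]).argmin(axis=0)

def solve(shocks, probs, iters=500):
    # Value iteration; returns the greedy policy (harvest index for each state).
    n = len(GRID)
    V = np.zeros(n)
    for _ in range(iters):
        Q = np.full((n, n), -np.inf)
        for s, stock in enumerate(GRID):
            for a, harvest in enumerate(GRID):
                if harvest > stock:
                    break                 # cannot harvest more than the stock
                nxt = nearest(growth(stock - harvest) * shocks)
                Q[s, a] = np.sqrt(harvest) + BETA * (probs @ V[nxt])
        V = Q.max(axis=1)
    return Q.argmax(axis=1)

pi_stoch = solve(SHOCKS, PROBS)                               # original stochastic MDP
pi_det = solve(np.array([PROBS @ SHOCKS]), np.array([1.0]))   # deterministic counterpart

print("stock  harvest(stochastic)  harvest(deterministic)")
for s in range(0, len(GRID), 10):
    print(f"{GRID[s]:5.1f}  {GRID[pi_stoch[s]]:19.1f}  {GRID[pi_det[s]]:21.1f}")

Whether the deterministic policy printed above lies weakly below (or above) the stochastic one at every stock level depends on structural properties of the model, which is exactly the kind of condition the paper's results are meant to verify.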
