Time aggregated Markov decision processes via standard dynamic programming

Edilson F Arruda,Marcelo D Fragoso

doi:10.1016/j.orl.2011.03.006

Time aggregated Markov decision processes via standard dynamic programming

Edilson F Arruda, Marcelo D Fragoso

https://doi.org/10.1016/j.orl.2011.03.006

Copy DOI

Journal: Operations Research Letters	Publication Date: Mar 23, 2011
Citations: 6

Affiliation: Laboratório Nacional de Computação Científica, Pontifícia Universidade Católica do Rio Grande do Sul

#Finite State Markov Decision Processes #Standard Dynamic Programming + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This note addresses the time aggregation approach to ergodic finite state Markov decision processes with uncontrollable states. We propose the use of the time aggregation approach as an intermediate step toward constructing a transformed MDP whose state space is comprised solely of the controllable states. The proposed approach simplifies the iterative search for the optimal solution by eliminating the need to define an equivalent parametric function, and results in a problem that can be solved by simpler, standard MDP algorithms.

Full Text