A probabilistic analysis of bias optimality in unichain Markov decision processes

M.E. Lewis, M.L. Puterman

doi:10.1109/9.898698

Journal: IEEE Transactions on Automatic Control	Publication Date: Jan 1, 2001
Citations: 47

#Relative Value Functions #Finite State Markov Decision Processes

Abstract

This paper focuses on bias optimality in unichain Markov decision processes with finite state and action spaces. Using relative value functions, we present methods for evaluating optimal bias. This leads to a probabilistic analysis which transforms the original reward problem into a minimum average cost problem. The result is an explanation of how and why bias implicitly discounts future rewards.
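
As an illustration of the quantities named in the abstract, the following minimal sketch computes the gain, a relative value function, and the bias of one fixed policy on a unichain. It uses the textbook evaluation equations g + h(s) - sum_j P(s, j) h(j) = r(s) and is not the evaluation method developed in the paper. The two-state transition matrix `P`, the reward vector `r`, and the helper names `solve`, `stationary_distribution`, and `gain_and_bias` are assumptions made up for this example.

```python
from fractions import Fraction


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination using exact Fractions.

    Raises StopIteration if the matrix is singular (no nonzero pivot found).
    """
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next(k for k in range(col, n) if M[k][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        for k in range(n):
            if k != col and M[k][col] != 0:
                f = M[k][col]
                M[k] = [a - f * c for a, c in zip(M[k], M[col])]
    return [M[i][n] for i in range(n)]


def stationary_distribution(P):
    """Stationary distribution pi of a unichain transition matrix P."""
    n = len(P)
    # First n-1 balance equations: sum_i pi_i * (P[i][j] - [i == j]) = 0.
    A = [[P[i][j] - (1 if i == j else 0) for i in range(n)] for j in range(n - 1)]
    # Normalization: the probabilities sum to one.
    A.append([1] * n)
    b = [0] * (n - 1) + [1]
    return solve(A, b)


def gain_and_bias(P, r):
    """Return (gain, relative value function, bias) for a fixed policy."""
    n = len(P)
    pi = stationary_distribution(P)
    # Unknowns x = (g, h[1], ..., h[n-1]); h[0] is pinned to 0.
    A = [[1] + [(1 if i == j else 0) - P[i][j] for j in range(1, n)]
         for i in range(n)]
    x = solve(A, r)
    g, h_rel = x[0], [Fraction(0)] + x[1:]
    # The bias is the relative value function shifted so that pi . bias = 0.
    shift = sum(p * h for p, h in zip(pi, h_rel))
    bias = [h - shift for h in h_rel]
    return g, h_rel, bias


# Hypothetical two-state chain induced by some fixed policy.
P = [[Fraction(1, 2), Fraction(1, 2)],
     [Fraction(1, 4), Fraction(3, 4)]]
r = [3, 0]
g, h_rel, bias = gain_and_bias(P, r)
print(g, h_rel, bias)
```

For this chain the stationary distribution is (1/3, 2/3), so the gain is 1. The relative value function pinned at state 0 is (0, -4), and the bias is (8/3, -4/3). The relative value function and the bias differ only by a constant, which is why relative values are a convenient starting point for evaluating bias.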

Full Text