Abstract

We consider Markov decision processes (MDPs) with Borel state and action spaces and universally measurable policies. For several long-run average cost criteria and two classes of MDPs, we prove sufficient conditions for the optimal average cost functions to be constant almost everywhere with respect to certain σ-finite measures. Besides suitable boundedness conditions on the positive parts of the one-stage costs, the key condition here is that each subset of states with positive measure be reachable with probability one under some policy. Our proofs exploit an inequality for the optimal average cost functions and its connection with submartingales, and, in a special case that involves stationary policies, also use the theory of recurrent Markov chains.
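The inequality alluded to is standard in average-cost MDP theory; a hedged sketch, with notation assumed rather than taken from the paper: write $g^*$ for the optimal average cost function and $q(dy \mid x, a)$ for the transition kernel. If $g^*$ satisfies

```latex
% Assumed notation: g^* = optimal average cost, q(dy | x, a) = transition kernel.
g^*(x) \;\le\; \int g^*(y)\, q(dy \mid x, a)
\qquad \text{for all actions } a,
```

then under any policy the sequence $g^*(x_n)$ evaluated along the state process satisfies

```latex
\mathbb{E}\bigl[\, g^*(x_{n+1}) \mid x_0, \dots, x_n \,\bigr] \;\ge\; g^*(x_n),
```

i.e., it is a submartingale. Combined with a reachability condition of the kind described in the abstract, submartingale convergence arguments can then force $g^*$ to be constant off a null set.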
