Abstract

In this paper, a mapping is developed between the ‘multichain’ and ‘unichain’ linear programs for average reward Markov decision processes (MDPs) with multiple constraints on average expected costs. Our approach exploits the communicating property of MDPs. The mapping is used not only to prove that the unichain linear program solves average reward communicating MDPs with multiple constraints on average expected costs, but also to show that the optimal gain for such communicating MDPs is constant, i.e. independent of the initial state.
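For illustration, a standard statement of the unichain linear program with cost constraints is sketched below; the notation (state-action frequencies x(s,a), reward r, costs c_k, cost budgets d_k, and transition kernel p) is generic and may not match the paper's own formulation.

\[
\begin{aligned}
\max_{x \ge 0} \quad & \sum_{s \in S} \sum_{a \in A(s)} r(s,a)\, x(s,a) \\
\text{s.t.} \quad & \sum_{a \in A(j)} x(j,a) \;-\; \sum_{s \in S} \sum_{a \in A(s)} p(j \mid s,a)\, x(s,a) = 0 && \forall j \in S, \\
& \sum_{s \in S} \sum_{a \in A(s)} x(s,a) = 1, \\
& \sum_{s \in S} \sum_{a \in A(s)} c_k(s,a)\, x(s,a) \le d_k && k = 1,\dots,K.
\end{aligned}
\]

Here x(s,a) is interpreted as the long-run frequency with which state s is visited and action a is taken; the first constraint set enforces stationarity of these frequencies, the second normalizes them to a probability distribution, and the last imposes the K average-cost constraints.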
