Abstract

In this paper, we investigate partially observable zero-sum games in which the state process is a discrete-time Markov chain. We consider a general utility function in the optimization criterion. We show the existence of the value for both finite- and infinite-horizon games and also establish the existence of optimal policies. The main step involves converting the partially observable game into a completely observable game that also keeps track of the total discounted accumulated reward/cost.
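To illustrate the kind of construction the last sentence refers to, here is a minimal sketch in our own notation (the hidden chain X_n, discount factor beta, utility U, running reward r, and belief mu_n are our assumptions, not the paper's); it also assumes the running reward is observed by both players.

% Payoff criterion: a general utility U applied to the total discounted reward
% accumulated over the horizon (notation is ours, not the paper's).
\[
  J(\pi,\sigma) \;=\; \mathbb{E}^{\pi,\sigma}\!\left[ U\!\left( \sum_{n=0}^{N-1} \beta^{\,n}\, r(X_n, a_n, b_n) \right) \right].
\]
% One way to obtain a completely observable game: augment the belief state
% mu_n (the conditional law of X_n given the observation history) with the
% reward s_n accumulated so far.
\[
  Z_n \;=\; (\mu_n,\, s_n), \qquad s_{n+1} \;=\; s_n + \beta^{\,n}\, r(X_n, a_n, b_n), \qquad s_0 = 0.
\]
% On the augmented state space the criterion reduces to a terminal evaluation
% \mathbb{E}[\, U(s_N) \,], so value and optimal policies can be analyzed with
% completely observable zero-sum game arguments.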
