Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria

José Daniel López-Barrientos

doi:10.1007/s40305-014-0061-z

Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria

José Daniel López-Barrientos

Open Access

https://doi.org/10.1007/s40305-014-0061-z

Copy DOI

Journal: Journal of the Operations Research Society of China	Publication Date: Nov 28, 2014
Citations: 26

Affiliation: Universidad Anáhuac

#Policy Iteration Algorithm #Nondegenerate Diffusion + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper studies the policy iteration algorithm (PIA) for zero-sum stochastic differential games with the basic long-run average criterion, as well as with its more selective version, the so-called bias criterion. The system is assumed to be a nondegenerate diffusion. We use Lyapunov-like stability conditions that ensure the existence and boundedness of the solution to certain Poisson equation. We also ensure the convergence of a sequence of such solutions, of the corresponding sequence of policies, and, ultimately, of the PIA.

Full Text