Abstract
We analyze a modified version of the Nesterov accelerated gradient algorithm that applies to affine fixed point problems with non-self-adjoint matrices, such as those arising in the theory of Markov decision processes with discounted or mean payoff criteria. We characterize the spectra of matrices for which this algorithm converges with an accelerated asymptotic rate. We also introduce a $d$th-order algorithm and show that it yields a multiply accelerated rate under more demanding conditions on the spectrum. We then apply these methods to develop accelerated schemes for nonlinear fixed point problems arising from Markov decision processes, and illustrate them by numerical experiments.
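The accelerated affine fixed point iteration discussed in the abstract can be sketched as follows. This is a minimal illustrative implementation, assuming the standard Nesterov momentum schedule $\beta_k = (k-1)/(k+2)$ applied to the map $x \mapsto Mx + b$; the paper's exact scheme and parameter choices may differ.

```python
import numpy as np

def nesterov_fixed_point(M, b, x0, iters=500):
    """Nesterov-style accelerated iteration for the affine fixed point
    problem x = M x + b (illustrative sketch, not the paper's exact scheme).
    Convergence depends on the spectrum of M, as the abstract discusses."""
    x_prev = x0.copy()
    x = x0.copy()
    for k in range(1, iters + 1):
        beta = (k - 1) / (k + 2)       # standard Nesterov momentum weight
        y = x + beta * (x - x_prev)    # extrapolation step
        x_prev, x = x, M @ y + b       # apply the affine map at the extrapolated point
    return x

# Example (hypothetical data): a discounted Markov chain value problem,
# M = gamma * P with a stochastic matrix P and discount factor gamma < 1,
# so the spectral radius of M is below 1.
gamma = 0.9
P = np.array([[0.5, 0.5],
              [0.2, 0.8]])
M = gamma * P
b = np.array([1.0, 2.0])

x_star = np.linalg.solve(np.eye(2) - M, b)   # exact fixed point for comparison
x_acc = nesterov_fixed_point(M, b, np.zeros(2))
```

Here the eigenvalues of $M$ are real and lie in $(0,1)$, a case in which momentum-type acceleration is well behaved; for non-self-adjoint matrices with complex spectra, convergence is precisely the question the paper characterizes.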