Abstract

We study the constrained Markov decision process (CMDP) problem, in which an agent aims to maximize its expected cumulative reward subject to constraints on its expected utilities/costs. We propose a new primal-dual approach with a novel integration of entropy regularization and Nesterov's accelerated gradient method. The proposed approach is shown to converge to the global optimum with a complexity of Õ(1/ε) in terms of both the optimality gap and the constraint violation, which improves the complexity of existing primal-dual approaches by a factor of O(1/ε).
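To make the general recipe concrete, the sketch below illustrates one standard instantiation of an entropy-regularized primal-dual scheme on a toy tabular CMDP: the inner (primal) step solves the entropy-regularized Lagrangian via soft value iteration, and the outer (dual) step updates the multiplier with a Nesterov-style extrapolation. All quantities (the random CMDP, the regularization weight `tau`, the step size `eta`, the momentum schedule) are illustrative assumptions; this is not the paper's algorithm or its analysis, only a minimal demonstration of the ingredients the abstract names.

```python
import numpy as np

# Hypothetical toy CMDP: 3 states, 2 actions (all quantities are made up).
np.random.seed(0)
nS, nA, gamma, tau = 3, 2, 0.9, 0.1          # tau: entropy-regularization weight
P = np.random.dirichlet(np.ones(nS), size=(nS, nA))  # P[s, a] = next-state dist.
r = np.random.rand(nS, nA)                   # reward to maximize
g = np.random.rand(nS, nA)                   # utility, constrained to V_g >= b
b = 2.0
rho = np.ones(nS) / nS                       # initial-state distribution

def soft_policy(lam):
    """Soft value iteration for the entropy-regularized Lagrangian reward r + lam*g."""
    R = r + lam * g
    V = np.zeros(nS)
    for _ in range(300):
        Q = R + gamma * P @ V                # Q[s, a]; P @ V has shape (nS, nA)
        m = Q.max(axis=1)                    # stabilized soft Bellman backup
        V = m + tau * np.log(np.exp((Q - m[:, None]) / tau).sum(axis=1))
    pi = np.exp((Q - V[:, None]) / tau)      # softmax (Boltzmann) policy
    return pi / pi.sum(axis=1, keepdims=True)

def value(pi, reward):
    """Exact discounted value of policy pi for a given reward table."""
    Ppi = np.einsum('sab,sa->sb', P, pi)     # state transition matrix under pi
    rpi = (pi * reward).sum(axis=1)
    return rho @ np.linalg.solve(np.eye(nS) - gamma * Ppi, rpi)

# Dual descent on lam with a Nesterov-style extrapolation (illustrative choice).
lam, lam_prev, eta = 1.0, 1.0, 0.5
for t in range(200):
    mu = lam + (t / (t + 3)) * (lam - lam_prev)   # momentum / extrapolation step
    pi = soft_policy(mu)
    slack = value(pi, g) - b                      # dual gradient: constraint slack
    lam_prev, lam = lam, max(0.0, mu - eta * slack)  # project onto lam >= 0

pi = soft_policy(lam)
print(f"reward value: {value(pi, r):.3f}, constraint slack: {value(pi, g) - b:.3f}")
```

The design point this illustrates is the one the abstract highlights: entropy regularization makes the inner maximization smooth and strongly concave in the policy (so it admits a fast, well-conditioned solver), which in turn lets an accelerated method be applied on the dual side.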
