Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors

Héctor Jasso-Fuentes,Raquiel R. López-Martínez,J. Adolfo Minjárez-Sosa

doi:10.1080/02331934.2022.2130699

Abstract

ABSTRACT This paper addresses a class of discrete-time Markov decision processes in Borel spaces with a finite number of cost constraints. The constrained control model considers costs of discounted type with state-dependent discount factors which are subject to external disturbances. Our objective is to prove the existence of optimal control policies and characterize them according to certain optimality criteria. Specifically, by rewriting appropriately our original constrained problem as a new one on a space of occupation measures, we apply the direct method to show solvability. Next, the problem is defined as a convex program, and we prove that the existence of a saddle point of the associated Lagrangian operator is equivalent to the existence of an optimal control policy for the constrained problem. Finally, we turn our attention to multi-objective optimization problems, where the existence of Pareto optimal policies can be obtained from the existence of saddle-points of the aforementioned Lagrangian or equivalently from the existence of optimal control policies of constrained problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors

Abstract

Talk to us

Similar Papers

More From: Optimization

Lead the way for us

Journal: Optimization	Publication Date: Oct 8, 2022
Citations: 3

Similar Papers

Constrained Markov Decision Processes with Non-constant Discount Factor
Héctor Jasso-Fuentes ... Tomás Prieto-Rumeau
Journal of Optimization Theory and Applications | VOL. 202
Héctor Jasso-Fuentes, et. al.Héctor Jasso-Fuentes ... Tomás Prieto-Rumeau
30 May 2024
Journal of Optimization Theory and Applications | VOL. 202

Constrained Markov decision processes in Borel spaces: from discounted to average optimality
Armando F Mendoza-Pérez ... Omar A De-La-Cruz Courtois
Mathematical Methods of Operations Research | VOL. 84
Armando F Mendoza-Pérez, et. al.Armando F Mendoza-Pérez ... Omar A De-La-Cruz Courtois
20 Jun 2016
Mathematical Methods of Operations Research | VOL. 84

Discrete-time control with non-constant discount factor
Héctor Jasso-Fuentes ... Tomás Prieto-Rumeau
Mathematical Methods of Operations Research | VOL. 92
Héctor Jasso-Fuentes, et. al.Héctor Jasso-Fuentes ... Tomás Prieto-Rumeau
27 Jun 2020
Mathematical Methods of Operations Research | VOL. 92

Average cost Markov Decision Processes: Optimality conditions
O Hernández-Lerma
Journal of Mathematical Analysis and Applications | VOL. 158
O Hernández-LermaO Hernández-Lerma
01 Jul 1991
Journal of Mathematical Analysis and Applications | VOL. 158

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors

Abstract

Talk to us

Similar Papers

More From: Optimization