Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Dimitri P Bertsekas

doi:10.1109/tac.2019.2896049

Abstract

In this paper, we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases several classical models, such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties, and we emphasize shortest path-type models. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, whereas for others it is not. In one line of analysis, we impose assumptions guaranteeing that the noncontractive policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of solution of Bellman's equation, and the validity of forms of value and policy iteration. In the absence of these assumptions, the results are weaker and unusual in character: the optimal cost function need not be a solution of Bellman's equation, and may not be found by value or policy iteration. Instead the optimal cost function over just the contractive policies is the largest solution of Bellman's equation, and can be computed by a variety of algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control

Lead the way for us

Journal: IEEE Transactions on Automatic Control	Publication Date: Aug 1, 2019
Citations: 28

Similar Papers

A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
Huizhen Yu ... Dimitri P Bertsekas
Mathematics of Operations Research | VOL. 40
Huizhen Yu, et. al.Huizhen Yu ... Dimitri P Bertsekas
01 Oct 2015
Mathematics of Operations Research | VOL. 40

Regular Policies in Abstract Dynamic Programming
Dimitri P Bertsekas
SIAM Journal on Optimization | VOL. 27
Dimitri P BertsekasDimitri P Bertsekas
01 Jan 2017
SIAM Journal on Optimization | VOL. 27

Proper Policies in Infinite-State Stochastic Shortest Path Problems
Dimitri P Bertsekas
IEEE Transactions on Automatic Control | VOL. 63
Dimitri P BertsekasDimitri P Bertsekas
01 Nov 2018
IEEE Transactions on Automatic Control | VOL. 63

On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes
Huizhen Yu
SIAM Journal on Control and Optimization | VOL. 53
Huizhen YuHuizhen Yu
01 Jan 2015
SIAM Journal on Control and Optimization | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control