Semi-Markov decision problems and performance sensitivity analysis

Xi-Ren Cao

doi:10.1109/tac.2003.811252

Abstract

Recent research indicates that Markov decision processes (MDPs) can be viewed from a sensitivity point of view, and that perturbation analysis (PA), MDPs, and reinforcement learning (RL) are three closely related areas in the optimization of discrete-event dynamic systems that can be modeled as Markov processes. The goal of this paper is twofold. First, we develop the PA theory for semi-Markov processes (SMPs); second, we extend the aforementioned results on the relation among PA, MDPs, and RL to SMPs. In particular, we show that the performance sensitivity formulas and policy iteration algorithms of semi-Markov decision processes can be derived from the performance potential and the realization matrix. Both the long-run average and discounted-cost problems are considered. This approach provides a unified framework for both problems, in which the long-run average problem corresponds to the discount factor being zero. The results indicate that performance sensitivities and optimization depend only on first-order statistics. Single-sample-path-based implementations are discussed.
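As a rough illustration of the performance-potential idea mentioned in the abstract, the sketch below works in the simpler discrete-time Markov chain setting, not the semi-Markov setting the paper treats. It assumes an ergodic chain with transition matrix P and reward vector f, solves the Poisson equation (I - P + e*pi) g = f for the potential g, and evaluates the directional derivative of the long-run average reward as pi (P_new - P) g. All function names and the example matrices are hypothetical and chosen only for illustration.

```python
def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                k = M[r][c] / M[c][c]
                M[r] = [x - k * y for x, y in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def stationary(P):
    """Stationary distribution pi with pi P = pi and sum(pi) = 1."""
    n = len(P)
    # Rows of (I - P)^T, with the last equation replaced by normalization.
    A = [[(1.0 if i == j else 0.0) - P[j][i] for j in range(n)]
         for i in range(n)]
    A[-1] = [1.0] * n
    return solve(A, [0.0] * (n - 1) + [1.0])


def potentials(P, f):
    """Return (pi, g), where g solves (I - P + e pi) g = f."""
    n = len(P)
    pi = stationary(P)
    A = [[(1.0 if i == j else 0.0) - P[i][j] + pi[j] for j in range(n)]
         for i in range(n)]
    return pi, solve(A, f)


def average_reward(P, f):
    """Long-run average reward eta = pi f."""
    return sum(p * x for p, x in zip(stationary(P), f))


def derivative(P, P_new, f):
    """Derivative of eta along P + delta (P_new - P) at delta = 0."""
    pi, g = potentials(P, f)
    n = len(P)
    return sum(pi[i] * (P_new[i][j] - P[i][j]) * g[j]
               for i in range(n) for j in range(n))


if __name__ == "__main__":
    # Hypothetical two-state example.
    P = [[0.5, 0.5], [0.2, 0.8]]
    P_new = [[0.9, 0.1], [0.6, 0.4]]
    f = [1.0, 3.0]
    print("average reward:", average_reward(P, f))
    print("derivative toward P_new:", derivative(P, P_new, f))
```

The derivative uses only quantities of the current chain (pi and g), which is what makes a sensitivity-based view of policy comparison possible in this simplified setting; the paper's semi-Markov formulas may differ in form.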

Journal: IEEE Transactions on Automatic Control
Publication Date: May 1, 2003
Citations: 70

Similar Papers

Semi-Markov and Jump Markov Controlled Models: Average Cost Criterion
M Yu Kitayev
Theory of Probability & Its Applications | VOL. 30
01 Jun 1986

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
01 Apr 1967

VMDP-program for solving optimality problems in vector criterion Markov and semi-Markov decision processes
J Novák
Optimization | VOL. 22
01 Jan 1991

Performance sensitivity analysis and optimization for a class of countable semi-Markov decision processes
Yu Kang ... Hongsheng Xi
01 Jun 2011
