Parameterized Markov decision process and its application to service rate control

Li Xia,Qing-Shan Jia

doi:10.1016/j.automatica.2015.01.006

Abstract

In this paper, we discuss the optimization of Markov decision processes (MDPs) with parameterized policy, where the state space is partitioned and a parameter is assigned to each partition. The goal is to find the optimal parameters which maximize the long-run average performance. The traditional policy iteration is usually inapplicable to parameterized policy because the parameter tuning at different states are correlated. With some appropriate assumptions and special conditions, we develop a modified policy iteration type algorithm to find the optimal parameters. Compared with the traditional gradient-based approaches for MDP with parameterized policy, this policy iteration type approach is much more efficient. Finally, as an example, we apply this approach to a service rate control problem in closed Jackson networks. As compared with the gradient-based approach which is trapped into local optimum, our approach is demonstrated to efficiently find the optimal service rates in global scope.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parameterized Markov decision process and its application to service rate control

Abstract

Talk to us

Similar Papers

More From: Automatica

Lead the way for us

Journal: Automatica	Publication Date: Feb 11, 2015
Citations: 29

Similar Papers

Computationally efficient algorithms for on-line optimization of markov decision processes
A Jalali ... M.J Ferguson
Automatica | VOL. 28
A Jalali, et. al.A Jalali ... M.J Ferguson
01 Jan 1992
Automatica | VOL. 28

Bias optimality and strong n ( [formula omitted]) discount optimality for Markov decision processes
Quanxin Zhu
Journal of Mathematical Analysis and Applications | VOL. 334
Quanxin ZhuQuanxin Zhu
08 Jan 2007
Journal of Mathematical Analysis and Applications | VOL. 334

Light robustness in the optimization of Markov decision processes with uncertain parameters
Peter Buchholz ... Dimitri Scheftelowitsch
Computers and Operations Research | VOL. 108
Peter Buchholz, et. al.Peter Buchholz ... Dimitri Scheftelowitsch
03 Apr 2019
Computers and Operations Research | VOL. 108

First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors
Xiao Wu ... Xianping Guo
Journal of Applied Probability | VOL. 52
Xiao Wu, et. al.Xiao Wu ... Xianping Guo
01 Jun 2015
Journal of Applied Probability | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parameterized Markov decision process and its application to service rate control

Abstract

Talk to us

Similar Papers

More From: Automatica