Simulation-based policy generation using large-scale Markov decision processes

C.W Zobel,W.T Scherer

doi:10.1109/3468.983417

Abstract

This paper presents a new problem-solving approach, termed simulation-based policy generation (SPG), that is able to generate solutions to problems that may otherwise be computationally intractable. The SPG method uses a simulation of the original problem to create an approximating Markov decision process (MDP) model which is then solved via traditional MDP solution approaches. Since this approximating MDP is a fairly rich and robust sequential optimization model, solution policies can be created which represent an intelligent and structured search of the policy space. An important feature of the SPG approach is its adaptive nature, in that it uses the original simulation model to generate improved aggregation schemes, allowing the approach to be applied in situations where the underlying problem structure is largely unknown. In order to illustrate the performance of the SPG methodology, we apply it to a common but computationally complex problem of inventory control, and we briefly discuss its application to a large-scale telephone network routing problem.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Simulation-based policy generation using large-scale Markov decision processes

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society

Lead the way for us

Journal: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society	Publication Date: Jan 1, 2001
Citations: 15

Similar Papers

Approximate Value Iteration for Risk-Aware Markov Decision Processes
Pengqian Yu ... Huan Xu
IEEE Transactions on Automatic Control | VOL. 63
Pengqian Yu, et. al.Pengqian Yu ... Huan Xu
01 Sep 2018
IEEE Transactions on Automatic Control | VOL. 63

An application of simulation for large-scale Markov decision processes to a problem in telephone network routing
C Zobel ... W.T Scherer
-
C Zobel, et. al.C Zobel ... W.T Scherer
11 Oct 1998
11 Oct 1998

Dynamic target tracking based on corner enhancement with Markov decision process
Guoyu Zuo ... Lei Ma
The Journal of Engineering | VOL. 2018
Guoyu Zuo, et. al.Guoyu Zuo ... Lei Ma
17 Sep 2018
The Journal of Engineering | VOL. 2018

A learning from demonstration framework to promote home-based neuromotor rehabilitation
Yuanliang Meng ... Yi-Ning Wu
-
Yuanliang Meng, et. al.Yuanliang Meng ... Yi-Ning Wu
01 Aug 2016
01 Aug 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Simulation-based policy generation using large-scale Markov decision processes

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society