A Model for Multi-timescaled Sequential Decision-making Processes with Adversary

H.S. Chang

doi:10.1080/13873950412331335261

Abstract

Extending the multi-timescale model that the author and colleagues proposed for Markov decision processes, this paper proposes a simple analytical model, called M-timescale two-person zero-sum Markov games (MMGs), for hierarchically structured sequential decision-making processes in competitive two-player settings, where one player (the minimizer) wishes to minimize the cost paid to the adversary (the maximizer). In this hierarchical model, each player makes the decisions at each level of the M-level hierarchy on one of M different discrete timescales, and the state space and the control space of each level do not overlap with those of the other levels. The hierarchy is structured in a "pyramid" sense: a decision made at level m (slower timescale), and/or the state at that level, affects the evolution of the decision-making process at the lower level m+1 (faster timescale) until a new decision is made at the higher level, whereas the lower-level decisions themselves do not affect the transition dynamics of higher levels. The performance produced by the lower-level decisions does, however, affect the higher-level decisions of each player. A hierarchical objective function is defined for the minimizer and the maximizer; from it we define a "multi-level equilibrium value function" and derive a "multi-level equilibrium equation". We also discuss how to solve hierarchical games exactly.
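
As a rough illustration of the single-level building block only (M = 1), the sketch below runs the standard Shapley value iteration for a two-person zero-sum Markov game. It is not the paper's multi-level equilibrium equation. The discount factor `gamma`, the restriction to two actions per player, and the names `value_2x2` and `shapley_iteration` are all assumptions made for this example.

```python
def value_2x2(m):
    """Value of a 2x2 zero-sum matrix game.

    Rows are the minimizer's actions, columns the maximizer's;
    entries are costs paid by the minimizer to the maximizer.
    """
    (a, b), (c, d) = m
    upper = min(max(a, b), max(c, d))  # minimizer's best pure-strategy guarantee
    lower = max(min(a, c), min(b, d))  # maximizer's best pure-strategy guarantee
    if upper == lower:                 # saddle point in pure strategies
        return upper
    # No saddle point: closed-form value of the mixed-strategy equilibrium.
    return (a * d - b * c) / (a + d - b - c)


def shapley_iteration(cost, trans, gamma=0.9, tol=1e-10, max_iter=10_000):
    """Approximate the equilibrium value of a discounted zero-sum Markov game.

    cost[s][a][b]  : one-step cost in state s (assumed layout)
    trans[s][a][b] : list of next-state probabilities (assumed layout)
    """
    n = len(cost)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v = []
        for s in range(n):
            # Stage game: immediate cost plus discounted expected future value.
            q = [[cost[s][a][b]
                  + gamma * sum(p * v[t] for t, p in enumerate(trans[s][a][b]))
                  for b in range(2)]
                 for a in range(2)]
            new_v.append(value_2x2(q))
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


# Hypothetical one-state example: repeated matching pennies.
pennies_cost = [[[1.0, -1.0], [-1.0, 1.0]]]
pennies_trans = [[[[1.0], [1.0]], [[1.0], [1.0]]]]
pennies_value = shapley_iteration(pennies_cost, pennies_trans)
```

In an M-level model, a stage game like this would presumably sit at the fastest timescale, with its value feeding the slower levels; that coupling is not specified in the abstract and is not sketched here.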

Journal: Mathematical and Computer Modelling of Dynamical Systems
Publication Date: Sep 1, 2004
Citations: 1

