DISCRETE DYNAMIC PROGRAMMING WITH RECURSIVE ADDITIVE SYSTEM

Seiichi Iwamoto

doi:10.5109/13082

Abstract

In the paper [5], N. Furukawa and S. Iwamoto have defined Markovian decision processes with a new broad class of reward systems, that is, recursive reward functions, and have studied the existence and properties of optimal policies. Under some conditions on the reward functions, they have proved that there exists a (p, s)-optimal stationary policy and that in the case of a finite action space there exists an optimal stationary policy. These are some generalizations of results by D. Blackwell [3]. In this paper the author defines a dynamic programming problem with a recursive additive system which is referred to one type of Markovian decision processes with recursive reward functions defined by the previous authors [5]. This paper gives an algorithm for finding optimal stationary policies in the dynamic programming with the recursive additive system in the case of finite state and action spaces. Furthermore, we give several interesting examples with numerical computations to obtain optimal policies. The motivation to consider the dynamic programming problem with the recursive additive system is the following : If we restrict the reward in narrow sense, for instance, the money in economic systems or the loss in statistical decision problems, it will be appropriate for us to accept the total sum of stage-wise rewards as a performance index. That is so-called additive reward system. But many practical problems in the field of engineerings enable us to interpret the reward in wider sense. In those problems we often encounter much complicated reward systems that are more than so-called additive. We have an interesting class of such complicated reward systems in which we can find a common feature named recursive additive . By talking about various reward systems belonging to this class at the same time, we can make clear, as a dynamic programming problem, an important common property within the class, Our proofs are partially owing to Blackwell [2].

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bulletin of Mathematical Statistics	Publication Date: Mar 1, 1974
Citations: 7	License type: other-oa

R Discovery Prime

R Discovery Prime

DISCRETE DYNAMIC PROGRAMMING WITH RECURSIVE ADDITIVE SYSTEM

Abstract

Talk to us

Similar Papers

More From: Bulletin of Mathematical Statistics

Lead the way for us

Similar Papers

Two-person zero-sum stochastic games
Melike Baykal-Gürsoy
Annals of Operations Research | VOL. 28
Melike Baykal-GürsoyMelike Baykal-Gürsoy
01 Dec 1991
Annals of Operations Research | VOL. 28

A sample-path approach to stochastic games
M Baykal-Gursoy
-
M Baykal-GursoyM Baykal-Gursoy
13 Dec 1989
13 Dec 1989

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
Eric V DenardoEric V Denardo
01 Apr 1967
SIAM Review | VOL. 9

Variability sensitive Markov decision processes
M Baykal-Gursoy ... K.W Ross
-
M Baykal-Gursoy, et. al.M Baykal-Gursoy ... K.W Ross
13 Dec 1989
13 Dec 1989

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DISCRETE DYNAMIC PROGRAMMING WITH RECURSIVE ADDITIVE SYSTEM

Abstract

Talk to us

Similar Papers

More From: Bulletin of Mathematical Statistics