Dynamic Non Bayesian Decision Making

Dov Monderer,Moshe Tennenholtz

doi:10.2139/ssrn.15575

Abstract

The model of a non-Bayesian agent who faces a repeated game with incomplete informationagainst Nature is an appropriate tool for modeling general agent- environment interactions. In such a modelthe environment state (controlled by Nature) may change arbitrarily and the reward function is initially unknown. The agent is non-bayesian, that is he does not form a prior probability neither on the state selection strategy of Nature, nor on his reward function. Two basic feedback structure are considered. In one of them- the perfect mopnitoring case- the agent is able to observe the previous environment state as part of his feedback, while in the other - the imperfect monitoring case- all that is available to the agent is the reward obtained. Both of these setting refer to partially observable processes, where the current environment state is unknown. Our main result refers to the competitive ratio criterion in the perfect monitoring case; We prove the existence of an efficient stochastic policy which ensures that the competitive ratio is obtained at almost all stages with an arbitrary high probability, where efficiency is measured in term of rate of convergence. It is further shown that such an optimal strategy does not exist in the imperfect monitoring case. Moreover, it is proved that in the perfect monitoring case there does not exist a deterministic policy that satisfy our long run optimality criterion. In addition we discuss the maxmin criterion and prove that a deterministic efficient optimal strategy does exist in the imperfect monitoring case under this criterion. Finally we show that our approach to long-run optimality can be vied as qualitative, which distinguishes it from previous work in this area.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Non Bayesian Decision Making

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal

Lead the way for us

Journal: SSRN Electronic Journal	Publication Date: Jan 1, 1997
Citations: 1

Similar Papers

Dynamic Non-Bayesian Decision Making
D Monderer ... M Tennenholtz
Journal of Artificial Intelligence Research | VOL. 7
D Monderer, et. al.D Monderer ... M Tennenholtz
01 Nov 1997
Journal of Artificial Intelligence Research | VOL. 7

Dynamic non-Bayesian decision making in multi-agent systems
Dov Monderer ... Moshe Tennenholtz
Annals of Mathematics and Artificial Intelligence | VOL. 25
Dov Monderer, et. al.Dov Monderer ... Moshe Tennenholtz
01 Jan 1998
Annals of Mathematics and Artificial Intelligence | VOL. 25

Model-based learning of interaction strategies in multi-agent systems
David Carmel ... Shaul Markovitch
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 10
David Carmel, et. al.David Carmel ... Shaul Markovitch
01 Jul 1998
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 10

Learning optimal dialogue management rules by using reinforcement learning and inductive logic programming
Renaud Lecœuche
-
Renaud LecœucheRenaud Lecœuche
01 Jan 2001
01 Jan 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Non Bayesian Decision Making

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal