Online calibrated forecasts: Memory efficiency versus universality for learning in games

Shie Mannor,Gürdal Arslan,Jeff S Shamma

doi:10.1007/s10994-006-0219-y

Abstract

We provide a simple learning process that enables an agent to forecast a sequence of outcomes. Our forecasting scheme, termed tracking forecast, is based on tracking the past observations while emphasizing recent outcomes. As opposed to other forecasting schemes, we sacrifice universality in favor of a significantly reduced memory requirements. We show that if the sequence of outcomes has certain properties--it has some internal (hidden) state that does not change too rapidly--then the tracking forecast is weakly calibrated so that the forecast appears to be correct most of the time. For binary outcomes, this result holds without any internal state assumptions. We consider learning in a repeated strategic game where each player attempts to compute some forecast of the opponent actions and play a best response to it. We show that if one of the players uses a tracking forecast, while the other player uses a standard learning algorithm (such as exponential regret matching or smooth fictitious play), then the player using the tracking forecast obtains the best response to the actual play of the other players. We further show that if both players use tracking forecast, then under certain conditions on the game matrix, convergence to a Nash equilibrium is possible with positive probability for a larger class of games than the class of games for which smooth fictitious play converges to a Nash equilibrium.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online calibrated forecasts: Memory efficiency versus universality for learning in games

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Journal: Machine Learning	Publication Date: Sep 27, 2006
Citations: 53

Similar Papers

Pre-Processing Structured Data for Standard Machine Learning Algorithms by Supervised Graph Propositionalization - A Case Study with Medicinal Chemistry Datasets
Thashmee Karunaratne ... Henrik Bostrom
-
Thashmee Karunaratne, et. al.Thashmee Karunaratne ... Henrik Bostrom
01 Dec 2010
01 Dec 2010

An under-sampling imbalanced learning of data gravitation based classification
Lizhi Peng ... Xiaoqing Zhou
-
Lizhi Peng, et. al.Lizhi Peng ... Xiaoqing Zhou
01 Aug 2016
01 Aug 2016

Sampled fictitious play is Hannan consistent
Zifan Li ... Ambuj Tewari
Games and Economic Behavior | VOL. 109
Zifan Li, et. al.Zifan Li ... Ambuj Tewari
05 Feb 2018
Games and Economic Behavior | VOL. 109

Stochastic frontier estimation of efficient learning in video games
Karla R Hamlen
Computers & Education | VOL. 58
Karla R HamlenKarla R Hamlen
12 Sep 2011
Computers & Education | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online calibrated forecasts: Memory efficiency versus universality for learning in games

Abstract

Talk to us

Similar Papers

More From: Machine Learning