Abstract
Entropy games and matrix multiplication games have been recently introduced by Asarin et al. They model the situation in which one player (Despot) wishes to minimize the growth rate of a matrix product, whereas the other player (Tribune) wishes to maximize it. We develop an operator approach to entropy games. This allows us to show that entropy games can be cast as stochastic mean payoff games in which some action spaces are simplices and payments are given by a relative entropy (Kullback-Leibler divergence). In this way, we show that entropy games with a fixed number of states belonging to Despot can be solved in polynomial time. This approach also allows us to solve these games by a policy iteration algorithm, which we compare with the spectral simplex algorithm developed by Protasov.
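To make the relative-entropy reformulation concrete, here is a minimal numerical sketch (the vectors m and x are illustrative, not taken from the paper) of the Gibbs variational formula log Σ_j m_j e^{x_j} = max over probability vectors p of ( Σ_j p_j x_j − Σ_j p_j log(p_j / m_j) ), whose maximizer is p_j ∝ m_j e^{x_j}. An identity of this kind is what allows the multiplicative, log-sum-exp dynamic programming operator of an entropy game to be rewritten as a stochastic mean payoff game in which the extra actions range over a simplex and the payments are Kullback-Leibler divergences.

```python
import numpy as np

# Gibbs variational formula:
#   log sum_j m_j * exp(x_j) = max_p  sum_j p_j * x_j - KL(p || m),
# where p ranges over probability vectors, m > 0 need not sum to 1, and
# KL(p || m) = sum_j p_j * log(p_j / m_j).  The maximizer is p_j ~ m_j * exp(x_j).

rng = np.random.default_rng(0)
m = rng.uniform(0.1, 2.0, size=5)   # illustrative positive weights (one matrix row)
x = rng.normal(size=5)              # illustrative value / potential vector

lse = np.log(np.sum(m * np.exp(x)))     # left-hand side

p = m * np.exp(x)
p /= p.sum()                            # optimal point of the simplex
kl = np.sum(p * np.log(p / m))          # relative entropy payment
rhs = p @ x - kl                        # right-hand side at the maximizer

print(lse, rhs)   # the two values agree up to rounding error
```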
Highlights
Entropy games have been introduced by Asarin et al. [5]
Whereas matrix multiplication games are hard in general, entropy games form a tractable subclass in which the matrix sets are closed under exchanging rows between their elements, the so-called independent row uncertainty (IRU) assumption
We report experiments showing that, when specialized to one-player problems, policy iteration yields a speedup of one order of magnitude compared with the “spectral simplex” method recently introduced by Protasov [23]
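As a rough illustration of the one-player setting mentioned in the last highlight (only the maximizing player acts), the following sketch runs a policy-iteration-style loop for maximizing the spectral radius, i.e. the growth rate of the matrix products, under independent row uncertainty: fix one candidate row per state, compute the Perron eigenpair, then re-select every row by its inner product with the current eigenvector. The instance, helper names, and tie-breaking rule are illustrative assumptions, this is not the authors' implementation, and it presumes the selected matrices are irreducible so the Perron eigenvector is positive.

```python
import numpy as np

def perron(M):
    # Dominant (Perron) eigenvalue and eigenvector of a nonnegative matrix;
    # assumes irreducibility so the eigenvector can be taken positive.
    eigvals, eigvecs = np.linalg.eig(M)
    k = int(np.argmax(np.abs(eigvals)))
    v = np.abs(np.real(eigvecs[:, k]))
    return float(np.real(eigvals[k])), v / v.sum()

def max_growth_policy_iteration(row_sets):
    # row_sets[i]: candidate rows for state i (independent row uncertainty).
    # Goal: choose one row per state so the spectral radius is maximal.
    policy = [0] * len(row_sets)            # arbitrary initial row choices
    while True:
        M = np.array([row_sets[i][policy[i]] for i in range(len(row_sets))])
        lam, v = perron(M)
        new_policy = []
        for i, rows in enumerate(row_sets):
            scores = [row @ v for row in rows]
            best = int(np.argmax(scores))
            # keep the current row on (near-)ties so the iteration terminates
            if scores[best] <= scores[policy[i]] + 1e-12:
                best = policy[i]
            new_policy.append(best)
        if new_policy == policy:
            return lam, policy              # lam = maximal growth rate
        policy = new_policy

# Tiny illustrative instance: two states, two candidate rows each.
rows = [
    [np.array([1.0, 2.0]), np.array([3.0, 0.5])],
    [np.array([0.5, 1.0]), np.array([2.0, 2.0])],
]
print(max_growth_policy_iteration(rows))
```

Roughly speaking, a spectral-simplex-style method updates a single row per eigenvalue computation, whereas the sweep above re-selects all rows at once before recomputing the eigenpair; comparing such strategies is the subject of the experiments referred to in the last highlight.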
Summary
Entropy games have been introduced by Asarin et al. [5]. They model the situation in which two players with conflicting interests, called “Despot” and “Tribune”, wish respectively to minimize or to maximize a topological entropy representing the freedom of a half-player, “People”. It was shown in [5] that the problem of comparing the value of an entropy game to a given rational number is in NP ∩ coNP, giving entropy games a status comparable to other important classes of games with an unsettled complexity, including mean payoff games, simple stochastic games, and stochastic mean payoff games; see [4] for background. Another motivation to study entropy games arises from risk sensitive control [13, 14, 3]: as we shall see, essentially the same class of operators arises in the latter setting. Further motivations originate from symbolic dynamics [21, Chapter 1.8.4].