Online gaming for learning optimal team strategies in real time

Gregory Hudas,F L Lewis,K G Vamvoudakis

doi:10.1117/12.850231

Abstract

This paper first presents an overall view for dynamical decision-making in teams, both cooperative and competitive. Strategies for team decision problems, including optimal control, zero-sum 2-player games (H-infinity control) and so on are normally solved for off-line by solving associated matrix equations such as the Riccati equation. However, using that approach, players cannot change their objectives online in real time without calling for a completely new off-line solution for the new strategies. Therefore, in this paper we give a method for learning optimal team strategies online in real time as team dynamical play unfolds. In the linear quadratic regulator case, for instance, the method learns the Riccati equation solution online without ever solving the Riccati equation. This allows for truly dynamical team decisions where objective functions can change in real time and the system dynamics can be time-varying.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online gaming for learning optimal team strategies in real time

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Online adaptive learning for team strategies in multi-agent systems
Greg Hudas ... Kyriakos G Vamvoudakis
The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology | VOL. 9
Greg Hudas, et. al.Greg Hudas ... Kyriakos G Vamvoudakis
08 Sep 2010
The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology | VOL. 9

Synchronous online learning with integral reinforcement
...
-
, et. al. ...
01 Jan 2012
01 Jan 2012

Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
Kyriakos G Vamvoudakis ... Greg R Hudas
Automatica | VOL. 48
Kyriakos G Vamvoudakis, et. al.Kyriakos G Vamvoudakis ... Greg R Hudas
29 Jun 2012
Automatica | VOL. 48

Online solution to the linear quadratic tracking problem of continuous-time systems using reinforcement learning
Hamidreza Modares ... Frank L Lewis
-
Hamidreza Modares, et. al.Hamidreza Modares ... Frank L Lewis
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online gaming for learning optimal team strategies in real time

Abstract

Talk to us

Similar Papers