A continuous-time approach to online optimization

Joon Kwon, Panayotis Mertikopoulos

doi:10.3934/jdg.2017008

Abstract

We consider a family of mirror descent strategies for online optimization in continuous-time and we show that they lead to no regret. From a more traditional, discrete-time viewpoint, this continuous-time approach allows us to derive the no-regret properties of a large class of discrete-time algorithms including as special cases the exponential weights algorithm, online mirror descent, smooth fictitious play and vanishingly smooth fictitious play. In so doing, we obtain a unified view of many classical regret bounds, and we show that they can be decomposed into a term stemming from continuous-time considerations and a term which measures the disparity between discrete and continuous time. This generalizes the continuous-time based analysis of the exponential weights algorithm from [29]. As a result, we obtain a general class of infinite horizon learning strategies that guarantee an $\mathcal{O}(n^{-1/2})$ regret bound without having to resort to a doubling trick.
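
As an illustration of the kind of horizon-free strategy the abstract refers to, the following is a minimal sketch of the exponential weights algorithm with a time-varying step size. It assumes losses in $[0, 1]$ and the step size $\eta_t = \sqrt{\log d / t}$, which is a standard textbook choice and not necessarily the parameter schedule used in the paper; the function name `exponential_weights` and the random loss sequence are hypothetical and serve only as a demonstration.

```python
import numpy as np

def exponential_weights(losses):
    """Run exponential weights with a time-varying step size.

    losses: array of shape (n, d), entries assumed to lie in [0, 1].
    Returns the average regret against the best fixed action in hindsight.
    """
    n, d = losses.shape
    cumulative = np.zeros(d)   # cumulative loss of each action so far
    incurred = 0.0             # learner's cumulative expected loss
    for t in range(1, n + 1):
        # Assumed step size: depends on t only, so the horizon n is never used.
        eta = np.sqrt(np.log(d) / t)
        scores = -eta * cumulative
        weights = np.exp(scores - scores.max())  # shift for numerical stability
        weights /= weights.sum()
        incurred += weights @ losses[t - 1]      # expected loss of the mixed action
        cumulative += losses[t - 1]
    return (incurred - cumulative.min()) / n

# Hypothetical demo: i.i.d. uniform losses over d actions.
rng = np.random.default_rng(0)
n, d = 10_000, 5
avg_regret = exponential_weights(rng.random((n, d)))
bound = 2 * np.sqrt(np.log(d) / n)
print(avg_regret, bound)
```

Because the step size decreases with $t$, the strategy never needs to know $n$ in advance or restart, which is the role a doubling trick would otherwise play. Under the stated assumptions, the average regret stays below $2\sqrt{\log d / n}$, in line with the $\mathcal{O}(n^{-1/2})$ rate.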

Journal: Journal of Dynamics & Games
Publication Date: Oct 16, 2016
Citations: 47
License type: cc-by

Similar Papers

Mirror descent learning in continuous games
Zhengyuan Zhou ... Aris L Moustakas
21 Nov 2017

Exponential weight algorithm in continuous time
Sylvain Sorin
Mathematical Programming | VOL. 116
25 Apr 2007

Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent
Gabriele Farina ... Tuomas Sandholm
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

Scaling Mean Field Games with Online Mirror Descent
20 Apr 2022
