Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

Michael Jordan, Tianyi Lin, Zhengyuan Zhou

doi:10.1287/opre.2022.0446

Abstract

Feasible Online Learning with Gradient Feedback

Online gradient descent (OGD) is well known to be doubly optimal under strong convexity or monotonicity assumptions: (1) in the single-agent setting, it achieves an optimal regret of $\Theta(\log T)$ for strongly convex cost functions, and (2) in the multiagent setting of strongly monotone games, with each agent employing OGD, the joint action converges in a last-iterate sense to a unique Nash equilibrium at an optimal rate of $\Theta(1/T)$. Although these finite-time guarantees highlight its merits, OGD has the drawback that it requires knowing the strong convexity/monotonicity parameters. In “Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback,” M. Jordan, T. Lin, and Z. Zhou design a fully adaptive OGD algorithm, AdaOGD, that does not require a priori knowledge of these parameters. In the single-agent setting, the algorithm achieves $O(\log^2 T)$ regret under strong convexity, which is optimal up to a log factor. Further, if each agent employs AdaOGD in strongly monotone games, the joint action converges in a last-iterate sense to a unique Nash equilibrium at a rate of $O(\log^3 T / T)$, again optimal up to log factors. The algorithms are illustrated in a learning version of the classic newsvendor problem, in which, because of lost sales, only (noisy) gradient feedback can be observed. The results immediately yield the first feasible and near-optimal algorithm for both the single-retailer and multiretailer settings.
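To make concrete the drawback the abstract names, the following is a minimal sketch of classical projected OGD for strongly convex costs, using the textbook step size 1/(mu * t). It is not the authors' AdaOGD, whose update rule is not given on this page. The function name `ogd_strongly_convex`, the quadratic test costs, and all parameter values are assumptions chosen for illustration only.

```python
import random


def ogd_strongly_convex(grad, x0, mu, T, lower=float("-inf"), upper=float("inf")):
    """Classical projected OGD on an interval [lower, upper].

    The step size 1 / (mu * t) is the standard choice for mu-strongly
    convex costs; note that it needs mu as an input.
    """
    x = float(x0)
    iterates = [x]
    for t in range(1, T + 1):
        step = 1.0 / (mu * t)             # requires knowing mu in advance
        x = x - step * grad(x, t)         # gradient step on the round-t cost
        x = min(max(x, lower), upper)     # projection onto the interval
        iterates.append(x)
    return iterates


# Hypothetical test problem: f_t(x) = (mu / 2) * (x - theta_t)^2,
# which is mu-strongly convex with gradient mu * (x - theta_t).
rng = random.Random(0)
mu = 0.5
T = 200
targets = [rng.gauss(2.0, 1.0) for _ in range(T)]


def quad_grad(x, t):
    return mu * (x - targets[t - 1])


iterates = ogd_strongly_convex(quad_grad, x0=10.0, mu=mu, T=T)
print(iterates[-1], sum(targets) / T)
```

On these quadratic costs, and with no active projection, the update reduces to a running average of the targets, so the final iterate matches their mean. The point of the sketch is the `mu` argument: the step size cannot be set without it, which is the prior knowledge an adaptive method is designed to avoid.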

Similar Papers

Tight global linear convergence rate bounds for Douglas–Rachford splitting
Pontus Giselsson
Journal of Fixed Point Theory and Applications | VOL. 19
13 Mar 2017

A Sequential Linear Programming Algorithm for Solving Monotone Variational Inequalities
Patrice Marcotte ... Jean-Pierre Dussault
SIAM Journal on Control and Optimization | VOL. 27
01 Nov 1989

Multivariate sharp quadratic bounds via $\mathbf{\Sigma}$-strong convexity and the Fenchel connection
Ryan P Browne ... Paul D McNicholas
Electronic Journal of Statistics | VOL. 9
01 Jan 2015

Identifiability of Causal Effects for Binary Variables with Baseline Data Missing Due to Death
Wei Yan ... Yaqin Hu
Biometrics | VOL. 68
12 Aug 2011
