Regret bounds of a distributed saddle point algorithm

Alec Koppel,Felicia Y Jakubiec,Alejandro Ribeiro

doi:10.1109/icassp.2015.7178515

Abstract

An algorithm to learn optimal actions in distributed convex repeated games is developed. Learning is repeated because cost functions are revealed sequentially and distributed because they are revealed to agents of a network that can exchange information with neighboring nodes only. Learning is measured in terms of the global networked regret, which is the accumulated loss of causal prediction with respect to a centralized clairvoyant agent to which the information of all times and agents is revealed at the initial time. We use a variant of the Arrow-Hurwicz saddle point algorithm which penalizes local agent disagreement via Lagrange multipliers and leads to a distributed online algorithm. We show that decisions made with this saddle point algorithm lead to regret whose order is not larger than O(√T), where T is the total number of rounds of the game. Numerical behavior is illustrated for the particular case of dynamic sensor network estimation across different network sizes, connectivities, and topologies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Regret bounds of a distributed saddle point algorithm

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Saddle Point Algorithm for Networked Online Convex Optimization
Alec Koppel ... Alejandro Ribeiro
IEEE Transactions on Signal Processing | VOL. 63
Alec Koppel, et. al.Alec Koppel ... Alejandro Ribeiro
01 Oct 2015
IEEE Transactions on Signal Processing | VOL. 63

A saddle point algorithm for networked online convex optimization
Alec Koppel ... Felicia Y Jakubiec
-
Alec Koppel, et. al.Alec Koppel ... Felicia Y Jakubiec
01 May 2014
01 May 2014

Task-driven dictionary learning in distributed online settings
Alec Koppel ... Ethan Stump
-
Alec Koppel, et. al.Alec Koppel ... Ethan Stump
01 Nov 2015
01 Nov 2015

Asynchronous Online Learning in Multi-Agent Systems With Proximity Constraints
Amrit Singh Bedi ... Ketan Rajawat
IEEE Transactions on Signal and Information Processing over Networks | VOL. 5
Amrit Singh Bedi, et. al.Amrit Singh Bedi ... Ketan Rajawat
01 Sep 2019
IEEE Transactions on Signal and Information Processing over Networks | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Regret bounds of a distributed saddle point algorithm

Abstract

Talk to us

Similar Papers