Abstract
In this paper, agents learn how often to exchange information with their neighbors in cooperative Multi-Agent Systems (MASs) so that their user-defined cost functions are minimized. The investigated cost functions capture trade-offs between the local control performance of the MAS and the energy consumption of each agent in the presence of exogenous disturbances. Agent energy consumption, which comprises both control (e.g., acceleration, velocity) and communication efforts, is critical for prolonging the MAS mission. The proposed methodology begins by computing upper bounds on asynchronous broadcasting intervals that provably stabilize the MAS. Subsequently, we use these upper bounds as optimization constraints and employ an online learning algorithm based on Least-Squares Policy Iteration (LSPI) to minimize each agent's cost function. Consequently, the obtained broadcasting intervals adapt to the most recent information (e.g., delayed and noisy agents' inputs and/or outputs) received from neighbors while provably stabilizing the MAS. Chebyshev polynomials serve as the function approximator in the LSPI, while Kalman Filtering (KF) handles sampled, corrupted and delayed data. The proposed methodology is exemplified on a consensus control problem with general linear agent dynamics.
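To make the learning component concrete, the following is a minimal, hypothetical sketch of the generic building block the abstract names: one LSPI policy-evaluation step (LSTD) using a Chebyshev polynomial basis as the function approximator. It does not reproduce the paper's per-agent cost functions, stability constraints, or KF handling of delayed data; the function names (`chebyshev_features`, `lstd`), the `(state, stage_cost, next_state)` sample format, and the discount factor are illustrative assumptions.

```python
import numpy as np

def chebyshev_features(x, degree=4):
    """Chebyshev basis T_0..T_degree at scalar x in [-1, 1] (assumes degree >= 1)."""
    feats = np.empty(degree + 1)
    feats[0], feats[1] = 1.0, x
    for k in range(2, degree + 1):
        # Three-term recurrence: T_k(x) = 2x T_{k-1}(x) - T_{k-2}(x)
        feats[k] = 2.0 * x * feats[k - 1] - feats[k - 2]
    return feats

def lstd(samples, gamma=0.95, degree=4):
    """One LSPI policy-evaluation step (LSTD): solve A w = b from
    (state, stage_cost, next_state) samples collected under the
    current broadcasting-interval policy."""
    n = degree + 1
    A = np.zeros((n, n))
    b = np.zeros(n)
    for x, cost, x_next in samples:
        phi = chebyshev_features(x, degree)
        phi_next = chebyshev_features(x_next, degree)
        A += np.outer(phi, phi - gamma * phi_next)
        b += phi * cost
    # Small ridge term keeps A invertible when samples are few.
    return np.linalg.solve(A + 1e-6 * np.eye(n), b)
```

In a full LSPI loop, the weight vector returned by `lstd` would define the next (greedy) policy over admissible broadcasting intervals, with the stability-derived upper bounds acting as hard constraints on the action set.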