A projection-based continuous-time algorithm for distributed optimization over multi-agent systems

Xingnan Wen,Sitian Qin

doi:10.1007/s40747-020-00265-x

Abstract

Multi-agent systems are widely studied due to its ability of solving complex tasks in many fields, especially in deep reinforcement learning. Recently, distributed optimization problem over multi-agent systems has drawn much attention because of its extensive applications. This paper presents a projection-based continuous-time algorithm for solving convex distributed optimization problem with equality and inequality constraints over multi-agent systems. The distinguishing feature of such problem lies in the fact that each agent with private local cost function and constraints can only communicate with its neighbors. All agents aim to cooperatively optimize a sum of local cost functions. By the aid of penalty method, the states of the proposed algorithm will enter equality constraint set in fixed time and ultimately converge to an optimal solution to the objective problem. In contrast to some existed approaches, the continuous-time algorithm has fewer state variables and the testification of the consensus is also involved in the proof of convergence. Ultimately, two simulations are given to show the viability of the algorithm.

Highlights

Reinforcement learning stems from an experiment on the behaviors of cats in 1898 by Thorndike [20]
deep reinforcement learning (DRL) is a interdiscipline of reinforcement learning and deep learning to cope with environments with high dimensions [17]
The distributed optimization problem is reformulated to a new one without inequality constraints and consensus constraints

Summary

Introduction

Reinforcement learning stems from an experiment on the behaviors of cats in 1898 by Thorndike [20]. We construct a projection-based continuous-time algorithm to solve distributed optimization problems with equality and inequality constraints over multi-agent systems in this paper. To solve distributed optimization problem (3), a projectionbased continuous-time algorithm is proposed as follows: xi (t ). Proposition 4 The equilibrium point of continuous-time algorithm (18) is an optimal solution to distributed optimization problem (3) and vice versa. In this part, with the help of Lyapunov method and above preliminaries, we will study the convergence of continuoustime algorithm (17). Remark 4 It is worth noting that the property of entering one of the constraints or feasible region is possessed by many continuous-time algorithms for solving optimization problems, such as [8,11,19,34].

Objective functions

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Complex & Intelligent Systems	Publication Date: Jan 11, 2021
Citations: 6	License type: open-access

R Discovery Prime

R Discovery Prime

A projection-based continuous-time algorithm for distributed optimization over multi-agent systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Similar Papers

A continuous-time neurodynamic approach and its discretization for distributed convex optimization over multi-agent systems
Xingnan Wen ... Sitian Qin
Neural Networks | VOL. 143
Xingnan Wen, et. al.Xingnan Wen ... Sitian Qin
21 May 2021
Neural Networks | VOL. 143

Load-aware continuous-time optimization for multi-agent systems: toward dynamic resource allocation and real-time adaptability
Qianxing Wang ... Amin Mohajer
Computer Networks | VOL. 250
Qianxing Wang, et. al.Qianxing Wang ... Amin Mohajer
04 Jun 2024
Computer Networks | VOL. 250

Distributed optimization with the consideration of adaptivity and finite-time convergence
Peng Lin ... Yongduan Song
-
Peng Lin, et. al.Peng Lin ... Yongduan Song
01 Jun 2014
01 Jun 2014

Randomized Gradient-Free Distributed Online Optimization with Time-Varying Cost Functions
Yipeng Pang ... Guoqiang Hu
-
Yipeng Pang, et. al.Yipeng Pang ... Guoqiang Hu
01 Dec 2019
01 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A projection-based continuous-time algorithm for distributed optimization over multi-agent systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Complex & Intelligent Systems