In multi-agent reinforcement learning, it is essential that agents learn a communication protocol to optimize their collaboration policies and to mitigate unstable learning. Existing actor-critic methods address communication among agents, but they have difficulty improving sample efficiency and learning robust policies because the environment dynamics become nonstationary as the policies of the other agents change. We propose a method for learning cooperative policies in multi-agent environments that explicitly accounts for communication among agents. The proposed method combines recurrent neural network-based actor-critic networks with deterministic policy gradients to train decentralized policies in a centralized manner. The actor networks let the agents communicate through forward and backward paths and determine their subsequent actions. The critic network helps train the actor networks by sending each actor a gradient signal that reflects its contribution to the global reward. To address partial observability and unstable learning, we add auxiliary prediction networks that approximate the state transitions and the reward function. We evaluated the proposed method in multi-agent environments against existing multi-agent reinforcement learning methods, measuring both learning efficiency during training and goal achievement in the test phase. The results demonstrate that the proposed method outperforms the alternatives.
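
To make the described architecture concrete, the sketch below shows one plausible reading of its components: a bidirectional recurrent communication channel across the actors (the forward and backward paths) and a centralized critic with auxiliary next-state and reward heads. It is a minimal PyTorch sketch; all module names, layer sizes, and the exact wiring are illustrative assumptions rather than the authors' reference implementation.

```python
# Minimal sketch (PyTorch), assuming a bidirectional recurrent communication
# channel over the agent dimension, a centralized critic, and auxiliary heads
# that predict the next observations and the reward. Names and sizes are
# illustrative assumptions, not the paper's reference implementation.
import torch
import torch.nn as nn


class CommActor(nn.Module):
    """Actors that exchange messages via forward/backward recurrent passes."""

    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.encode = nn.Linear(obs_dim, hidden)
        # Bidirectional GRU over the *agent* dimension serves as the
        # forward/backward communication path between agents.
        self.comm = nn.GRU(hidden, hidden, bidirectional=True, batch_first=True)
        self.policy = nn.Linear(2 * hidden, act_dim)

    def forward(self, obs):                    # obs: (batch, n_agents, obs_dim)
        h = torch.relu(self.encode(obs))
        msgs, _ = self.comm(h)                 # messages flow agent-to-agent
        return torch.tanh(self.policy(msgs))   # deterministic joint action


class CentralCritic(nn.Module):
    """Centralized critic with auxiliary next-state and reward predictions."""

    def __init__(self, obs_dim, act_dim, n_agents, hidden=128):
        super().__init__()
        joint = n_agents * (obs_dim + act_dim)
        self.body = nn.Sequential(nn.Linear(joint, hidden), nn.ReLU())
        self.q_head = nn.Linear(hidden, 1)                           # Q(s, a)
        self.next_obs_head = nn.Linear(hidden, n_agents * obs_dim)   # auxiliary
        self.reward_head = nn.Linear(hidden, 1)                      # auxiliary

    def forward(self, obs, act):               # obs/act: (batch, n_agents, ·)
        x = self.body(torch.cat([obs, act], dim=-1).flatten(1))
        return self.q_head(x), self.next_obs_head(x), self.reward_head(x)
```

Under these assumptions, the Q-value output would drive the deterministic policy gradient for the actors, while the auxiliary heads would be fit with supervised losses against the observed next observations and rewards, which is one way the prediction networks could counter partial observability and unstable learning.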