Multi-agent Reinforcement Learning Method Research Articles

AbstractSelf-organizing manufacturing network has emerged as a viable solution for adaptive manufacturing control within the mass personalization paradigm. This approach involves three critical elements: system modeling and control architecture, interoperable communication, and adaptive manufacturing control. However, current research often separates interoperable communication from adaptive manufacturing control as isolated areas of study. To address this gap, this paper introduces Knowledge Graph-enhanced Multi-Agent Reinforcement Learning (MARL) method that integrates interoperable communication via Knowledge Graphs with adaptive manufacturing control through Reinforcement Learning. We hypothesize that implicit domain knowledge obtained from historical production job allocation records can guide each agent to learn more effective scheduling policies with accelerated learning rates. This is based on the premise that machine assignment preferences effectively could reduce the Reinforcement Learning search space. Specifically, we redesign machine agents with new observation, action, reward, and cooperation mechanisms considering the preference of machines, building upon our previous MARL base model. The scheduling policies are trained under extensive simulation experiments that consider manufacturing requirements. During the training process, our approach demonstrates improved training speed compared with individual Reinforcement Learning methods under the same training hyperparameters. The obtained scheduling policies generated by our Knowledge Graph-enhanced MARL also outperform both individual Reinforcement Learning methods and heuristic rules under dynamic manufacturing settings.

Read full abstract

The mainstream Multi-Agent Reinforcement Learning (MARL) methods introduce the teammate modeling or the communication mechanism into Centralized Training Decentralized Execution (CTDE) paradigm, which can improve coordination performance. However, the existing teammate modeling methods predict either actions or local observations, limiting their applicability. In addition, the traditional communication mechanism only considers the quantity of the communication links while ignoring the quality of retained communication links, leading to inefficient and redundant communication. To solve the above problems, this paper proposes a novel Multi-Agent Cooperative Strategy with Explicit Teammate Modeling and Targeted Informative Communication (MACS), which can generate and send the more informative message with the higher communication efficiency, further improving the coordination performance. Specifically, the Variational Auto-Encoder (VAE) is leveraged to allow each agent to simultaneously predict the observations and actions of teammates, thus generating more comprehensive communication message. Then, we propose a new Mutual Information (MI) between the communication message and teammate Q-value, which can obtain the informative message, ensuring the exploration and stability of the method. In addition, a targeted dynamic informative communication graph is established by the Graph Neural Network (GNN) which can reduce the redundant communication link through hypothetical analysis, further improving the overall communication efficiency. Eventually, we conduct experiments in StarCraft II, Collaborative Navigation, and Multi-Target Multi-Sensor Coverage environments. Experimental results show that the proposed approach is superior to the state-of-the-art in terms of coordination performance and communication efficiency.

Read full abstract

Multi-agent Reinforcement Learning Method Research Articles

Related Topics

Articles published on Multi-agent Reinforcement Learning Method

Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method.

Knowledge graph-enhanced multi-agent reinforcement learning for adaptive scheduling in smart manufacturing

Safe robust multi-agent reinforcement learning with neural control barrier functions and safety attention mechanism

Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards

Multiple Unmanned Aerial Vehicle (multi-UAV) Reconnaissance and Search with Limited Communication Range Using Semantic Episodic Memory in Reinforcement Learning

Graph-based multi-agent reinforcement learning for collaborative search and tracking of multiple UAVs

Learning Multi-Agent Cooperation via Considering Actions of Teammates.

Joint optimization of communication and mission performance for multi-UAV collaboration network: A multi-agent reinforcement learning method

Environmental-Impact-Based Multi-Agent Reinforcement Learning

Generative subgoal oriented multi-agent reinforcement learning through potential field

Securing demand–response in smart grids against false pricing attacks

A multi-agent reinforcement learning method for distribution system restoration considering dynamic network reconfiguration

Long-short-view aware multi-agent reinforcement learning for signal snippet distillation in delirium movement detection

Joint Spectrum and Power Allocation in Wireless Network: A Two-Stage Multi-Agent Reinforcement Learning Method

Multi-agent reinforcement learning method for cutting parameters optimization based on simulation and experiment dual drive environment

A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning

Population Game-Assisted Multi-Agent Reinforcement Learning Method for Dynamic Multi-Vehicle Route Selection

A smart inventory management system with medication demand dependencies in a hospital supply chain: A multi-agent reinforcement learning approach

Cooperative Multiagent Reinforcement Learning Coupled With A* Search for Ship Multicabin Equipment Layout Considering Pipe Route

Multi-agent cooperative strategy with explicit teammate modeling and targeted informative communication

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-agent Reinforcement Learning Method Research Articles

Related Topics

Articles published on Multi-agent Reinforcement Learning Method

Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method.

Knowledge graph-enhanced multi-agent reinforcement learning for adaptive scheduling in smart manufacturing

Safe robust multi-agent reinforcement learning with neural control barrier functions and safety attention mechanism

Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards

Multiple Unmanned Aerial Vehicle (multi-UAV) Reconnaissance and Search with Limited Communication Range Using Semantic Episodic Memory in Reinforcement Learning

Graph-based multi-agent reinforcement learning for collaborative search and tracking of multiple UAVs

Learning Multi-Agent Cooperation via Considering Actions of Teammates.

Joint optimization of communication and mission performance for multi-UAV collaboration network: A multi-agent reinforcement learning method

Environmental-Impact-Based Multi-Agent Reinforcement Learning

Generative subgoal oriented multi-agent reinforcement learning through potential field

Securing demand–response in smart grids against false pricing attacks

A multi-agent reinforcement learning method for distribution system restoration considering dynamic network reconfiguration

Long-short-view aware multi-agent reinforcement learning for signal snippet distillation in delirium movement detection

Joint Spectrum and Power Allocation in Wireless Network: A Two-Stage Multi-Agent Reinforcement Learning Method

Multi-agent reinforcement learning method for cutting parameters optimization based on simulation and experiment dual drive environment

A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning

Population Game-Assisted Multi-Agent Reinforcement Learning Method for Dynamic Multi-Vehicle Route Selection

A smart inventory management system with medication demand dependencies in a hospital supply chain: A multi-agent reinforcement learning approach

Cooperative Multiagent Reinforcement Learning Coupled With A* Search for Ship Multicabin Equipment Layout Considering Pipe Route

Multi-agent cooperative strategy with explicit teammate modeling and targeted informative communication