Multi-agent Reinforcement Learning Research Articles

The high-quality development of the manufacturing industry necessitates accelerating its transformation towards high-end, intelligent, and green development. Considering logistics resource constraints, the impact of dynamic disturbance events on production, and the need for energy-efficient production, the integrated scheduling of production equipment and automated guided vehicles (AGVs) in a flexible job shop environment is investigated in this study. Firstly, a static model for the integrated scheduling of production equipment and AGVs (ISPEA) is developed based on mixed-integer programming, which aims to optimize the maximum completion time and total production energy consumption (EC). In recent years, reinforcement learning, including deep reinforcement learning (DRL), has demonstrated significant advantages in handling workshop scheduling issues with sequential decision-making characteristics, which can fully utilize the vast quantity of historical data accumulated in the workshop and adjust production plans in a timely manner based on changes in production conditions and demand. Accordingly, a DRL-based approach is introduced to address the common production disturbances in emergency order insertions. Combined with the characteristics of the ISPEA problem and an event-driven strategy for handling dynamic events, four types of agents, namely workpiece selection, machine selection, AGV selection, and target selection agents, are set up, which refine workshop production status characteristics as observation inputs and generate rules for selecting workpieces, machines, AGVs, and targets. These agents are trained offline using the QMIX multi-agent reinforcement learning framework, and the trained agents are utilized to solve the dynamic ISPEA problem. Finally, the effectiveness of the proposed model and method is validated through a comparison of the solution performance with other typical optimization algorithms for various cases.

Read full abstract

Multi-agent systems (MAS) consist of multiple autonomous agents interacting to achieve collective objectives. Multi-agent reinforcement learning (MARL) enhances these systems by enabling agents to learn optimal behaviors through interaction, thus improving their coordination in dynamic environments. However, MARL faces significant challenges in adapting to complex dependencies on past states and actions, which are not adequately represented by the current state alone in reactive systems. This paper addresses these challenges by considering MAS operating under task specifications formulated as Generalized Reactivity of rank 1 (GR(1)). These synthesized strategies are used as a priori knowledge to guide the learning. To tackle the difficulties of handling non-Markovian tasks in reactive systems, we propose a novel synchronized decentralized training paradigm that guides agents to learn within the MARL framework using a reward structure constructed from decomposed synthesized strategies of GR(1). We initially formalize the synthesis of GR(1) strategies as a reachability problem of winning states of the system. Subsequently, we develop a decomposition mechanism that constructs individual reward structures for decentralized MARL, incorporating potential values calculated through value iteration. Theoretical proofs are provided to verify that the safety and liveness properties are preserved. We evaluate our approach against other state-of-the-art methods under various GR(1) specifications and scenario maps, demonstrating superior learning efficacy and optimal rewards per episode. Additionally, we show that the decentralized training paradigm outperforms the centralized training paradigm. The value iteration strategy used to calculate potential values for the reward structure is compared against two other strategies, showcasing its advantages.

Read full abstract

Multi-agent Reinforcement Learning Research Articles

Related Topics

Articles published on Multi-agent Reinforcement Learning

CESDQL: Communicative experience-sharing deep Q-learning for scalability in multi-robot collaboration with sparse reward

Feature selection integrating Shapley values and mutual information in reinforcement learning: An application in the prediction of post-operative outcomes in patients with end-stage renal disease

Cooperative price-based demand response program for multiple aggregators based on multi-agent reinforcement learning and shapley-value

Efficient multi-agent reinforcement learning HVAC power consumption optimization

A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

Real-time bidding with multi-agent reinforcement learning in multi-channel display advertising

Robust dynamic real-time control strategies for high-frequency bus service: a multi-agent reinforcement learning framework

High-Sample-Efficient Multiagent Reinforcement Learning for Navigation and Collision Avoidance of UAV Swarms in Multitask Environments

Adaptive Multi-layer Attention Double Dueling Deep Q-Network for Muti-agent Reinforcement Learning

Autonomous ecologies of construction: Collaborative modular robotic material eco-systems with deep multi-agent reinforcement learning

Multi-Vehicle Cooperative Decision-Making in Merging Area Based on Deep Multi-Agent Reinforcement Learning

Message Action Adapter Framework in Multi-Agent Reinforcement Learning

Integrated Early Flood Prediction using Sentinel-2 Imagery with VANET-MARL-based Deep Neural RNN

Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning

Dynamic Integrated Scheduling of Production Equipment and Automated Guided Vehicles in a Flexible Job Shop Based on Deep Reinforcement Learning

Multi-agent reinforcement learning with synchronized and decomposed reward automaton synthesized from reactive temporal logic

Adaptive multi-agent reinforcement learning for dynamic pricing and distributed energy management in virtual power plant networks

Selective Information Communication Considering the Influence of Each Other’s Messages in Multi-Agent Reinforcement Learning

Coordinating multi-agent reinforcement learning via dual collaborative constraints

A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-agent Reinforcement Learning Research Articles

Related Topics

Articles published on Multi-agent Reinforcement Learning

CESDQL: Communicative experience-sharing deep Q-learning for scalability in multi-robot collaboration with sparse reward

Feature selection integrating Shapley values and mutual information in reinforcement learning: An application in the prediction of post-operative outcomes in patients with end-stage renal disease

Cooperative price-based demand response program for multiple aggregators based on multi-agent reinforcement learning and shapley-value

Efficient multi-agent reinforcement learning HVAC power consumption optimization

A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

Real-time bidding with multi-agent reinforcement learning in multi-channel display advertising

Robust dynamic real-time control strategies for high-frequency bus service: a multi-agent reinforcement learning framework

High-Sample-Efficient Multiagent Reinforcement Learning for Navigation and Collision Avoidance of UAV Swarms in Multitask Environments

Adaptive Multi-layer Attention Double Dueling Deep Q-Network for Muti-agent Reinforcement Learning

Autonomous ecologies of construction: Collaborative modular robotic material eco-systems with deep multi-agent reinforcement learning

Multi-Vehicle Cooperative Decision-Making in Merging Area Based on Deep Multi-Agent Reinforcement Learning

Message Action Adapter Framework in Multi-Agent Reinforcement Learning

Integrated Early Flood Prediction using Sentinel-2 Imagery with VANET-MARL-based Deep Neural RNN

Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning

Dynamic Integrated Scheduling of Production Equipment and Automated Guided Vehicles in a Flexible Job Shop Based on Deep Reinforcement Learning

Multi-agent reinforcement learning with synchronized and decomposed reward automaton synthesized from reactive temporal logic

Adaptive multi-agent reinforcement learning for dynamic pricing and distributed energy management in virtual power plant networks

Selective Information Communication Considering the Influence of Each Other’s Messages in Multi-Agent Reinforcement Learning

Coordinating multi-agent reinforcement learning via dual collaborative constraints

A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning