Abstract

Dynamic scheduling problems have received increasing attention in recent years because of their practical implications. To enable real-time, intelligent decision-making in dynamic scheduling, we study the dynamic permutation flowshop scheduling problem (PFSP) with new job arrivals using deep reinforcement learning (DRL). A system architecture for solving the dynamic PFSP with DRL is proposed, and a mathematical model minimizing total tardiness cost is established. The DRL-based intelligent scheduling system is then modeled, with its state features, actions, and reward designed, and the advantage actor-critic (A2C) algorithm is adapted to train the scheduling agent. The learning curve indicates that the scheduling agent learns to generate better solutions efficiently during training. Extensive experiments compare the A2C-based scheduling agent with each individual action, other DRL algorithms, and meta-heuristics. The results show that the A2C-based scheduling agent performs well in terms of solution quality, CPU time, and generalization. Notably, the trained agent generates a scheduling action in only 2.16 ms on average, which is almost instantaneous and therefore suitable for real-time scheduling. Our work can help build a self-learning, real-time optimizing, and intelligent decision-making scheduling system.
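To make the real-time inference step concrete, the following is a minimal, hypothetical PyTorch sketch of an A2C-style scheduling agent selecting a dispatching action from workshop state features. The feature count, action count, and network sizes are illustrative assumptions, not the paper's exact architecture.

```python
# Minimal, hypothetical sketch of A2C-style action selection for the scheduling
# agent. N_FEATURES, N_ACTIONS, and the layer sizes are assumptions made for
# illustration only; the paper's exact architecture may differ.
import torch
import torch.nn as nn

N_FEATURES = 12   # assumed number of workshop state features
N_ACTIONS = 6     # assumed number of candidate scheduling actions

class ActorCritic(nn.Module):
    def __init__(self, n_features: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.shared = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU())
        self.actor = nn.Linear(hidden, n_actions)  # policy head: action logits
        self.critic = nn.Linear(hidden, 1)         # value head: state value V(s)

    def forward(self, state: torch.Tensor):
        h = self.shared(state)
        return self.actor(h), self.critic(h)

agent = ActorCritic(N_FEATURES, N_ACTIONS)

@torch.no_grad()
def select_action(state_features) -> int:
    """Sample a scheduling action from the current policy (inference only, fast)."""
    state = torch.as_tensor(state_features, dtype=torch.float32)
    logits, _ = agent(state)
    return int(torch.distributions.Categorical(logits=logits).sample())
```

For example, `select_action([0.3] * N_FEATURES)` returns the index of the sampled dispatching action for that state; because inference is a single small forward pass, per-decision latencies on the order of milliseconds are plausible.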

Highlights

  • This paper solves the dynamic permutation flowshop scheduling problem (PFSP) with new job arrivals to minimize total tardiness cost using deep reinforcement learning (DRL)

  • This study aims to establish an intelligent decision-making scheduling system to provide real-time optimization for dynamic scheduling problems

  • A DRL-based scheduling system is proposed, with state features, actions, and a reward designed for the scheduling agent and the workshop environment (a minimal environment sketch follows this list)
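The last highlight refers to the designed state features, actions, and reward. Below is a minimal, hypothetical gym-style skeleton of such a workshop environment; the class name, dispatching-rule set, feature vector, and reward shaping are illustrative assumptions rather than the paper's exact design.

```python
# Minimal, hypothetical skeleton of the workshop environment seen by the agent:
# a state-feature vector, discrete actions (assumed here to be dispatching
# rules), and a reward derived from the growth of total tardiness cost.
# All names and the reward shaping are illustrative assumptions.
import numpy as np

DISPATCHING_RULES = ["SPT", "LPT", "EDD", "FIFO"]  # assumed candidate actions

class DynamicPFSPEnv:
    def __init__(self, instance):
        self.instance = instance        # processing times, due dates, job arrivals
        self.total_tardiness_cost = 0.0

    def _state_features(self) -> np.ndarray:
        # Assumed features, e.g. machine utilization, due-date tightness,
        # remaining work, and the number of waiting or newly arrived jobs.
        return np.zeros(12, dtype=np.float32)

    def reset(self) -> np.ndarray:
        self.total_tardiness_cost = 0.0
        return self._state_features()

    def step(self, action: int):
        rule = DISPATCHING_RULES[action]
        # ...apply `rule` to sequence the currently waiting jobs, advance the
        # simulation clock, and recompute the accumulated tardiness cost...
        new_cost = self.total_tardiness_cost
        reward = -(new_cost - self.total_tardiness_cost)  # penalize added tardiness cost
        self.total_tardiness_cost = new_cost
        done = False  # True once all jobs, including new arrivals, are completed
        return self._state_features(), reward, done, {}
```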


Summary

Introduction

Wu et al. [54] used deep learning to solve the dynamic dispatching of unreliable machines in re-entrant production systems. They combined a deep neural network (DNN) with Markov decision processes (MDP) to assign different priorities to job groups in order to minimize cycle time or maximize throughput. Li et al. [55] studied the flexible job-shop scheduling problem (FJSP) with sequence-dependent setup times and limited dual resources using machine learning and meta-heuristics. However, the dynamic PFSP with new job arrivals and a total tardiness cost criterion has not yet been solved by DRL. This paper studies the dynamic PFSP with new job arrivals to minimize total tardiness cost using DRL; to the best of our knowledge, this is the first attempt to do so. Our results show that the DRL-based scheduling method outperforms traditional meta-heuristics (IG and GA) in solution quality and CPU time by a large margin for the dynamic PFSP.
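As a concrete reading of the objective, assuming each job j has completion time C_j, due date d_j, and a unit tardiness cost w_j (the paper's exact cost coefficients are not given in this excerpt), the total tardiness cost to be minimized can be written as:

```latex
\min \ \mathrm{TTC} \;=\; \sum_{j=1}^{n} w_j \, T_j,
\qquad T_j = \max\{0,\; C_j - d_j\}
```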

Problem Description
Mathematical Modelling of the Intelligent Scheduling System
Reward
State Features
Actions
Numerical Experiments
Training Process of A2C
Comparison with SDR
Comparison with DRL and Meta-Heuristics (IG and GA)
Generalization
Findings
Conclusions