Deep Reinforcement Learning for Minimizing Tardiness in Parallel Machine Scheduling With Sequence Dependent Family Setups

Bohyung Paeng,In-Beom Park,Jonghun Park

doi:10.1109/access.2021.3097254

Bohyung Paeng, In-Beom Park + Show 1 more

Open Access

https://doi.org/10.1109/access.2021.3097254

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 16	License type: CC BY 4.0

Affiliation: Seoul National University, Sungkyunkwan University

Abstract

Parallel machine scheduling with sequence-dependent family setups has attracted much attention from academia and industry due to its practical applications. In a real-world manufacturing system, however, solving the scheduling problem becomes challenging since it is required to address urgent and frequent changes in demand and due-dates of products. To minimize the total tardiness of the scheduling problem, we propose a deep reinforcement learning (RL) based scheduling framework in which trained neural networks (NNs) are able to solve unseen scheduling problems without re-training even when such changes occur. Specifically, we propose state and action representations whose dimensions are independent of production requirements and due-dates of jobs while accommodating family setups. At the same time, an NN architecture with parameter sharing was utilized to improve the training efficiency. Extensive experiments demonstrate that the proposed method outperforms the recent metaheuristics, rule-based, and other RL-based methods in terms of total tardiness. Moreover, the computation time for obtaining a schedule by our framework is shorter than those of the metaheuristics and other RL-based methods.

Highlights

As the competition among enterprises intensifies, production scheduling becomes one of the essential decision-making problems in modern manufacturing systems
We focus on the unrelated parallel machine scheduling problem (UPMSP) with sequence-dependent family setup time (SDFST), which has attracted a great deal of attention in various domains such as semiconductor [3]–[5], chemical [6], and food industries [7]
In this paper, we proposed a deep reinforcement learning (DRL)-based method for solving UPMSPs with SDFST constraint to minimize the total tardiness

Summary

INTRODUCTION

As the competition among enterprises intensifies, production scheduling becomes one of the essential decision-making problems in modern manufacturing systems. Since learning complexity grows quickly as the numbers of jobs and machines increase, it is intractable to re-train a DNN whenever such variabilities occur in large-scale manufacturing systems To this end, we propose a DRL-based method for minimizing tardiness for UPMSP with SDFST to address the above challenges. For solving UPMSPs by utilizing RL-based methods, Zhang et al adopted QL to minimize the weighted tardiness [37], [38] They employed a linear basis function to approximate Q-values for given state features indicating the status. Yuan et al [39], [40] addressed ready time constraints and machine breakdown for minimizing the total tardiness and number of tardy jobs, respectively They adopted a tabular method that stores Q-values by exploring state-action pairs.

PROBLEM DESCRIPTION

PROPOSED METHOD

MDP FORMULATION

12: Assign Jj on Mi

5: Observe sk and Wk

EXPERIMENTAL SETTINGS

PERFORMANCE COMPARISON

Findings

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning for Minimizing Tardiness in Parallel Machine Scheduling With Sequence Dependent Family Setups

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups
Funing Li ... Tobias Reggelin
Journal of Intelligent Manufacturing | VOL. 35
Funing Li, et. al.Funing Li ... Tobias Reggelin
09 Mar 2023
Journal of Intelligent Manufacturing | VOL. 35

Sample effficient deep reinforcement learning for control

-

15 Dec 2019
15 Dec 2019

Deep Reinforcement Learning: A New Frontier in Computer Vision Research
Sejuti Rahman ... Sujan Sarker
-
Sejuti Rahman, et. al.Sejuti Rahman ... Sujan Sarker
01 Jan 2020
01 Jan 2020

Bounds on Multiprocessing Timing Anomalies
R L Graham
SIAM Journal on Applied Mathematics | VOL. 17
R L GrahamR L Graham
01 Mar 1969
SIAM Journal on Applied Mathematics | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning for Minimizing Tardiness in Parallel Machine Scheduling With Sequence Dependent Family Setups

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access