Stochastic Economic Lot Scheduling via Self-Attention Based Deep Reinforcement Learning

Wen Song,Jing Zhuang,Zhiguang Cao,Nan Mi,Qiqiang Li

doi:10.1109/tase.2023.3248229

Abstract

The Stochastic Economic Lot Scheduling Problem (SELSP) is a difficult dynamic optimization problem with wide industrial applications. Traditional methods such as hyper-heuristics are manually designed based on substantial expert knowledge, which may limit their optimization performance. Recently, Deep Reinforcement Learning (DRL) is shown to be promising in automatically learning scheduling policies for SELSP. However, its performance is still quite far from that of hyper-heuristics, due to the lack of suitable deep models. In this paper, we propose a novel DRL method to learn dynamic scheduling policies for SELSP in an end-to-end fashion. Based on self-attention, our method can effectively extract useful features from raw state information, and is flexible in handling different numbers of products, which is not viable for previous methods. Experiments on a complex biopharmaceutical manufacturing process show that our method outperforms a recent DRL method and state-of-the-art hyper-heuristics. Moreover, the trained policy performs better in environments different from training with demand forecast errors and varying number of products, showing its strong robustness and generalization ability. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Note to Practitioners</i> —The Stochastic Economic Lot Scheduling Problem (SELSP) is an important problem for manufacturing enterprises, which is to optimally balance the production and inventory so as to minimize the total cost. However, SELSP is very challenging to solve due to the involvement of uncertain factors such as customer demands and machine failures. Traditional methods for solving SELSP, such as heuristic policies and hyper-heuristics, heavily rely on human experiences to design and hence the performance could be limited. This paper proposes a Deep Reinforcement Learning (DRL) based method to automatically learn scheduling policy for solving SELSP, which could alleviate the above limitation through a self-attention based feature extraction mechanism and reward based training. Experimental results on a realistic manufacturing process show that our method can deliver higher revenue than conventional manual policy and an existing DRL based method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic Economic Lot Scheduling via Self-Attention Based Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automation Science and Engineering

Lead the way for us

Journal: IEEE Transactions on Automation Science and Engineering	Publication Date: Apr 1, 2024
Citations: 4

Similar Papers

Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle
Qilei Zhang ... Qixin Sha
IEEE Access | VOL. 8
Qilei Zhang, et. al.Qilei Zhang ... Qixin Sha
01 Jan 2020
IEEE Access | VOL. 8

Continuous Control for Autonomous Underwater Vehicle Path Following Using Deep Interactive Reinforcement Learning
Qilei Zhang ... Zheng Fang
-
Qilei Zhang, et. al.Qilei Zhang ... Zheng Fang
01 Oct 2022
01 Oct 2022

Sample effficient deep reinforcement learning for control

-

15 Dec 2019
15 Dec 2019

Autonomous Driving Decision-making Based on the Combination of Deep Reinforcement Learning and Rule-based Controller
Jinzhu Wang Jinzhu Wang ... Jie Bai Jie Bai
-
Jinzhu Wang Jinzhu Wang, et. al.Jinzhu Wang Jinzhu Wang ... Jie Bai Jie Bai
30 Sep 2021
30 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic Economic Lot Scheduling via Self-Attention Based Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automation Science and Engineering