Abstract
One of the key challenges in production planning, especially in discrete manufacturing, is to determine when to release which orders to the shop floor. The major aim of this planning task is to balance Work-In-Process (WIP) and utilisation levels against the timely completion of orders. Two characteristics make this planning task particularly difficult: (i) the highly nonlinear relationship between WIP, flow times and output, and (ii) the dynamically changing environment. Nonetheless, most state-of-the-art models use static lead times to address this problem. Only recently have some papers set lead times dynamically based on flow-time forecasts in order to react to changing operational conditions, reporting promising results. This paper contributes to this line of research by presenting an order release model that uses reinforcement learning (RL) to set lead times dynamically over time. The RL agent is specifically designed for processes with periodic feedback and a highly variable context. We compare the performance of our new RL algorithm to static order release models and state-of-the-art deep Q-learning agents using a multi-stage, multi-product flow-shop simulation model. The results show that, especially in scenarios with high utilisation, our proposed method outperforms the other approaches.
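The core idea of the abstract — an agent that receives periodic feedback from the shop floor and adjusts the lead time used for order release — can be illustrated with a minimal tabular Q-learning sketch. Everything below (the discretised WIP states, the candidate lead times, the toy reward function, and the stand-in shop model) is an illustrative assumption, not the paper's actual algorithm or simulation.

```python
import random

random.seed(0)

WIP_LEVELS = ["low", "medium", "high"]   # discretised WIP observation (assumed)
LEAD_TIMES = [1, 2, 3, 4]                # candidate lead times in periods (assumed)
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1    # learning rate, discount, exploration rate

# Q-table over (WIP state, lead-time action) pairs
Q = {(s, a): 0.0 for s in WIP_LEVELS for a in LEAD_TIMES}

def choose_lead_time(state):
    """Epsilon-greedy choice of the lead time used for order release."""
    if random.random() < EPSILON:
        return random.choice(LEAD_TIMES)
    return max(LEAD_TIMES, key=lambda a: Q[(state, a)])

def toy_shop_step(state, lead_time):
    """Stand-in for a flow-shop simulation step: returns reward and next WIP state.
    The toy reward penalises both high WIP and a lead time far from an
    (assumed) sweet spot of 2 periods, mimicking the WIP/timeliness trade-off."""
    wip_cost = WIP_LEVELS.index(state)
    reward = -(wip_cost + abs(lead_time - 2))
    next_state = random.choice(WIP_LEVELS)
    return reward, next_state

state = "medium"
for period in range(5000):               # periodic feedback loop
    action = choose_lead_time(state)
    reward, next_state = toy_shop_step(state, action)
    best_next = max(Q[(next_state, a)] for a in LEAD_TIMES)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])
    state = next_state

# Greedy policy after training: one lead-time setting per observed WIP level
learned = {s: max(LEAD_TIMES, key=lambda a: Q[(s, a)]) for s in WIP_LEVELS}
print(learned)
```

In this toy model the agent converges to the lead time with the best reward trade-off for every WIP level; the paper replaces the stand-in step with a multi-stage, multi-product flow-shop simulation and a tailored agent rather than plain tabular Q-learning.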