A recurrent reinforcement learning strategy for optimal scheduling of partially observable job-shop and flow-shop batch chemical plants under uncertainty

Daniel Rangel-Martinez,Luis A Ricardez-Sandoval

doi:10.1016/j.compchemeng.2024.108748

Abstract

This study presents a methodology that makes use of Deep Recurrent Q-Learning to develop an agent that acts as an online scheduler for flow-shop or job-shop batch plants with zero-wait restriction under uncertainty. The environment is assumed to be partially observable, i.e., it does not follow the Markov property and information has to be gathered from previous time intervals. The processing times of the machines are unknown to the agent whereas production demand realizations are provided during the operation and not known a priori. The agent aims to complete the demands and to minimize the makespan of the process. Moreover, the agent should avoid violation of constraints associated with product allocation, time horizon, and storage capacity. Three case studies featuring two job-shops and one flow-shop are presented to show the benefits of this framework with environments where the information is limited. Results showed that the agents can generate schedules considering the uncertain parameters of the system while aiming to reduce the makespan of the process. Tests on the agents resulted in small errors in the decision-making process (less than 2%) thus demonstrating that DRQN can serve as a reliable tool for online scheduling subject to uncertainty in partially observable job and flow-shops.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A recurrent reinforcement learning strategy for optimal scheduling of partially observable job-shop and flow-shop batch chemical plants under uncertainty

Abstract

Talk to us

Similar Papers

More From: Computers and Chemical Engineering

Lead the way for us

Journal: Computers and Chemical Engineering	Publication Date: May 29, 2024
License type: cc-by-nc-nd

Similar Papers

Design and evaluation of multi-objective online scheduling strategies for parallel machines using computational intelligence

-

24 Nov 2006
24 Nov 2006

Research on online scheduling and charging strategy of robots based on shortest path algorithm
Xiao Fu ... Jiaxin Wang
Computers & Industrial Engineering | VOL. 153
Xiao Fu, et. al.Xiao Fu ... Jiaxin Wang
05 Jan 2021
Computers & Industrial Engineering | VOL. 153

All Options Are on the Table? Time Horizons and the Decision-Making Process in Conflict
Rotem Dvir
Foreign Policy Analysis | VOL. 17
Rotem DvirRotem Dvir
18 Aug 2021
Foreign Policy Analysis | VOL. 17

Analysis of the Performance of FLNG Vessels According to Basic Design Parameters
Daniel P Vieira ... Claudio M P Sampaio
-
Daniel P Vieira, et. al.Daniel P Vieira ... Claudio M P Sampaio
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A recurrent reinforcement learning strategy for optimal scheduling of partially observable job-shop and flow-shop batch chemical plants under uncertainty

Abstract

Talk to us

Similar Papers

More From: Computers and Chemical Engineering