Abstract
This paper presents a methodology that uses intelligent agents to find a dynamic control policy for the problem of scheduling a single server across multiple products. The dynamic (state-dependent) policy optimizes a cost function based on work-in-process (WIP) inventory, backorder penalty costs, and setup costs, while meeting the productivity constraints for the products. The methodology uses a simulation optimization technique called Reinforcement Learning (RL) and was tested on a stochastic economic lot-scheduling problem (SELSP) with a state–action space of size 1.8 × 10⁷. The dynamic policies obtained through the RL-based approach outperformed various cyclic policies. The RL approach was implemented via a multi-agent control architecture in which a decision agent was assigned to each product. A neural-network approach based on the least-mean-square (LMS) algorithm was used to approximate the reinforcement value function during the implementation of the RL-based methodology. Finally, the dynamic control policy over the large state space was extracted from the reinforcement values using a commercially available tree classifier tool.
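The abstract does not give implementation details, so the sketch below is only a rough illustration of the kind of machinery it describes: a generic Q-learning agent with a linear value-function approximator trained by the LMS (delta) rule, applied to a toy single-server, multi-product problem with holding, backorder, and setup costs. Generic Q-learning here stands in for the paper's RL scheme, and a single agent stands in for its multi-agent architecture; all parameter values, the Bernoulli demand model, and the feature set are assumptions made for illustration.

```python
import random

# Illustrative problem parameters (assumptions, not the paper's values).
N_PRODUCTS = 3
HOLD_COST = [1.0, 1.5, 2.0]       # per-unit WIP holding cost per period
BACKORDER_COST = [5.0, 5.0, 5.0]  # per-unit backorder penalty per period
SETUP_COST = 10.0                 # cost of switching the server to a new product
DEMAND_PROB = [0.25, 0.30, 0.20]  # Bernoulli demand per product per period
ALPHA, GAMMA, EPSILON = 0.01, 0.95, 0.1
INV_CAP = 10                      # clamp inventories to keep features bounded

def features(inv, setup, action):
    """Linear features: per-product inventory, a 'no setup needed' flag, bias."""
    return [float(v) for v in inv] + [1.0 if setup == action else 0.0, 1.0]

def q_value(w, inv, setup, a):
    return sum(wi * xi for wi, xi in zip(w[a], features(inv, setup, a)))

def step(inv, setup, a):
    """Simulate one period: produce one unit of product a, draw demand, cost it."""
    cost = SETUP_COST if a != setup else 0.0
    inv = inv[:]
    inv[a] += 1
    for i in range(N_PRODUCTS):
        if random.random() < DEMAND_PROB[i]:
            inv[i] -= 1
        inv[i] = max(-INV_CAP, min(INV_CAP, inv[i]))
        cost += HOLD_COST[i] * max(inv[i], 0) + BACKORDER_COST[i] * max(-inv[i], 0)
    return inv, a, -cost          # reward = negative cost

# One linear weight vector per action; the LMS (delta-rule) update below is
# the stochastic-gradient step that trains the value-function approximator.
w = [[0.0] * (N_PRODUCTS + 2) for _ in range(N_PRODUCTS)]
inv, setup = [0] * N_PRODUCTS, 0
for t in range(200_000):
    if random.random() < EPSILON:     # epsilon-greedy exploration
        a = random.randrange(N_PRODUCTS)
    else:
        a = max(range(N_PRODUCTS), key=lambda b: q_value(w, inv, setup, b))
    nxt_inv, nxt_setup, r = step(inv, setup, a)
    best_next = max(q_value(w, nxt_inv, nxt_setup, b) for b in range(N_PRODUCTS))
    td_error = r + GAMMA * best_next - q_value(w, inv, setup, a)
    for j, xj in enumerate(features(inv, setup, a)):
        w[a][j] += ALPHA * td_error * xj   # LMS weight update
    inv, setup = nxt_inv, nxt_setup

greedy = max(range(N_PRODUCTS), key=lambda b: q_value(w, [0] * N_PRODUCTS, 0, b))
print("greedy action from an empty-inventory state:", greedy)
```

In the same spirit as the abstract's final step, a compact rule-based policy could then be extracted by fitting a decision-tree classifier (for example, scikit-learn's DecisionTreeClassifier) to sampled (state, greedy action) pairs, though the paper itself used a commercial tree classifier tool.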