Policy Transition of Reinforcement Learning for an Agent Based SCM System

Gang Zhao,Ruoying Sun

doi:10.1109/indin.2006.275663

Abstract

Reinforcement learning (RL) is successfully applied to some dynamical and unpredictable domains. The Supply Chain Management (SCM) is NP-hard problem. Some proposed RL methods perform better than traditional tools for dynamic problem solving in SCM. It realizes on-line learning and performs efficiently in some applications, but RL agent reacts worse than some heuristic methods to sudden changes in SCM demand since the trial-and-error characteristic of RL is time-consuming in practice. By surveying an efficient policy transition mechanism in RL about how to mapping existing policies in the previous task to a new policies in a changed task, this paper proposes a novel RL agent based SCM system that decreases learning time of the RL agent to a dynamic environment. As the result, the RL agent derives the maximal profit using RL technique as jobs coming with a stable distribution. Further, the RL agent makes the optimal procurement satisfying the requirement of sudden changes in the supply chain network by the policy transition mechanism.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Policy Transition of Reinforcement Learning for an Agent Based SCM System

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The Improvement on Reinforcement Learning for SCM by the Agent Policy Mapping
Ruoying Sun ... Shoji Tatsumi
-
Ruoying Sun, et. al.Ruoying Sun ... Shoji Tatsumi
01 Nov 2006
01 Nov 2006

Application of multi-agent Reinforcement Learning to supply chain ordering management
Gang Zhao ... Ruoying Sun
-
Gang Zhao, et. al.Gang Zhao ... Ruoying Sun
01 Aug 2010
01 Aug 2010

Contributors
-
Operations Research | VOL. 59
--
01 Aug 2011
Operations Research | VOL. 59

A research agenda to reflect reality: On being responsive
Robert Glenn Richey ... Beth Davis‐Sramek
Journal of Business Logistics | VOL. 43
Robert Glenn Richey, et. al.Robert Glenn Richey ... Beth Davis‐Sramek
27 Jan 2022
Journal of Business Logistics | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Policy Transition of Reinforcement Learning for an Agent Based SCM System

Abstract

Talk to us

Similar Papers