Abstract
Reinforcement learning (RL) is an efficient and popular approach to problems in which an agent has no prior knowledge of the environment; it is characterized by trial-and-error exploration and delayed rewards. An RL agent must derive an optimal policy by interacting directly with the environment and gathering information about it. Supply chain management (SCM) is a challenging problem for agent-based electronic business. Several proposed RL methods outperform traditional tools for dynamic problem solving in SCM: they realize online learning and perform efficiently in some applications, but an RL agent reacts worse than some heuristic methods to sudden changes in SCM demand, because the trial-and-error characteristic of RL is time-consuming in practice. Building on a survey of an efficient policy transition mechanism in RL, that is, how to map existing policies from a previous task to new policies in a changed task, this paper proposes a novel RL-agent-based SCM system that reduces the RL agent's learning time in a dynamic environment. As a result, the RL agent derives the maximal profit using RL techniques when jobs arrive with a stable distribution. Furthermore, through the policy transition mechanism, the RL agent makes optimal procurement decisions that satisfy the requirements imposed by sudden changes in the supply chain network.
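The paper does not give implementation details in the abstract, but the policy transition idea can be illustrated with a minimal, hypothetical sketch: a tabular Q-learning agent whose learned Q-table is copied into the agent for a changed task, so learning resumes from the previous policy instead of from scratch. All class and function names below are illustrative assumptions, not the authors' actual system.

```python
import random


class QLearningAgent:
    """Minimal tabular Q-learning agent (illustrative sketch)."""

    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.9, eps=0.1):
        self.n_states, self.n_actions = n_states, n_actions
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        # Q-table initialized to zero: no prior knowledge of the environment.
        self.q = [[0.0] * n_actions for _ in range(n_states)]

    def act(self, s, rng):
        # Epsilon-greedy selection: the trial-and-error component of RL.
        if rng.random() < self.eps:
            return rng.randrange(self.n_actions)
        row = self.q[s]
        return row.index(max(row))

    def update(self, s, a, r, s2):
        # One-step Q-learning backup, propagating delayed rewards.
        best_next = max(self.q[s2])
        self.q[s][a] += self.alpha * (r + self.gamma * best_next - self.q[s][a])


def transfer_policy(old_agent, new_agent):
    """Policy transition: seed the new task's Q-table from the old task's,
    over the overlapping state-action space, so the agent need not relearn
    from scratch after a sudden change in the task."""
    for s in range(min(old_agent.n_states, new_agent.n_states)):
        for a in range(min(old_agent.n_actions, new_agent.n_actions)):
            new_agent.q[s][a] = old_agent.q[s][a]
    return new_agent
```

In an SCM setting, states might encode inventory levels and actions order quantities; when the demand distribution shifts, `transfer_policy` warm-starts the new agent, which is the intuition behind the reduced learning time claimed in the abstract.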