예산제약이 존재하는 다기간, 다제품 재고관리문제에서 강화학습 기법의 효율성 개선 방안 연구

Jiheon Kim,Daiki Min

doi:10.7737/kmsr.2022.39.2.017

Abstract

This paper considers the use of reinforcement learning for a multi-period, multi-item inventory control problem with a budget constraint. In the problem, we decide the order quantities of multiple items considering budget constraints so as to minimizes the total inventory cost including inventory holding cost and backlog cost. The previous literature proposed a modified Q-learning that include an optimization model in the Q-learning procedure to handle budget constrained actions, but it lacks the scalability. To address this issue, this paper proposed a two-stage method: the Q-learning learns actions without considering the budget constraint in the first stage, and an optimization model adjusts the learned actions so as to satisfy the budget constraint in the second stage. Numerical study compares the performance of the proposed two-stage method with others such as a conventional Q-learning without the budget constraint and the modified Q-learning in the literature. The numerical experiments reveal that the proposed method significantly reduces the computation time without increasing the total inventory cost.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

예산제약이 존재하는 다기간, 다제품 재고관리문제에서 강화학습 기법의 효율성 개선 방안 연구

Abstract

Talk to us

Similar Papers

More From: KOREAN MANAGEMENT SCIENCE REVIEW

Lead the way for us

Similar Papers

예산제약을 고려한 다기간 Newsvendor 문제에서의 Q-learning 기법 적용
Nahee Park ... Xiajiun Lau
Journal of the Korean Operations Research and Management Science Society | VOL. 47
Nahee Park, et. al.Nahee Park ... Xiajiun Lau
28 Feb 2022
Journal of the Korean Operations Research and Management Science Society | VOL. 47

Two parameter-tuned meta-heuristics for a discounted inventory control problem in a fuzzy environment
Seyed Mohsen Mousavi ... Hendrik Simon Cornelis Metselaar
Information Sciences | VOL. 276
Seyed Mohsen Mousavi, et. al.Seyed Mohsen Mousavi ... Hendrik Simon Cornelis Metselaar
27 Feb 2014
Information Sciences | VOL. 276

Proposed Implementation of an Integration Inventory Model to Supply Chain System involving Supplier, Manufacture and Buyer. (Study Case: PT. X)

-

15 Jun 2018
15 Jun 2018

Understanding the Costs of Surgery: A Bottom-Up Cost Analysis of Both a Hybrid Operating Room and Conventional Operating Room.
Sejal Patel ... Maroeska M. Rovers
International Journal of Health Policy and Management | VOL. 11
Sejal Patel, et. al.Sejal Patel ... Maroeska M. Rovers
27 Jul 2020
International Journal of Health Policy and Management | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

예산제약이 존재하는 다기간, 다제품 재고관리문제에서 강화학습 기법의 효율성 개선 방안 연구

Abstract

Talk to us

Similar Papers

More From: KOREAN MANAGEMENT SCIENCE REVIEW