Abstract
This study addresses the optimal inventory management problem for new smartphone products as an effective example of a supply chain with a short product life cycle. The determination of the optimal inventory level leads to a reduction of lost opportunities and defective inventory, which is an important issue from a profit improvement perspective. Mathematical optimization and reinforcement learning approaches have been proposed for inventory management; however, most of these approaches focus on products that are regularly sold over a long period. Thus, when the target is a new product, it is difficult to optimize inventory control from its day of release due to a lack of sufficient data for learning. To solve this problem, we focus on model-based deep reinforcement learning with high sample efficiency and propose an inventory management method for new products that combines model learning in an offline environment and planning in an online environment. Simulations using real-world historical sales datasets demonstrate that the proposed method outperforms existing methods in terms of profitability, efficiency, and customer satisfaction. In particular, the proposed method improves total rewards and inventory turnover by ¿5% each than the heuristic method while maintaining the same stock-out rate. In addition, the results demonstrate that the proposed method can maintain stable inventory control for multiproduct and multistore supply chains.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.