A learning-based ordering policy for transition to intelligent perishable inventory management

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

ABSTRACT This paper contributes to perishable and substitutable inventory management literature by offering an intelligence-based decision-making structure exemplifying platelets under stochastic and age-different demand. An intelligent agent determines the hospital’s ordering policy while minimizing the long-run total cost. Platelets’ price and substitution depend on the age of platelets. The problem is configured using reinforcement learning and solved by SARSA and Q-learning to recommend (near) optimal ordering policy. To evaluate the performance of the structure, it is compared to (s, S), a traditional periodic-review policy formulated by two-stage stochastic programming through several test problems. The scheme is then tested for two famous issuing policies, FIFO and LIFO, compared with our proposed preference-based issuance. Findings demonstrate that the intelligent structure offers the inventory system an average cost savings of 10.13% due to higher responsiveness to uncertainty. Besides, the designed issuance surpasses FIFO and LIFO when the quality of platelet transfusions comes to importance.

Similar Papers
  • Research Article
  • Cite Count Icon 17
  • 10.1137/050643805
On the Implementation of Interior Point Decomposition Algorithms for Two-Stage Stochastic Conic Programs
  • Jan 1, 2009
  • SIAM Journal on Optimization
  • Sanjay Mehrotra + 1 more

In this paper we develop a practical primal interior decomposition algorithm for two-stage stochastic programming problems. The framework of this algorithm is similar to the framework in [S. Mehrotra and M. G. Özevin, “Decomposition based interior point methods for two-stage stochastic convex quadratic programs with recourse,” to appear in Oper. Res.; S. Mehrotra and M. G. Özevin, SIAM J. Optim., 18 (2007), pp. 206–222] and [G. Zhao, Math. Program., 90 (2001), pp. 507–536], however, their algorithm is altered in a simple yet fundamental way to achieve practical performance. In particular, this new algorithm weighs the log-barrier terms in the second stage problems differently from the theoretical algorithms analyzed in [S. Mehrotra and M. G. Özevin, Oper. Res., to appear], [S. Mehrotra and M. G. Özevin, SIAM J. Optim., 18 (2007), pp. 206–222], and [G. Zhao, Math. Program., 90 (2001), pp. 507–536]. We give a method for generating a suitable starting point; a method for selecting a good starting barrier parameter; a heuristic for first stage step-length calculation without performing line searches; and a method for adaptive addition of new scenarios over the course of the algorithm. The decomposition algorithm is implemented to solve two-stage stochastic conic programs with recourse whose underlying cones are Cartesian products of linear, second order, and semidefinite cones. The performance of primal decomposition method is studied on a set of randomly generated test problems as well as a two-stage stochastic programming extension of the Markowitz portfolio selection model. The computational results show that an efficient and stable implementation of the primal decomposition method is possible. These results also show that in problems with a large number of scenarios, the adaptive addition of scenarios can yield computational savings of up to 80%.

  • Research Article
  • 10.1287/opre.1110.0925
Contributors
  • Feb 1, 2011
  • Operations Research
  • Sandro Bosio

Contributors

  • Research Article
  • Cite Count Icon 5
  • 10.5267/j.ijiec.2022.7.004
Two-stage stochastic programming for the inventory routing problem with stochastic demands in fuel delivery
  • Jan 1, 2022
  • International Journal of Industrial Engineering Computations
  • Zhenping Li + 1 more

The inventory routing problem (IRP) arises in the joint practices of vendor-managed inventory (VMI) and vehicle routing problem (VRP), aiming to simultaneously optimize the distribution, inventory and vehicle routes. This paper studies the multi-vehicle multi-compartment inventory routing problem with stochastic demands (MCIRPSD) in the context of fuel delivery. The problem with maximum-to-level (ML) replenishment policy is modeled as a two-stage stochastic programming model with the purpose of minimizing the total cost, in which the inventory management and routing decisions are made in the first stage while the corresponding resource actions are implemented in the second stage. An acceleration strategy is incorporated into the exact single-cut Benders decomposition algorithm and its multi-cut version respectively to solve the MCIRPSD on the small instances. Two-phase heuristic approaches based on the single-cut decomposition algorithm and its multi-cut version are developed to deal with the MCIRPSD on the medium and large-scale instances. Comparing the performance of the proposed algorithms with the Gurobi solver within limited time, the average objective value obtained by the proposed algorithm has decreased more than 7.30% for the medium and large instances, which demonstrates the effectiveness of our algorithms. The impacts of the instance features on the results are further analyzed, and some managerial insights are concluded for the manager.

  • Research Article
  • 10.1108/mscra-06-2024-0024
Assembling the avengers! Trends and knowledge themes in 100 most-cited publications on the blood supply chain: a bibliometric analysis
  • May 26, 2025
  • Modern Supply Chain Research and Applications
  • Ismail Abdi Changalima

PurposeThis paper assembles the 100 most-cited journal articles and reviews on the blood supply chain to analyze the research trends and knowledge themes.Design/methodology/approachThe study conducted a bibliometric analysis of the 100 most-cited journal articles and reviews published between 2004 and 2023 in the context of the blood supply chain, as retrieved from the Scopus database. The study analyzed research trends in this area and knowledge themes through a keyword co-occurrence analysis and bibliographic coupling using VOSviewer.FindingsThe study reveals that the top 100 most-cited journal articles and reviews within the domain of the blood supply chain were published between 2004 and 2023, with a notable surge in research activity observed in the recent years. Through keyword analysis, five themes, which are “inventory management and supply chain management operations for blood products,” “stochastic and possibilistic programming for designing blood supply chain network and sustainability,” “robust optimization in blood supply chain network design under uncertainty and disaster conditions,” “Network design using multi-objective, possibilistic and two-stage stochastic programming in blood supply chains” and “perishable inventory management in blood supply chains,” were obtained. On the other hand, four themes from bibliographic coupling include “designing blood supply chain networks during the time of disasters and emergencies,” “optimizing and managing blood supply chains for enhancing efficiency and reducing wastage,” “inventory management and advanced techniques in the context blood supply chain management” and “enhancing safety, efficiency and continuous blood supply in the face of various challenges.”Originality/valueThis study uniquely analyzed 100 most-cited publications in the realm of the blood supply chain, which provides relevant knowledge themes that are crucial for future studies and contributing to the knowledge in this research domain.

  • Research Article
  • Cite Count Icon 107
  • 10.1016/j.ijpe.2015.09.033
Investigating risk and robustness measures for supply chain network design under demand uncertainty: A case study of glass supply chain
  • Oct 13, 2015
  • International Journal of Production Economics
  • Kannan Govindan + 1 more

Investigating risk and robustness measures for supply chain network design under demand uncertainty: A case study of glass supply chain

  • Conference Article
  • Cite Count Icon 1
  • 10.1109/icebeg.2011.5887165
Simulation analysis of VMI policies for the multi-period IRP with stochastic demands
  • May 1, 2011
  • Chao-Hua Chen + 1 more

The supply-chain-to-supply-chain business competition model creates the Vendor Managed Inventory (VMI) management model for the retailers. The previous studies show that most of the conventional inventory management strategies apply fixed-quantity or order-up-to-level supplement methods, rarely mentioning the impact of supplement quantity cutting or non-order-up-to-level strategy on the total cost of the inventory routing of VMI management model. This study proposes the inventory management strategy with supplement threshold and upper limit, builds an inventory management model by the systematic analogical method and analyzes the effects of the inventory management strategy with combinations of different supplement thresholds and upper limits on the total cost of the inventory routing as well as the relatively changing situation of the inventories and transportation costs of the suppliers and retailers in a two-phase supply chain system of one supplier and multiple retailers under the condition that the demand time point and demand quantity of the retailers' customers are fixed and undetermined. When the system simulates the supplier to execute the inventory management, it firstly applies the supplement threshold after each supplement cycle to update the minimum inventory level, thus determining the collection of the supplement objects for the next supplement cycle and secondly determines the supplement quantity based on the upper limit of supplement. After that, it works out the optimal routine for supplement delivery and calculates the total cost of the inventory routing for the supplement cycle by linear programming method based on the analogue results. The study results indicate that the inventory management strategy combining the supplement-quantity-cut upper limit of supplement with the slightly-raised supplement threshold can effectively reduce the total cost, improve the vehicle utilization rate and maintain a certain service performance; moreover, it can delivery the supplement to more retailers during the supplement cycle to offer a new inventory routing management strategy for reference by the practitioner when making decisions.

  • Research Article
  • 10.30574/ijsra.2023.8.1.0137
Optimizing inventory management through reinforcement learning
  • Feb 28, 2023
  • International Journal of Science and Research Archive
  • Oluwatumininu Anne Ajayi

Inventory management remains a cornerstone of effective supply chain performance, directly influencing cost efficiency, service quality, and organizational agility. In today’s hypercompetitive and uncertain market environment, inventory decisions must account for complex variables such as fluctuating demand, supply disruptions, lead time variability, and market seasonality. Traditional inventory control models such as the Economic Order Quantity (EOQ), base-stock policies, and (s, S) strategies are often static in nature. They rely on pre-defined parameters and assume stationarity in demand and supply, limiting their ability to respond dynamically to real-time changes. In contrast, Reinforcement Learning (RL) offers a paradigm shift in how inventory decisions can be optimized. As a subfield of machine learning, RL enables agents to learn optimal strategies through repeated interactions with an environment, using trial-and-error exploration and reward-based feedback. RL agents can observe the system state (e.g., inventory levels, demand signals, lead time status), choose actions (e.g., place an order or wait), and receive feedback in the form of rewards (e.g., service level achievements or cost penalties), thus iteratively improving their policies. This study explores how RL can be applied to optimize inventory management in environments characterized by uncertainty and real-time decision-making needs. Specifically, we investigate how different RL algorithms such as Q-learning, Deep Q Networks (DQN), and Policy Gradient methods perform in various inventory scenarios. Additionally, we examine the computational and operational implications of deploying RL in real-world settings, including issues of model convergence, exploration-exploitation tradeoffs, data requirements, and scalability. We also discuss how RL can complement other AI techniques such as demand forecasting models and predictive analytics in creating end-to-end intelligent supply chain solutions. By bridging the gap between theoretical RL frameworks and practical inventory management applications, this paper contributes to both the academic literature and industrial practice. Our goal is to demonstrate that RL is not only a theoretically elegant solution but also a viable tool for achieving inventory efficiency and supply chain resilience.

  • Research Article
  • Cite Count Icon 4
  • 10.1080/23249935.2017.1352626
Minimum cost flow problem formulation for the static vehicle allocation problem with stochastic lane demand in truckload strategic planning
  • Jul 17, 2017
  • Transportmetrica A: Transport Science
  • Rodrigo Mesa-Arango + 1 more

ABSTRACTThis paper studies the complex strategic planning problem faced by truckload (TL) companies related to finding the allocation of vehicles serving lanes with stochastic demand within their networks. Lanes are origin–destination pairs associated with volumes of TLs per unit of time that follow discrete probability distributions. Finding strategic vehicle allocations that anticipate demand variability is critical because unrealized demands increase repositioning costs and reduce perceived revenues. The allocation problem is formulated as a two-stage stochastic program. The paper proposes a novel procedure based on network transformations that formulate the complex stochastic problem as a minimum cost-flow problem (MCFP), which can be efficiently solved with classic MCFP algorithms. Numerical experiments demonstrate the computational efficiencies of the method (instances with 1404 nodes and 98,904 arcs solved in 20.74 seconds) and reinforce the necessity of incorporating stochastic effects to avoid overestimation from deterministic approaches.

  • Research Article
  • Cite Count Icon 2
  • 10.1007/s10489-023-04647-9
Automated market maker inventory management with deep reinforcement learning
  • Jun 24, 2023
  • Applied Intelligence
  • Óscar Fernández Vicente + 2 more

Stock markets are the result of the interaction of multiple participants, and market makers are one of them. Their main goal is to provide liquidity and market depth to the stock market by streaming bids and offers at both sides of the order book, at different price levels. This activity allows the rest of the participants to have more available prices to buy or sell stocks. In the last years, reinforcement learning market maker agents have been able to be profitable. But profit is not the only measure to evaluate the quality of a market maker. Inventory management arises as a risk source that must be under control. In this paper, we focus on inventory risk management designing an adaptive reward function able to control inventory depending on designer preferences. To achieve this, we introduce two control coefficients, AIIF (Alpha Inventory Impact Factor) and DITF (Dynamic Inventory Threshold Factor), which modulate dynamically the behavior of the market maker agent according to its evolving liquidity with good results. In addition, we analyze the impact of these factors in the trading operative, detailing the underlying strategies performed by these intelligent agents in terms of operative, profitability and inventory management. Last, we present a comparison with other existing reward functions to illustrate the robustness of our approach.Graphic

  • Conference Article
  • Cite Count Icon 7
  • 10.1109/bigdataservice.2017.36
An Advanced Inventory Data Mining System for Business Intelligence
  • Apr 1, 2017
  • Qifeng Zhou + 5 more

Inventory management plays a critical role to track inventory levels, orders, and sales of the retailing business. Effective inventory management is a capability necessary to lead in the global marketplace. In the current retailing market, a huge amount of data regarding stocked items in inventory is generated and collected every day. Due to the increasing volume of transaction data and their correlated relations, it is often a non-trivial task to efficiently manage stocked goods, yet it is imperative to explore the underlying dependencies of the inventory items and give insights into implementing intelligent management systems. However, existing inventory management systems rely on statistical analysis of the historical inventory data, and have a limited capability of intelligent management. For example, they usually do not have the ability to forecast item demand and detect anomalous patterns of item inventory transactions. There is little work reported in implementing intelligent inventory management solutions to reveal hidden relations with integrated data-driven analysis. In this paper, we present an intelligent system, called iMiner, to facilitate managing enormous inventory data. We utilize distributed computing resources to process the huge volume of inventory data and incorporate the latest advances in data mining technologies. iMiner provides comprehensive support for conducting many inventory management tasks, such as forecasting inventory, detecting anomalous items, and analyzing inventory aging.

  • Research Article
  • Cite Count Icon 15
  • 10.1007/s00170-014-6569-x
A robust decision-making approach for p-hub median location problems based on two-stage stochastic programming and mean-variance theory: a real case study
  • Nov 25, 2014
  • The International Journal of Advanced Manufacturing Technology
  • Taher Ahmadi + 3 more

The stochastic location-allocation p-hub median problems are related to long-term decisions made in risky situations. Due to the importance of this type of problems in real-world applications, the authors were motivated to propose an approach to obtain more reliable policies in stochastic environments considering the decision makers’ preferences. Therefore, a systematic approach to make robust decisions for the single location-allocation p-hub median problem based on mean-variance theory and two-stage stochastic programming was developed. The approach involves three main phases, namely location modeling, risk modeling, and decision making, each including several steps. In the first phase, the pertinent location-allocation model of the problem is developed in the form of a two-stage stochastic model based on its deterministic version. A risk measure, based on total cost function and mean-variance theory, is derived in the second phase. Furthermore, two heterogeneous terms of the risk measure have been normalized and an innovative procedure has been proposed to significantly improve the calculation efficiency. In the third phase, the Pareto solution is obtained, the frontier curve is depicted to determine the decision maker’s risk aversion coefficient, and a robust policy is obtained through optimization based on decision makers’ preferences. Finally, a case study of an automobile part distribution system with stochastic demand is described to further illustrate our risk management and analysis approach.

  • Research Article
  • Cite Count Icon 26
  • 10.1016/j.trc.2018.07.018
Adaptive traffic signal control with equilibrium constraints under stochastic demand
  • Aug 4, 2018
  • Transportation Research Part C: Emerging Technologies
  • Wei Huang + 2 more

Adaptive traffic signal control with equilibrium constraints under stochastic demand

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 11
  • 10.3390/math9131572
A Multi-Depot Vehicle Routing Problem with Stochastic Road Capacity and Reduced Two-Stage Stochastic Integer Linear Programming Models for Rollout Algorithm
  • Jul 4, 2021
  • Mathematics
  • Wadi Khalid Anuar + 3 more

A matheuristic approach based on a reduced two-stage Stochastic Integer Linear Programming (SILP) model is presented. The proposed approach is suitable for obtaining a policy constructed dynamically on the go during the rollout algorithm. The rollout algorithm is part of the Approximate Dynamic Programming (ADP) lookahead solution approach for a Markov Decision Processes (MDP) framed Multi-Depot Dynamic Vehicle Routing Problem with Stochastic Road Capacity (MDDVRPSRC). First, a Deterministic Multi-Depot VRP with Road Capacity (D-MDVRPRC) is presented. Then an extension, MDVRPSRC-2S, is presented as an offline two-stage SILP model of the MDDVRPSRC. These models are validated using small simulated instances with CPLEX. Next, two reduced versions of the MDVRPSRC-2S model (MDVRPSRC-2S1 and MDVRPSRC-2S2) are derived. They have a specific task in routing: replenishment and delivering supplies. These reduced models are to be utilised interchangeably depending on the capacity of the vehicle, and repeatedly during the execution of rollout in reinforcement learning. As a result, it is shown that a base policy consisting of an exact optimal decision at each decision epoch can be obtained constructively through these reduced two-stage stochastic integer linear programming models. The results obtained from the resulting rollout policy with CPLEX execution during rollout are also presented to validate the reduced model and the matheuristic algorithm. This approach is proposed as a simple implementation when performing rollout for the lookahead approach in ADP.

  • Research Article
  • Cite Count Icon 4
  • 10.1088/1742-6596/1378/3/032072
Inventory replenishment in multi-stage production setting under stochastic demand: a review
  • Dec 1, 2019
  • Journal of Physics: Conference Series
  • O Ongbali Samson + 3 more

Inventory management is central to production planning and control particularly in multi-stage production environment where production output is stochastic and customer demand is also stochastic. Surplus inventory ties down money and stock-out situation result in loss of value and goodwill. Therefore, it is necessary determine optimal inventory policies for different manufacturing scenarios to maintain a balance between safety stock inventory and customer demand satisfaction at all time. Consequently, this review attempts to identify and document the underlying trends and most recent methods of inventory replenishment under stochastic demand with emphasis on multi-stage production setting. Prominent in literature among the models used to treat inventory problem in stochastic demand situation is “Approximation by Probabilistic Distribution”. Other models used include, Genetic Algorithm (GA), Just-in-time with Kanban simulation, Markov Process Decision, Fuzzy Inventory Model, Multi-stage inventory-queue model and Demand forecasting among others. It appears that there exist only approximate solutions than exact solutions in solving stochastic demand inventory problem suggesting that there is need for more work to be done in the area toward achieving exact solutions to the problem than approximation.

  • Research Article
  • Cite Count Icon 12
  • 10.1016/j.trb.2022.05.015
A stochastic optimization approach for the supply vessel planning problem under uncertain demand
  • Jun 21, 2022
  • Transportation Research Part B: Methodological
  • A.M.P Santos + 3 more

A stochastic optimization approach for the supply vessel planning problem under uncertain demand

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon