Abstract

This paper studies the problem of integrated lot-sizing and maintenance decision making in case of multiple products and stochastic demand. The problem is formulated as a Markov decision process, in which the goal is to find a joint production and maintenance policy that minimizes the long run expected total discounted cost. Therefore, the classic Q-learning algorithm is adopted, and a decomposition-based approximate Q-value heuristic is developed to obtain near-optimal solutions in a reasonable time. To accelerate the convergence of the Q-learning algorithm, a hybrid Q-learning method is proposed in which the Q-values are initiated by the output of the decomposition-based approximate Q-value heuristic. The numerical experiments reveal that the approximate Q-value heuristic is outperformed by the classic and hybrid Q-learning algorithms in terms of accuracy and that the hybrid Q-learning method converges much faster than the classic Q-learning method. However, these so-called tabular methods do not scale to larger problems with more than four products. Hence, based on the problem structure, three state aggregation schemes are developed and applied to the Q-learning algorithm to solve the large-scale problems. The numerical study demonstrates that Q-learning with the third state aggregation scheme performs nearly as good as the hybrid Q-learning method while significantly reducing the computational time and being scalable to large-scale problems.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.