Abstract
We consider the problem of minimizing the long-run average expected cost per unit time in a semi-Markov decision process with arbitrary state and action space. Using the idea of successive approximations, sufficient conditions for the existence of an optimal stationary policy are given. These results are applied to solve the replacement problem with a semi-Markov shock model.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have