Abstract

We consider the problem of minimizing the long-run average expected cost per unit time in a semi-Markov decision process with arbitrary state and action spaces. Using the idea of successive approximations, we give sufficient conditions for the existence of an optimal stationary policy. These results are then applied to solve the replacement problem with a semi-Markov shock model.
