Abstract

The paper deals with a class of semi-Markov control models with Borel state and control spaces, possibly unbounded costs, and unknown holding times distribution H. Assuming that H does not depend on state-action pairs, we combine suitable methods of statistical estimation of H with control procedures to construct an asymptotically discounted optimal policy Open image in new window and an optimal stationary policy { f∞}, where fn converges to f∞ in the sense of Schal [12].

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call