Abstract

In this paper we revise the theory of turnpikes in discounted Markov decision processes, prove the turnpike theorem for the undiscounted model, and apply the results to the specific random walk.KeywordsturnpikeMarkov decision processdiscounted rewardaverage rewardrandom walkstochastic knapsack problem

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call