Abstract
In this paper we revise the theory of turnpikes in discounted Markov decision processes, prove the turnpike theorem for the undiscounted model, and apply the results to a specific random walk.
Keywords: turnpike, Markov decision process, discounted reward, average reward, random walk, stochastic knapsack problem