Abstract
In a recent work, the authors considered a finite state Markov ratio decision process in which the objective was to maximize the ratio of total discounted rewards. In this paper, discounted Markov ratio decision processes are generalized to discounted stochastic ratio games. These may also be viewed as generalizations of ratio games to a stochastic context where the payoff is the ratio of the two total discounted rewards. We show that in the discounted stochastic ratio game the players have stationary optimal strategies with a unique value. The solution may depend on the initial probability distribution. We also provide a convergent algorithm.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.