Abstract

Markov chains are important tools for stochastic modeling across the mathematical sciences. The first section of this article surveys the basic notions of discrete-time Markov chains on finite state spaces, together with several illustrative examples. Markov decision processes (MDPs), also known as stochastic dynamic programming or discrete-time stochastic control, are useful for decision making under uncertainty. The second section presents a simple formulation of MDPs with finite state and action spaces, and describes two important algorithms for solving them, value iteration and policy iteration, illustrated with an example based on the iPod shuffle.
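To make the abstract's mention of value iteration concrete, the following is a minimal sketch of the algorithm for a finite MDP. The 2-state, 2-action problem (transition matrices `P`, rewards `R`, and discount `gamma`) is a toy example invented here for illustration, not taken from the article.

```python
import numpy as np

n_states, n_actions = 2, 2

# P[a][s, t] = probability of moving from state s to state t under action a
# (illustrative numbers, not from the article)
P = np.array([
    [[0.9, 0.1],
     [0.4, 0.6]],   # action 0
    [[0.2, 0.8],
     [0.7, 0.3]],   # action 1
])

# R[s, a] = expected immediate reward for taking action a in state s
R = np.array([
    [1.0, 0.0],
    [0.0, 2.0],
])

gamma = 0.9            # discount factor
V = np.zeros(n_states)  # initial value function

# Repeatedly apply the Bellman optimality operator until the
# value function stops changing (up to a small tolerance).
for _ in range(10_000):
    # Q[s, a] = R[s, a] + gamma * sum_t P[a][s, t] * V[t]
    Q = R + gamma * np.einsum('ast,t->sa', P, V)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-10:
        V = V_new
        break
    V = V_new

# The greedy policy with respect to the converged values is optimal.
policy = Q.argmax(axis=1)
print("V* =", V, "policy =", policy)
```

Policy iteration, the other algorithm named in the abstract, instead alternates exact policy evaluation (solving a linear system) with greedy policy improvement; both methods converge for finite MDPs with a discount factor below one.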
