Abstract
We study Markov decision processes with Borel state spaces under quasi-hyperbolic discounting. This type of discounting models human behaviour that is time-inconsistent in the long run. The decision maker's preferences change over time, so the standard approach based on the Bellman optimality principle fails. Within a dynamic game-theoretic framework, we prove the existence of randomised stationary Markov perfect equilibria for a large class of Markov decision processes with transitions having a density function. We also show that randomisation can be restricted to two actions in every state of the process. Moreover, we prove that under some conditions this equilibrium can be replaced by a deterministic one. For models with countable state spaces, we establish the existence of deterministic Markov perfect equilibria. Many examples are given to illustrate our results, including a portfolio selection model with quasi-hyperbolic discounting.
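For readers unfamiliar with the criterion, the following is a minimal sketch of the standard quasi-hyperbolic (beta-delta) discounting of Phelps and Pollak; the symbols u, beta, delta and the notation for states and actions are generic here and need not match the paper's. The self acting at time t evaluates a stream of period utilities as
\[
U_t \;=\; \mathbb{E}\Big[\, u(x_t,a_t) \;+\; \beta \sum_{k=1}^{\infty} \delta^{k}\, u(x_{t+k},a_{t+k}) \Big], \qquad \beta \in (0,1],\ \delta \in (0,1).
\]
When \(\beta < 1\), the trade-off between today and tomorrow is discounted more heavily than the trade-off between any two consecutive future periods, so the preferences of successive selves disagree and the Bellman optimality principle no longer applies.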
Highlights
The discounted utility approach in dynamic decision making has been used since the beginning of modern economic theory; see e.g. Samuelson [59].
The strategy for the decision maker built from a Markov perfect equilibrium in the game is time-consistent, that is, no self has an incentive to change his best response to the equilibrium strategies of the following selves.
We have studied a fairly general class of time-inconsistent Markov decision processes with a Borel state space.
Summary
The discounted utility approach in dynamic decision making has been used since the beginning of modern economic theory; see e.g. Samuelson [59]. Alj and Haurie [4] extended the finite state space model of Shapley to quasi-hyperbolic discounting. They used the intergenerational dynamic game formulation of Phelps and Pollak [55] and proved that every finite horizon game has an equilibrium in Markovian strategies and that every infinite horizon game has a stationary Markov perfect equilibrium. As already mentioned, time-inconsistent preferences in various control models were recently studied by Björk and Murgoci [15], Björk et al. [14], and Christensen and Lindensjö [20]. In contrast to our present work and the works of Alj and Haurie [4], Jaskiewicz and Nowak [35], and Nowak [51], these papers examine neither stationary Markov perfect equilibria nor fixed points of best-response mappings. We consider Markov decision processes with a Borel state space and quasi-hyperbolic discounting, and we adopt the Markov perfect equilibrium as the basic solution concept.
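As a hedged illustration of the equilibrium concept (a sketch under generic notation, not the paper's exact statement), suppose the process has transition kernel q, one-period utility u, and admissible action sets A(x). A stationary policy \(\pi^*\) is a Markov perfect equilibrium if, in every state x, the current self's action is a best response given that all future selves use \(\pi^*\):
\[
\pi^*(x) \;\in\; \arg\max_{a \in A(x)} \Big\{ u(x,a) + \beta\delta \int_X w_{\pi^*}(y)\, q(dy \mid x,a) \Big\},
\]
where \(w_{\pi^*}\) is the delta-discounted continuation value of following \(\pi^*\) from the next period on, i.e. the solution of
\[
w_{\pi^*}(x) \;=\; u\big(x,\pi^*(x)\big) + \delta \int_X w_{\pi^*}(y)\, q\big(dy \mid x,\pi^*(x)\big).
\]
Such a policy is time-consistent in the sense quoted above: no self gains by deviating unilaterally from the equilibrium strategies of the following selves.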