Learning equilibrium mean‐variance strategy

Min Dai,Yuchao Dong,Yanwei Jia

doi:10.1111/mafi.12402

Abstract

AbstractWe study a dynamic mean‐variance portfolio optimization problem under the reinforcement learning framework, where an entropy regularizer is introduced to induce exploration. Due to the time–inconsistency involved in a mean‐variance criterion, we aim to learn an equilibrium policy. Under an incomplete market setting, we obtain a semi‐analytical, exploratory, equilibrium mean‐variance policy that turns out to follow a Gaussian distribution. We then focus on a Gaussian mean return model and propose a reinforcement learning algorithm to find the equilibrium policy. Thanks to a thoroughly designed policy iteration procedure in our algorithm, we prove the convergence of our algorithm under mild conditions, despite that dynamic programming principle and the usual policy improvement theorem failing to hold for an equilibrium policy. Numerical experiments are given to demonstrate our algorithm. The design and implementation of our reinforcement learning algorithm apply to a general market setup.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning equilibrium mean‐variance strategy

Abstract

Talk to us

Similar Papers

More From: Mathematical Finance

Lead the way for us

Journal: Mathematical Finance	Publication Date: Jun 4, 2023
Citations: 6

Similar Papers

Learning Equilibrium Mean-Variance Strategy
Min Dai ... Yanwei Jia
SSRN Electronic Journal | VOL. -
Min Dai, et. al.Min Dai ... Yanwei Jia
12 Mar 2021
SSRN Electronic Journal | VOL. -

OpenGraphGym: A Parallel Reinforcement Learning Framework for Graph Optimization Problems
Weijian Zheng ... Fengguang Song
-
Weijian Zheng, et. al.Weijian Zheng ... Fengguang Song
01 Jan 2020
01 Jan 2020

Reinforcement learning with algorithms from probabilistic structure estimation
Jonathan P Epperlein ... Robert Shorten
Automatica | VOL. 144
Jonathan P Epperlein, et. al.Jonathan P Epperlein ... Robert Shorten
06 Aug 2022
Automatica | VOL. 144

A Neural Network Based Automatic Generation Controller Design through Reinforcement Learning
...
International Journal of Emerging Electric Power Systems | VOL. 6
, et. al. ...
20 May 2006
International Journal of Emerging Electric Power Systems | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning equilibrium mean‐variance strategy

Abstract

Talk to us

Similar Papers

More From: Mathematical Finance