Abstract

We approach the continuous-time mean–variance portfolio selection problem with reinforcement learning (RL). The problem is to achieve the best trade-off between exploration and exploitation, and is formulated as an entropy-regularized, relaxed stochastic control problem. We prove that the optimal feedback policy for this problem must be Gaussian, with time-decaying variance. We then prove a policy improvement theorem, based on which we devise an implementable RL algorithm. We find that our algorithm and its variant outperform both traditional and deep neural network based algorithms in our simulation and empirical studies.
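
For intuition, the sketch below illustrates the kind of policy the abstract describes: a Gaussian feedback policy whose mean exploits the current wealth level and whose variance, scaled by the entropy-regularization weight, decays toward the end of the investment horizon. The specific functional forms, parameter names (rho, sigma, lam, w), and the simple Euler simulation are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def gaussian_policy(t, x, w, T, rho, sigma, lam):
    """Sample an allocation u from an illustrative Gaussian feedback policy.

    t     : current time
    x     : current (discounted) wealth
    w     : target-related constant (assumed, standing in for a Lagrange-multiplier term)
    T     : terminal time of the investment horizon
    rho   : Sharpe ratio of the risky asset, (mu - r) / sigma
    sigma : volatility of the risky asset
    lam   : exploration temperature (entropy-regularization weight)
    """
    mean = -(rho / sigma) * (x - w)                # exploitation: mean is linear in the wealth gap
    var = (lam / (2.0 * sigma ** 2)) * np.exp(rho ** 2 * (T - t))  # exploration: variance decays as t -> T
    return np.random.normal(mean, np.sqrt(var))

# Usage: simulate one wealth path under the exploratory policy (assumed Euler discretization
# of dx = sigma * u * (rho dt + dW) for the discounted wealth process).
T, n_steps = 1.0, 250
dt = T / n_steps
mu, r, sigma, lam, w = 0.08, 0.02, 0.2, 2.0, 1.5
rho = (mu - r) / sigma
x = 1.0
for k in range(n_steps):
    t = k * dt
    u = gaussian_policy(t, x, w, T, rho, sigma, lam)
    x += sigma * u * (rho * dt + np.sqrt(dt) * np.random.randn())
print("terminal wealth:", x)
```

Note that the exploration variance shrinks as t approaches T, so randomized exploration is front-loaded early in the horizon, which is the "time-decaying variance" property stated in the abstract.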
