From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization.

Zitao Song,Yining Wang,Pin Qian,Frans Coenen,Zhengyong Jiang,Jionglong Su,Sifan Song

doi:10.1007/s10489-022-04217-5

Abstract

As a fundamental problem in algorithmic trading, portfolio optimization aims to maximize the cumulative return by continuously investing in various financial derivatives within a given time period. Recent years have witnessed the transformation from traditional machine learning trading algorithms to reinforcement learning algorithms due to their superior nature of sequential decision making. However, the exponential growth of the imperfect and noisy financial data that is supposedly leveraged by the deterministic strategy in reinforcement learning, makes it increasingly challenging for one to continuously obtain a profitable portfolio. Thus, in this work, we first reconstruct several deterministic and stochastic reinforcement algorithms as benchmarks. On this basis, we introduce a risk-aware reward function to balance the risk and return. Importantly, we propose a novel interpretable stochastic reinforcement learning framework which tailors a stochastic policy parameterized by Gaussian Mixtures and a distributional critic realized by quantiles for the problem of portfolio optimization. In our experiment, the proposed algorithm demonstrates its superior performance on U.S. market stocks with a 63.1% annual rate of return while at the same time reducing the market value max drawdown by 10% when back-testing during the stock market crash around March 2020.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization.

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Journal: Applied Intelligence	Publication Date: Nov 11, 2022
Citations: 7

Similar Papers

Reinforcement learning-based portfolio optimization with deterministic state transition
Guangle Song ... Chaoran Cui
Information Sciences | VOL. 690
Guangle Song, et. al.Guangle Song ... Chaoran Cui
09 Oct 2024
Information Sciences | VOL. 690

A new hybrid method of recurrent reinforcement learning and BiLSTM for algorithmic trading
Yuling Huang ... Yunlin Song
Journal of Intelligent & Fuzzy Systems | VOL. 45
Yuling Huang, et. al.Yuling Huang ... Yunlin Song
01 Aug 2023
Journal of Intelligent & Fuzzy Systems | VOL. 45

Research on Portfolio Optimization Models Using Deep Deterministic Policy Gradient
Li Wei ... Zhang Weiwei
-
Li Wei, et. al.Li Wei ... Zhang Weiwei
01 Nov 2020
01 Nov 2020

Portfolio Optimization Under the Framework of Reinforcement Learning
Li Xucheng ... Peng Zhihao
-
Li Xucheng, et. al.Li Xucheng ... Peng Zhihao
01 Apr 2019
01 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization.

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence