Risk-Sensitive Portfolio Management by Using C51 Algorithm

Thammasorn Harnpadungkij,Warasinee Chaisangmongkon,Phond Phunchongharn

doi:10.12982/cmjs.2022.094

Abstract

Financial trading is one of the most popular problems for reinforcement learning in recent years. One of the important challenges is that investment is a multi-objective problem. That is, professional investors do not act solely on expected profi t but also carefully consider the potential risk of a given investment. To handle such a challenge, previous studies have explored various kinds of risk-sensitive rewards, for example, the Sharpe ratio as computed by a fi xed length of previous returns. This work proposes a new approach to deal with the profi t-to-risk tradeoff by applying distributional reinforcement learning to build a risk awareness policy instead of a simple risk-based reward function. Our new policy, termed C51-Sharpe, is to select the action based on the Sharpe ratio computed from the probability mass function of the return. This produces a signifi cantly higher Sharpe ratio and lower maximum drawdown without sacrifi cing profi t compared to the C51algorithm utilizing a purely profi t-based policy. Moreover, it can outperform other benchmarks, such as a Deep Q-Network (DQN) with a Sharpe ratio reward function. Besides the policy, we also studied the effect of using double networks and the choice of exploration strategies with our approach to identify the optimal training confi guration. We fi nd that the epsilon-greedy policy is the most suitable exploration for C51-Sharpe and that the use of double network has no signifi cant impact on performance. Our study provides statistical evidence of the effi ciency in risk-sensitive policy implemented by using distributional reinforcement algorithms along with an optimized training process.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Risk-Sensitive Portfolio Management by Using C51 Algorithm

Abstract

Talk to us

Similar Papers

More From: Chiang Mai Journal of Science

Lead the way for us

Similar Papers

Modeling Low-risk Actions from Multivariate Time Series Data Using Distributional Reinforcement Learning
Yosuke Sato ... Jianwei Zhang
-
Yosuke Sato, et. al.Yosuke Sato ... Jianwei Zhang
28 Sep 2020
28 Sep 2020

Sharpening Sharpe Ratios
William Goetzmann ... Ivo Welch
-
William Goetzmann, et. al.William Goetzmann ... Ivo Welch
01 Aug 2002
01 Aug 2002

Statistical Inference for Sharpe Ratio
Friedrich Schmid ... Rafael Schmidt
-
Friedrich Schmid, et. al.Friedrich Schmid ... Rafael Schmidt
01 Jan 2009
01 Jan 2009

Effect of Booms and Busts on the Sharpe Ratio
Ziemowit Bednarek ... Pratish Patel
The Journal of Portfolio Management | VOL. 43
Ziemowit Bednarek, et. al.Ziemowit Bednarek ... Pratish Patel
31 Jan 2017
The Journal of Portfolio Management | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Risk-Sensitive Portfolio Management by Using C51 Algorithm

Abstract

Talk to us

Similar Papers

More From: Chiang Mai Journal of Science