An adaptive dual-level reinforcement learning approach for optimal trade execution

Soohan Kim, Jimyeong Kim, Hong Kee Sul, Youngjoon Hong

doi:10.1016/j.eswa.2024.124263

Abstract

This research aims to devise a trade execution strategy that uses reinforcement learning to closely track the daily cumulative volume-weighted average price (VWAP), that is, to minimize the deviation from it. Previous studies often implement their models over a relatively short trading horizon, which makes it difficult to track the daily cumulative VWAP accurately because stock price movement within such a horizon is often insignificant. On the other hand, training reinforcement learning models directly over a longer, daily horizon is burdensome because of the extensive sequence length. Hence, there is a need for a method that divides the long daily horizon into smaller, more manageable segments. We propose a method that leverages the U-shaped pattern of intraday stock trade volumes and uses Proximal Policy Optimization (PPO) as the learning algorithm. Our method follows a dual-level approach: a Transformer model captures the overall (global) U-shaped distribution of daily volumes, and an LSTM model handles the distribution of orders within smaller (local) time intervals. Our experimental results suggest that this dual-level architecture improves cumulative VWAP tracking accuracy compared with previous reinforcement learning approaches. The key finding is that explicitly accounting for the U-shaped intraday volume pattern leads to a better approximation of the cumulative daily VWAP. This has implications for developing trading strategies that need to track VWAP efficiently over a full trading day.
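To make the quantities in the abstract concrete, the following is a minimal Python sketch of cumulative VWAP, a static U-shaped volume profile, and a tracking-deviation measure. It is not the authors' Transformer/LSTM/PPO method. The function names, the quadratic form of the U-shape, the `depth` parameter, and the basis-point deviation measure are all assumptions made for illustration only.

```python
import numpy as np

def cumulative_vwap(prices, volumes):
    """Running VWAP after each bar: sum(p_i * v_i) / sum(v_i)."""
    prices = np.asarray(prices, dtype=float)
    volumes = np.asarray(volumes, dtype=float)
    return np.cumsum(prices * volumes) / np.cumsum(volumes)

def u_shaped_weights(n_bars, depth=0.7):
    """Hypothetical U-shaped profile (assumption): a quadratic that is
    heavier at the open and close than at midday, normalized to sum to 1."""
    x = np.linspace(-1.0, 1.0, n_bars)
    raw = (1.0 - depth) + depth * x**2
    return raw / raw.sum()

def vwap_deviation_bps(exec_prices, exec_shares, market_prices, market_volumes):
    """Gap between the execution VWAP and the full-day market VWAP,
    in basis points (an illustrative measure, not the paper's metric)."""
    exec_prices = np.asarray(exec_prices, dtype=float)
    exec_shares = np.asarray(exec_shares, dtype=float)
    exec_vwap = np.sum(exec_prices * exec_shares) / np.sum(exec_shares)
    market_vwap = cumulative_vwap(market_prices, market_volumes)[-1]
    return 1e4 * (exec_vwap - market_vwap) / market_vwap

# Example: split a 10,000-share parent order across 5 intraday bars.
weights = u_shaped_weights(5)
child_orders = 10_000 * weights
```

In this sketch, a schedule whose child orders are exactly proportional to the realized market volume, filled at the same bar prices, has zero deviation. A learned policy such as the one the abstract describes would instead have to anticipate the volume profile before it is realized.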

More From: Expert Systems With Applications

Similar Papers

Practical Application of Deep Reinforcement Learning to Optimal Trade Execution
Woo Jae Byun ... Seongmin Kim
FinTech | VOL. 2
29 Jun 2023

Reinforcement Learning Versus PDE Backstepping and PI Control for Congested Freeway Traffic
Huan Yu ... Alexandre Bayen
IEEE Transactions on Control Systems Technology | VOL. 30
01 Jul 2022

Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples
Zhe Xu ... Ufuk Topcu
01 Jan 2020

An optimal control strategy for execution of large stock orders using long short-term memory networks
Andrew Papanicolaou ... Hau Fu
Journal of Computational Finance
01 Jan 2023
