Adaptive Execution: Exploration and Learning of Price Impact

Beomsoo Park, Benjamin Van Roy

doi:10.2139/ssrn.2118111

Abstract

We consider a model in which a trader aims to maximize expected risk-adjusted profit while trading a single security. In our model, each price change is a linear combination of observed factors, impact resulting from the trader’s current and prior activity, and unpredictable random effects. The trader must learn coefficients of a price impact model while trading. We propose a new method for simultaneous execution and learning – the confidence-triggered regularized adaptive certainty equivalent (CTRACE) policy – and establish a poly-logarithmic finite-time expected regret bound. This bound implies that CTRACE is efficient in the sense that the (ε,δ)-convergence time is bounded by a polynomial function of 1/ε and log(1/δ) with high probability. In addition, we demonstrate via Monte Carlo simulation that CTRACE outperforms the certainty equivalent policy and a recently proposed reinforcement learning algorithm that is designed to explore efficiently in linear-quadratic control problems.
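To make the modeling idea concrete, here is a minimal sketch, not the CTRACE policy itself. It assumes a deliberately simplified version of the price model in which each price change is a linear function of the current trade and two observed factors plus Gaussian noise (impact from prior trades is omitted), and it recovers the coefficients with regularized least squares. The function names, parameter values, and the regularization weight `reg` are all hypothetical choices made for illustration.

```python
import numpy as np

def simulate_price_changes(trades, factors, impact, loadings, noise_std, rng):
    """Simplified model (assumption): dp_t = impact * u_t + f_t . loadings + noise_t."""
    noise = rng.normal(0.0, noise_std, size=len(trades))
    return impact * trades + factors @ loadings + noise

def ridge_estimate(trades, factors, price_changes, reg=1.0):
    """Regularized least-squares estimate of [impact, factor loadings]."""
    X = np.column_stack([trades, factors])          # one row of regressors per period
    A = X.T @ X + reg * np.eye(X.shape[1])          # regularized Gram matrix
    return np.linalg.solve(A, X.T @ price_changes)  # solve the normal equations

rng = np.random.default_rng(0)
T = 5000                                            # number of trading periods
trades = rng.normal(0.0, 1.0, size=T)               # hypothetical trade sizes
factors = rng.normal(0.0, 1.0, size=(T, 2))         # hypothetical observed factors
true_theta = np.array([0.5, 0.2, -0.1])             # [impact, loading 1, loading 2]

dp = simulate_price_changes(trades, factors, true_theta[0], true_theta[1:], 0.5, rng)
theta_hat = ridge_estimate(trades, factors, dp, reg=1.0)
print("estimated coefficients:", theta_hat)
```

In this sketch the trades are drawn at random, so the regressors are informative by construction. The abstract's setting is harder because the trader chooses the trades, which is why exploration matters there.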

Journal: SSRN Electronic Journal	Publication Date: Jul 27, 2012
Citations: 3

Similar Papers

Adaptive Execution: Exploration and Learning of Price Impact
Beomsoo Park ... Benjamin Van Roy
Operations Research | VOL. 63
01 Oct 2015

Certainty Equivalent Pricing Under Sales-Dependent and Inventory-Dependent Demand
Hyun-Soo Ahn ... Mengzhenyu Zhang
SSRN Electronic Journal
01 Jul 2021

Carbon risk and optimal retrofitting in cement plants: An application of stochastic modelling, MonteCarlo simulation and Real Options Analysis
Luis M Abadie ... Ibon Galarraga
Journal of Cleaner Production | VOL. 142
28 Oct 2016

Three-State Opinion Formation Model on Adaptive Networks and Time to Consensus
Degang Wu ... Kwok Yip Szeto
01 Jan 2014