Abstract
Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with enormous state spaces. However, most analyses of such algorithms yield error bounds that involve either the number of states or the number of features. This paper considers the setting where the function approximation is carried out with either the kernel method or a two-layer neural network model, in the context of a fitted Q-iteration algorithm with explicit regularization. Using $Hn$ samples, where $H$ is the length of each episode and $|\mathcal{A}|$ is the size of the action space, we establish an $\tilde{O}(H^3|\mathcal{A}|^{\frac14}n^{-\frac14})$ bound on the suboptimality of the learned policy relative to the optimal policy. Our analysis hinges on bounding the $L^2$ error of the approximated Q-function estimated from $n$ data points. Although this result still requires a finite action space, the error bound is independent of the dimensionality of the state space.
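For concreteness, the sketch below shows one natural instantiation of regularized fitted Q-iteration of the kind the abstract describes: backward induction over a finite horizon $H$, with one kernel ridge regression per action as the function approximator. It is a minimal illustration, not the paper's exact algorithm; the Gaussian kernel, the data layout, and the names `rbf_kernel` and `fitted_q_iteration` are assumptions made here for the example.

```python
import numpy as np

def rbf_kernel(X, Y, bandwidth=1.0):
    """Gaussian RBF kernel matrix between the rows of X and Y."""
    sq = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq / (2 * bandwidth ** 2))

def fitted_q_iteration(data, H, n_actions, lam=1e-2):
    """Fitted Q-iteration with explicit ridge regularization (illustrative).

    data[h] = (S, A, R, S2): n transitions collected at step h, where
      S  : (n, d) states      A  : (n,) integer actions
      R  : (n,) rewards       S2 : (n, d) next states
    Returns per-step Q-functions; Q_funcs[h](states) -> (m, n_actions).
    """
    Q_next = None
    Q_funcs = [None] * H
    for h in reversed(range(H)):          # backward induction over the episode
        S, A, R, S2 = data[h]
        # Bellman targets: r + max_a Q_{h+1}(s', a); zero beyond the horizon.
        y = R.copy() if Q_next is None else R + Q_next(S2).max(axis=1)
        # One kernel ridge regression per action (finite action space |A|).
        alphas, supports = [], []
        for a in range(n_actions):
            Sa, ya = S[A == a], y[A == a]
            K = rbf_kernel(Sa, Sa)
            # lam is the explicit regularization controlling the L^2 error.
            alpha = np.linalg.solve(K + lam * len(Sa) * np.eye(len(Sa)), ya)
            alphas.append(alpha)
            supports.append(Sa)
        def Q(states, alphas=alphas, supports=supports):
            return np.stack(
                [rbf_kernel(states, Sa) @ a for a, Sa in zip(alphas, supports)],
                axis=1)
        Q_funcs[h] = Q
        Q_next = Q
    return Q_funcs
```

Under this reading, the greedy policy at step $h$ is $\pi_h(s) = \arg\max_a Q_h(s, a)$, and the regularization parameter `lam` plays the role of the explicit regularizer whose choice drives the $L^2$ error analysis.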