Abstract

Synchronous reinforcement learning (RL) algorithms with linear function approximation can be represented as inhomogeneous matrix iterations of a special form (Schoknecht & Merke, 2003). In this paper we state conditions of convergence for general inhomogeneous matrix iterations and prove that they are both necessary and sufficient. This result extends the work of Schoknecht and Merke (2003), where only a sufficient condition of convergence was proved. Because the new condition is necessary and sufficient, it can be used to establish both convergence and divergence of RL algorithms with function approximation. We apply the theorem to obtain a new, concise proof of convergence for the synchronous residual gradient algorithm (Baird, 1995). Moreover, we derive a counterexample for which the uniform RL algorithm (Merke & Schoknecht, 2002) diverges. This yields a negative answer to the open question of whether the uniform RL algorithm converges for arbitrary multiple transitions.
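For intuition, the iterations in question have the affine form x_{k+1} = A x_k + b. The following is a minimal numerical sketch, assuming this standard form; the particular A and b are invented for illustration, and the spectral-radius test shown is only the classical sufficient check for convergence, not the paper's full necessary-and-sufficient condition (which also treats eigenvalues on the unit circle).

```python
import numpy as np

# Illustrative affine iteration x_{k+1} = A x_k + b.
# A and b are arbitrary example values, not taken from the paper.
A = np.array([[0.5, 0.2],
              [0.1, 0.4]])
b = np.array([1.0, -1.0])

# Classical sufficient check: spectral radius of A below 1
# guarantees convergence from any x_0 to x* = (I - A)^{-1} b.
rho = max(abs(np.linalg.eigvals(A)))
print(f"spectral radius = {rho:.3f}")  # < 1 for this example

x = np.zeros(2)
for _ in range(100):
    x = A @ x + b

x_star = np.linalg.solve(np.eye(2) - A, b)
print(x, x_star)  # the iterate approaches the fixed point x*
```

Replacing A with a matrix whose spectral radius exceeds 1 (e.g. scaling A by 3) makes the same loop diverge, which is the kind of behavior the paper's counterexample exhibits for the uniform RL algorithm.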
