A stability criterion for two timescale stochastic approximation schemes

Chandrashekar Lakshminarayanan,Shalabh Bhatnagar

doi:10.1016/j.automatica.2016.12.014

A stability criterion for two timescale stochastic approximation schemes

Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

https://doi.org/10.1016/j.automatica.2016.12.014

Copy DOI

Export

Save

Cite

Journal: Automatica	Publication Date: Mar 6, 2017
Citations: 24

Affiliation: Indian Institute of Science Bangalore

#Stochastic Approximation Schemes #Stochastic Schemes #Ordinary Differential Equation #Ordinary Differential Equation Method #Reinforcement Learning #Differential Equation Method #Sufficient Conditions #Two-timescale Stochastic Approximation #Timescale Stochastic Approximation #Stochastic Approximation

Abstract
Full-Text
Similar Papers

Abstract

Listen

We present the first sufficient conditions that guarantee stability of two-timescale stochastic approximation schemes. Our analysis is based on the ordinary differential equation (ODE) method and is an extension of the results in Borkar and Meyn (2000) for single-timescale schemes. As an application of our result, we show the stability of iterates in a two-timescale stochastic approximation scheme arising in reinforcement learning.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Automatica

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

A stability criterion for two timescale stochastic approximation schemes