Addressing environment non-stationarity by repeating Q-learning updates

Sherief Abdallah, Michael Kaisers

doi:10.5555/2946645.2946691

Journal: Journal of Machine Learning Research (JMLR)
Publication Date: Jan 1, 2016
Citations: 54

#Policies In Markov Decision Processes #Markov Decision Processes

Abstract

Q-learning (QL) is a popular reinforcement learning algorithm that is guaranteed to converge to optimal policies in Markov decision processes. However, QL exhibits an artifact: in expectation, the ...
