Abstract

Q-learning, originally an incremental algorithm for estimating an optimal decision strategy in an infinite-horizon decision problem, now refers to a general class of reinforcement learning methods widely used in statistics and artificial intelligence. In the context of personalized medicine, finite-horizon Q-learning is the workhorse for estimating optimal treatment strategies, known as treatment regimes. Infinite-horizon Q-learning is also increasingly relevant in the growing field of mobile health. In computer science, Q-learning methods have achieved remarkable performance in domains such as game-playing and robotics. In this article, we (a) review the history of Q-learning in computer science and statistics, (b) formalize finite-horizon Q-learning within the potential outcomes framework and discuss the inferential difficulties for which it is infamous, and (c) review variants of infinite-horizon Q-learning and the exploration-exploitation problem, which arises in decision problems with a long time horizon. We close by discussing issues arising with the use of Q-learning in practice, including arguments for combining Q-learning with direct-search methods; sample size considerations for sequential, multiple assignment randomized trials; and possibilities for combining Q-learning with model-based methods.
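To make the incremental update at the heart of Q-learning concrete, the following is a minimal sketch of tabular Q-learning with an epsilon-greedy rule, which trades off exploration against exploitation as discussed in the abstract. The toy chain environment, step sizes, and episode count here are illustrative assumptions, not taken from the article:

```python
import random

def step(state, action):
    """Hypothetical toy chain MDP: states 0..3, state 3 terminal.
    Action 1 moves right, action 0 moves left (bounded at 0).
    Reward 1.0 is earned only on entering the terminal state."""
    nxt = min(state + 1, 3) if action == 1 else max(state - 1, 0)
    reward = 1.0 if nxt == 3 else 0.0
    return nxt, reward, nxt == 3

def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(4)]  # Q[state][action]
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            # Epsilon-greedy: explore with probability epsilon,
            # otherwise exploit the current Q estimates.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = max((0, 1), key=lambda x: Q[s][x])
            s2, r, done = step(s, a)
            # Incremental update toward the Bellman target:
            # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            target = r if done else r + gamma * max(Q[s2])
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
    return Q

Q = q_learning()
# The greedy policy derived from Q should move right in every state.
policy = [max((0, 1), key=lambda a: Q[s][a]) for s in range(3)]
```

In this sketch the estimated action-value for the state adjacent to the terminal state converges toward the terminal reward of 1.0, and values for earlier states are discounted by powers of `gamma`, matching the infinite-horizon formulation the article reviews.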
