TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.

Yebin Tao, Lu Wang, Daniel Almirall

doi:10.1214/18-aoas1137

Abstract

Dynamic treatment regimes (DTRs) are sequences of treatment decision rules, in which treatment may be adapted over time in response to the changing course of an individual. Motivated by the substance use disorder (SUD) study, we propose a tree-based reinforcement learning (T-RL) method to directly estimate optimal DTRs in a multi-stage multi-treatment setting. At each stage, T-RL builds an unsupervised decision tree that directly handles the problem of optimization with multiple treatment comparisons, through a purity measure constructed with augmented inverse probability weighted estimators. For the multiple stages, the algorithm is implemented recursively using backward induction. By combining semiparametric regression with flexible tree-based learning, T-RL is robust, efficient and easy to interpret for the identification of optimal DTRs, as shown in the simulation studies. With the proposed method, we identify dynamic SUD treatment regimes for adolescents.
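The single-stage idea in the abstract, scoring a candidate tree split with augmented inverse probability weighted (AIPW) estimators across more than one treatment, can be illustrated with a small sketch. This is a simplified illustration under stated assumptions, not the authors' implementation: the function names `aipw_scores` and `split_purity`, the toy data, the constant propensities and the constant outcome-model predictions are all hypothetical. In practice the propensity and outcome models would be estimated from data, and the multi-stage case would repeat this step backward from the last stage.

```python
from itertools import product


def aipw_scores(y, a, propensity, outcome_model, treatments):
    """Per-subject AIPW estimates of the mean outcome under each treatment.

    propensity[i][t]    : assumed P(A = t | covariates of subject i)
    outcome_model[i][t] : assumed model prediction of E[Y | A = t, covariates]
    """
    scores = []
    for i in range(len(y)):
        row = {}
        for t in treatments:
            # Inverse probability weight; zero when subject i did not receive t.
            w = (1.0 if a[i] == t else 0.0) / propensity[i][t]
            # Weighted outcome plus the augmentation term from the outcome model.
            row[t] = w * y[i] + (1.0 - w) * outcome_model[i][t]
        scores.append(row)
    return scores


def split_purity(scores, in_left, treatments):
    """Best average AIPW value when each child node gets one treatment.

    Returns (value, treatment_for_left_child, treatment_for_right_child).
    """
    n = len(scores)
    best = None
    for t_left, t_right in product(treatments, repeat=2):
        total = sum(
            s[t_left] if left else s[t_right]
            for s, left in zip(scores, in_left)
        )
        value = total / n
        if best is None or value > best[0]:
            best = (value, t_left, t_right)
    return best


# Hypothetical toy data: treatment 0 helps when x < 0, treatment 1 when x > 0.
x = [-1, -1, 1, 1]
a = [0, 1, 0, 1]
y = [1.0, 0.0, 0.0, 1.0]
treatments = [0, 1]
propensity = [{0: 0.5, 1: 0.5} for _ in x]      # assumed randomized 1:1
outcome_model = [{0: 0.5, 1: 0.5} for _ in x]   # assumed constant predictions

scores = aipw_scores(y, a, propensity, outcome_model, treatments)
best = split_purity(scores, [xi < 0 for xi in x], treatments)
print(best)
```

A full tree-growing routine would evaluate `split_purity` over many candidate covariates and cut points and keep the split with the largest value; the sketch evaluates only the single split `x < 0`.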

Journal: The Annals of Applied Statistics
Publication Date: Sep 1, 2018
Citations: 35

Similar Papers

Response to Reader Reaction
Baqun Zhang ... Eric B Laber
Biometrics | VOL. 71
29 Oct 2014

Predictive Bayesian inference and dynamic treatment regimes.
Olli Saarela ... Erica E M Moodie
Biometrical Journal | VOL. 57
11 Aug 2015

Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.
Yebin Tao ... Lu Wang
Biometrics | VOL. 73
23 May 2016

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
Soroush Saghafian
Management Science
04 Oct 2023
