Comment on "Dynamic treatment regimes: technical challenges and applications"

Yair Goldberg,Michael R Kosorok,Donglin Zeng,Rui Song

doi:10.1214/14-ejs905

Abstract

Inference for parameters associated with optimal dynamic treatment regimes is challenging as these estimators are nonregular when there are non-responders to treatments. In this discussion, we comment on three aspects of alleviating this nonregularity. We first discuss an alternative approach for smoothing the quality functions. We then discuss some further details on our existing work to identify non-responders through penalization. Third, we propose a clinically meaningful value assessment whose estimator does not suffer from nonregularity.

Highlights

The authors are to be congratulated for their excellent and thoughtful paper on statistical inference for dynamic treatment regimens
We discuss replacing the nonsmooth objective functions via a SoftMax Q-learning approach, which directly addresses the trade-off between bias and variance of the maximum operation in the local asymptotic framework
We briefly describe the SoftMax Q-learning algorithm, and present some theoretical and simulation results

Summary

Introduction

The authors are to be congratulated for their excellent and thoughtful paper on statistical inference for dynamic treatment regimens. They have addressed several important and long-standing issues in this area. As discussed by the authors, nonsmoothness of the problem in some of the parameters of interest leads to estimators that are not smooth in the data. Nonregularity of the estimators for the parameters associated with the optimal treatment regimes is mainly due to the existence of non-responders to treatments. It would be useful and important if we could identify these non-responders. We claim that this alternative value function is clinically meaningful and does not suffer from nonregularity

SoftMax Q-learning

Proposed algorithm

Theory

Simulations for SoftMax

Penalized and adaptive Q-learning

Truncated value function

Concluding remarks

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2014
Citations: 14	License type: cc-by

R Discovery Prime

R Discovery Prime

Comment on "Dynamic treatment regimes: technical challenges and applications"

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics

Lead the way for us

Similar Papers

Response to Reader Reaction
Baqun Zhang ... Eric B Laber
Biometrics | VOL. 71
Baqun Zhang, et. al.Baqun Zhang ... Eric B Laber
29 Oct 2014
Biometrics | VOL. 71

HIGH-DIMENSIONAL A-LEARNING FOR OPTIMAL DYNAMIC TREATMENT REGIMES.
Chengchun Shi ... Rui Song
The Annals of Statistics | VOL. 46
Chengchun Shi, et. al.Chengchun Shi ... Rui Song
03 May 2018
The Annals of Statistics | VOL. 46

Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes, Part I: Main Content
Liliana Orellana ... Andrea Rotnitzky
The International Journal of Biostatistics | VOL. 6
Liliana Orellana, et. al.Liliana Orellana ... Andrea Rotnitzky
03 Jan 2010
The International Journal of Biostatistics | VOL. 6

Dynamic treatment regimes: technical challenges and applications.
Eric B Laber ... Min Qian
Electronic Journal of Statistics | VOL. 8
Eric B Laber, et. al.Eric B Laber ... Min Qian
01 Jan 2014
Electronic Journal of Statistics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comment on "Dynamic treatment regimes: technical challenges and applications"

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronic Journal of Statistics