Generalization error bounds of dynamic treatment regimes in penalized regression-based learning

Eun Jeong Oh,Ying Kuen Cheung,Min Qian

doi:10.1214/22-aos2171

Abstract

A dynamic treatment regime (DTR) is a sequence of decision rules, one per stage of intervention, that maps up-to-date patient information to a recommended treatment. Discovering an appropriate DTR for a given disease is a challenging issue especially when a large set of prognostic variables are observed. To address this problem, we propose penalized regression-based learning methods with l1 penalty to estimate the optimal DTR that would maximize the expected outcome if implemented. We also provide generalization error bounds of the estimated DTR in the setting of finite number of stages with multiple treatment options. We first examine the relationship between value and Q-functions and derive a finite sample upper bound on the difference in values between the optimal and the estimated DTRs. For practical implementation, we develop an algorithm with partial regularization via orthogonality to construct the optimal DTR. The advantages of the proposed methods are demonstrated with extensive simulation studies and data analysis of depression clinical trials.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generalization error bounds of dynamic treatment regimes in penalized regression-based learning

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics

Lead the way for us

Similar Papers

Dynamic treatment regimes: technical challenges and applications.
Eric B Laber ... Min Qian
Electronic Journal of Statistics | VOL. 8
Eric B Laber, et. al.Eric B Laber ... Min Qian
01 Jan 2014
Electronic Journal of Statistics | VOL. 8

C-learning: A new classification framework to estimate optimal dynamic treatment regimes.
Baqun Zhang ... Min Zhang
Biometrics | VOL. 74
Baqun Zhang, et. al.Baqun Zhang ... Min Zhang
11 Dec 2017
Biometrics | VOL. 74

Response to Reader Reaction
Baqun Zhang ... Eric B Laber
Biometrics | VOL. 71
Baqun Zhang, et. al.Baqun Zhang ... Eric B Laber
29 Oct 2014
Biometrics | VOL. 71

Use of personalized Dynamic Treatment Regimes (DTRs) and Sequential Multiple Assignment Randomized Trials (SMARTs) in mental health studies.
Ying Liu ... Yuanjia Wang
Shanghai Archives of Psychiatry | VOL. 26
Ying Liu, et. al.Ying Liu ... Yuanjia Wang
01 Dec 2014
Shanghai Archives of Psychiatry | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalization error bounds of dynamic treatment regimes in penalized regression-based learning

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics