Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Xuelin Huang,Peter F Thall,Sangbum Choi,Lu Wang

doi:10.1002/sim.6558

Abstract

In medical therapies involving multiple stages, a physician's choice of a subject's treatment at each stage depends on the subject's history of previous treatments and outcomes. The sequence of decisions is known as a dynamic treatment regime or treatment policy. We consider dynamic treatment regimes in settings where each subject's final outcome can be defined as the sum of longitudinally observed values, each corresponding to a stage of the regime. Q-learning, which is a backward induction method, is used to first optimize the last stage treatment then sequentially optimize each previous stage treatment until the first stage treatment is optimized. During this process, model-based expectations of outcomes of late stages are used in the optimization of earlier stages. When the outcome models are misspecified, bias can accumulate from stage to stage and become severe, especially when the number of treatment stages is large. We demonstrate that a modification of standard Q-learning can help reduce the accumulated bias. We provide a computational algorithm, estimators, and closed-form variance formulas. Simulation studies show that the modified Q-learning method has a higher probability of identifying the optimal treatment regime even in settings with misspecified models for outcomes. It is applied to identify optimal treatment regimes in a study for advanced prostate cancer and to estimate and compare the final mean rewards of all the possible discrete two-stage treatment sequences.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Abstract

Talk to us

Similar Papers

More From: Statistics in Medicine

Lead the way for us

Journal: Statistics in Medicine	Publication Date: Jun 21, 2015
Citations: 52

Similar Papers

Improved Doubly Robust Estimation in Marginal Mean Models for Dynamic Regimes
Hao Sun ... Xin Lu
Journal of Causal Inference | VOL. 8
Hao Sun, et. al.Hao Sun ... Xin Lu
01 Jan 2020
Journal of Causal Inference | VOL. 8

Evaluating Joint Effects of Induction–Salvage Treatment Regimes on Overall Survival in Acute Leukaemia
Abdus S Wahed ... Peter F Thall
Journal of the Royal Statistical Society Series C: Applied Statistics | VOL. 62
Abdus S Wahed, et. al.Abdus S Wahed ... Peter F Thall
27 Jul 2012
Journal of the Royal Statistical Society Series C: Applied Statistics | VOL. 62

Discussion of “Dynamic treatment regimes: Technical challenges and applications”
Jesse Y Hsu ... Dylan S Small
Electronic Journal of Statistics | VOL. 8
Jesse Y Hsu, et. al.Jesse Y Hsu ... Dylan S Small
01 Jan 2014
Electronic Journal of Statistics | VOL. 8

Treatment-competing events in dynamic regimes
Brent A Johnson
Lifetime Data Analysis | VOL. 14
Brent A JohnsonBrent A Johnson
09 Sep 2007
Lifetime Data Analysis | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Abstract

Talk to us

Similar Papers

More From: Statistics in Medicine