Outcome trajectory estimation for optimal dynamic treatment regimes with repeated measures.

Yuan Zhang, Thomas A Murray, Lizbeth H Finestack, David M Vock, Megan E Patrick

doi:10.1093/jrsssc/qlad037

Open Access

Journal: Journal of the Royal Statistical Society Series C: Applied Statistics
Publication Date: May 22, 2023
Citations: 6

Affiliation: University of Pennsylvania, University of Minnesota, University of Michigan–Ann Arbor

#Dynamic Treatment Regimes #Inverse Probability Weighting

Abstract

In recent sequential multiple assignment randomized trials, outcomes were assessed multiple times to evaluate longer-term impacts of the dynamic treatment regimes (DTRs). Q-learning requires a scalar response to identify the optimal DTR. Inverse probability weighting may be used to estimate the optimal outcome trajectory, but it is inefficient, susceptible to model mis-specification, and unable to characterize how treatment effects manifest over time. We propose modified Q-learning with generalized estimating equations to address these limitations and apply it to the M-bridge trial, which evaluates adaptive interventions to prevent problematic drinking among college freshmen. Simulation studies demonstrate our proposed method improves efficiency and robustness.
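
To make the scalar-response requirement concrete, the following is a minimal sketch of standard two-stage Q-learning with linear working models, written in Python with NumPy. It is not the modified Q-learning with generalized estimating equations proposed in the paper; it only illustrates the baseline method the abstract refers to, in which a single end-of-study outcome y is regressed backward through the stages. The data-generating model, the function names (simulate, fit_ols, two_stage_q_learning), the covariates x1 and x2, and the simplification that every participant is randomized at both stages are all assumptions made for illustration.

```python
import numpy as np


def fit_ols(design, response):
    """Ordinary least squares via numpy.linalg.lstsq; returns the coefficient vector."""
    beta, *_ = np.linalg.lstsq(design, response, rcond=None)
    return beta


def simulate(n, seed=0):
    """Hypothetical two-stage trial: treatments a1, a2 in {-1, +1}, randomized 1:1."""
    rng = np.random.default_rng(seed)
    x1 = rng.normal(size=n)                      # baseline covariate
    a1 = rng.choice([-1.0, 1.0], size=n)         # stage-1 treatment
    x2 = 0.5 * x1 + rng.normal(size=n)           # intermediate covariate
    a2 = rng.choice([-1.0, 1.0], size=n)         # stage-2 treatment
    # Scalar outcome (assumed model): true stage-1 blip 0.5*a1*x1, stage-2 blip a2*x2
    y = x1 + x2 + 0.5 * a1 * x1 + a2 * x2 + rng.normal(size=n)
    return x1, a1, x2, a2, y


def two_stage_q_learning(x1, a1, x2, a2, y):
    """Backward induction with linear Q-functions and a single scalar outcome."""
    ones = np.ones_like(x1)

    # Stage 2: regress y on the full history plus a2 and its interaction with x2.
    design2 = np.column_stack([ones, x1, a1, a1 * x1, x2, a2, a2 * x2])
    beta2 = fit_ols(design2, y)

    # Pseudo-outcome: fitted value under the best stage-2 action.
    # With a2 in {-1, +1}, max over a2 of a2 * blip equals |blip|.
    main2 = design2[:, :5] @ beta2[:5]
    blip2 = beta2[5] + beta2[6] * x2
    pseudo = main2 + np.abs(blip2)

    # Stage 1: regress the pseudo-outcome on baseline information and a1.
    design1 = np.column_stack([ones, x1, a1, a1 * x1])
    beta1 = fit_ols(design1, pseudo)
    return beta1, beta2


def estimated_rules(beta1, beta2):
    """Decision rules implied by the fitted Q-functions: treat with the sign of the blip."""
    rule1 = lambda x1: np.where(beta1[2] + beta1[3] * x1 >= 0, 1.0, -1.0)
    rule2 = lambda x2: np.where(beta2[5] + beta2[6] * x2 >= 0, 1.0, -1.0)
    return rule1, rule2


if __name__ == "__main__":
    data = simulate(5000, seed=0)
    beta1, beta2 = two_stage_q_learning(*data)
    print("stage-1 coefficients:", np.round(beta1, 3))
    print("stage-2 coefficients:", np.round(beta2, 3))
```

Because the recursion propagates one number per participant, repeated measures would have to be collapsed into a summary (for example, the final measurement) before this procedure could be applied, which is the limitation the abstract highlights.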

Full Text