Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach

Soroush Saghafian

doi:10.2139/ssrn.3980837

Abstract

A main research goal in various studies is to use an observational data set and provide a new set of counterfactual guidelines that can yield causal improvements. Dynamic Treatment Regimes (DTRs) are widely studied to formalize this process. However, available methods in finding optimal DTRs often rely on assumptions that are violated in real-world applications (e.g., medical decision-making or public policy), especially when (a) the existence of unobserved confounders cannot be ignored, and (b) the unobserved confounders are time-varying (e.g., affected by previous actions). When such assumptions are violated, one often faces ambiguity regarding the underlying causal model that is needed to be assumed to obtain an optimal DTR. This ambiguity is inevitable, since the dynamics of unobserved confounders and their causal impact on the observed part of the data cannot be understood from the observed data. Motivated by a case study of finding superior treatment regimes for patients who underwent transplantation in our partner hospital and faced a medical condition known as New Onset Diabetes After Transplantation (NODAT), we extend DTRs to a new class termed Ambiguous Dynamic Treatment Regimes (ADTRs), in which the casual impact of treatment regimes is evaluated based on a "cloud" of potential causal models. We then connect ADTRs to Ambiguous Partially Observable Mark Decision Processes (APOMDPs) proposed by Saghafian (2018), and develop two Reinforcement Learning methods termed Direct Augmented V-Learning (DAV-Learning) and Safe Augmented V-Learning (SAV-Learning), which enable using the observed data to efficiently learn an optimal treatment regime. We establish theoretical results for these learning methods, including (weak) consistency and asymptotic normality. We further evaluate the performance of these learning methods both in our case study and in simulation experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal

Lead the way for us

Journal: SSRN Electronic Journal	Publication Date: Jan 1, 2021
Citations: 2

Similar Papers

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
Soroush Saghafian
Management Science | VOL. -
Soroush SaghafianSoroush Saghafian
04 Oct 2023
Management Science | VOL. -

Response to Reader Reaction
Baqun Zhang ... Marie Davidian
Biometrics | VOL. 71
Baqun Zhang, et. al.Baqun Zhang ... Marie Davidian
29 Oct 2014
Biometrics | VOL. 71

New onset diabetes after transplantation: unravelling the pathophysiological process
Jennifer A Mccaughan ... Alexander P Maxwell
The Lancet | VOL. 383
Jennifer A Mccaughan, et. al.Jennifer A Mccaughan ... Alexander P Maxwell
01 Feb 2014
The Lancet | VOL. 383

Influence of interleukin-2 receptor antagonists on the morbidity and prognosis of new-onset diabetes after liver transplantation
...
Chinese Journal of Endocrinology and Metabolism | VOL. 34
, et. al. ...
25 Feb 2018
Chinese Journal of Endocrinology and Metabolism | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal