Optimal Dynamic Treatment Regimes Research Articles

The wide-scale adoption of electronic health records (EHRs) provides extensive information to support precision medicine and personalized health care. In addition to structured EHRs, we leverage free-text clinical information extraction (IE) techniques to estimate optimal dynamic treatment regimes (DTRs), a sequence of decision rules that dictate how to individualize treatments to patients based on treatment and covariate history. The proposed IE of patient characteristics closely resembles "The clinical Text Analysis and Knowledge Extraction System" and employs named entity recognition, boundary detection, and negation annotation. It also utilizes regular expressions to extract numerical information. Combining the proposed IE with optimal DTR estimation, we extract derived patient characteristics and use tree-based reinforcement learning (T-RL) to estimate multistage optimal DTRs. IE significantly improved the estimation in counterfactual outcome models compared to using structured EHR data alone, which often include incomplete data, data entry errors, and other potentially unobserved risk factors. Moreover, including IE in optimal DTR estimation provides larger study cohorts and a broader pool of candidate tailoring variables. We demonstrate the performance of our proposed method via simulations and an application using clinical records to guide blood pressure control treatments among critically ill patients with severe acute hypertension. This joint estimation approach improves the accuracy of identifying the optimal treatment sequence by 14-24% compared to traditional inference without using IE, based on our simulations over various scenarios. In the blood pressure control application, we successfully extracted significant blood pressure predictors that are unobserved or partially missing from structuredEHR.

Read full abstract

A main research goal in various studies is to use an observational data set and provide a new set of counterfactual guidelines that can yield causal improvements. Dynamic Treatment Regimes (DTRs) are widely studied to formalize this process. However, available methods in finding optimal DTRs often rely on assumptions that are violated in real-world applications (e.g., medical decision-making or public policy), especially when (a) the existence of unobserved confounders cannot be ignored, and (b) the unobserved confounders are time-varying (e.g., affected by previous actions). When such assumptions are violated, one often faces ambiguity regarding the underlying causal model that is needed to be assumed to obtain an optimal DTR. This ambiguity is inevitable, since the dynamics of unobserved confounders and their causal impact on the observed part of the data cannot be understood from the observed data. Motivated by a case study of finding superior treatment regimes for patients who underwent transplantation in our partner hospital and faced a medical condition known as New Onset Diabetes After Transplantation (NODAT), we extend DTRs to a new class termed Ambiguous Dynamic Treatment Regimes (ADTRs), in which the casual impact of treatment regimes is evaluated based on a "cloud" of potential causal models. We then connect ADTRs to Ambiguous Partially Observable Mark Decision Processes (APOMDPs) proposed by Saghafian (2018), and develop two Reinforcement Learning methods termed Direct Augmented V-Learning (DAV-Learning) and Safe Augmented V-Learning (SAV-Learning), which enable using the observed data to efficiently learn an optimal treatment regime. We establish theoretical results for these learning methods, including (weak) consistency and asymptotic normality. We further evaluate the performance of these learning methods both in our case study and in simulation experiments.

Read full abstract

Optimal Dynamic Treatment Regimes Research Articles

Articles published on Optimal Dynamic Treatment Regimes

Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning

Penalized Spline-Involved Tree-based (PenSIT) Learning for estimating an optimal dynamic treatment regime using observational data.

On estimation and cross-validation of dynamic treatment regimes with competing risks.

Personalized Dynamic Treatment Regimes in Continuous Time: A Bayesian Approach for Optimizing Clinical Decisions with Timing

Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes.

Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring.

Doubly robust estimation of optimal dynamic treatmentregimes with multicategory treatments andsurvival outcomes.

Generalization error bounds of dynamic treatment regimes in penalized regression-based learning

Estimation in regret-regression using quadratic inference functions with ridge estimator.

Semiparametric Bayesian inference for optimal dynamic treatment regimes via dynamic marginal structural models.

Bayesian set of best dynamic treatment regimes: Construction and sample size calculation for SMARTswithbinary outcomes.

Optimal dynamic treatment regime estimation using information extraction from unstructured clinical text.

Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning

Restricted sub-tree learning to estimate an optimal dynamic treatment regime using observational data.

Adaptive randomization in a two-stage sequential multiple assignment randomized trial.

Multicategory Angle-Based Learning for Estimating Optimal Dynamic Treatment Regimes With Censored Data

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach

PDB65 Development of Optimal Dynamic Treatment Regime to Enhance Adherence of Patients with Type 2 Diabetes with Q-Learning Using Claims DATA

Stochastic Tree Search for Estimating Optimal Dynamic Treatment Regimes

Finite sample variance estimation for optimal dynamic treatment regimes of survival outcomes.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Optimal Dynamic Treatment Regimes Research Articles

Articles published on Optimal Dynamic Treatment Regimes

Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning

Penalized Spline-Involved Tree-based (PenSIT) Learning for estimating an optimal dynamic treatment regime using observational data.

On estimation and cross-validation of dynamic treatment regimes with competing risks.

Personalized Dynamic Treatment Regimes in Continuous Time: A Bayesian Approach for Optimizing Clinical Decisions with Timing

Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes.

Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring.

Doubly robust estimation of optimal dynamic treatmentregimes with multicategory treatments andsurvival outcomes.

Generalization error bounds of dynamic treatment regimes in penalized regression-based learning

Estimation in regret-regression using quadratic inference functions with ridge estimator.

Semiparametric Bayesian inference for optimal dynamic treatment regimes via dynamic marginal structural models.

Bayesian set of best dynamic treatment regimes: Construction and sample size calculation for SMARTswithbinary outcomes.

Optimal dynamic treatment regime estimation using information extraction from unstructured clinical text.

Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning

Restricted sub-tree learning to estimate an optimal dynamic treatment regime using observational data.

Adaptive randomization in a two-stage sequential multiple assignment randomized trial.

Multicategory Angle-Based Learning for Estimating Optimal Dynamic Treatment Regimes With Censored Data

Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach

PDB65 Development of Optimal Dynamic Treatment Regime to Enhance Adherence of Patients with Type 2 Diabetes with Q-Learning Using Claims DATA

Stochastic Tree Search for Estimating Optimal Dynamic Treatment Regimes

Finite sample variance estimation for optimal dynamic treatment regimes of survival outcomes.