Initial State Distribution Research Articles

Problem definition: Effective hypertension management is critical to reducing the consequences of atherosclerotic cardiovascular disease, a leading cause of death in the United States. Clinical guidelines for hypertension can be enhanced using decision-analytic approaches capable of capturing complexities in treatment planning. However, model-generated recommendations may be uninterpretable/unintuitive, limiting their clinical acceptability. We address this challenge by investigating interpretable treatment plans. Methodology/results: We formulate interpretable treatment plans as Markov decision processes (MDPs) and analyze the problems of optimizing monotone policies, which prohibit decreasing treatment intensity for sicker patients, and class-ordered monotone policies, which generalize monotone policies. We establish that both policies depend on initial state distributions and that optimal monotone policies can be generated tractably for many treatment planning problems. Next, we propose exact formulations for optimizing interpretable policies broadly. Then, we analyze the price of interpretability, proving that the class-ordered monotone policy’s price of interpretability does not exceed the monotone policy’s price of interpretability. Finally, we formulate and evaluate MDPs for hypertension treatment planning using a large nationally representative data set of the U.S. population. We compare the structure and performance of optimal monotone policies and class-ordered monotone policies with optimal MDP-based policies and current clinical guidelines. At the patient level, optimal MDP-based policies may be unintuitive, recommending more aggressive treatment for healthier patients than sicker patients. Conversely, monotone policies and class-ordered monotone policies never deescalate treatment, reflecting clinical intuition. Across 66.5 million patients, optimized monotone policies and class-ordered monotone policies outperform clinical guidelines, saving over 3,246 quality-adjusted life years per 100,000 patients, with both policies paying a low price of interpretability. Sensitivity analysis illustrates that monotone policies and class-ordered monotone policies are robust to various definitions of “interpretability.” Managerial implications: Interpretable policies can be tractably optimized, drastically outperform existing guidelines, and perform near optimally—potentially increasing the acceptability of decision-analytic approaches in practice. Funding: L. N. Steimle and W. J. Marrero received support from the National Science Foundation Graduate Research Fellowship [Grant DGE 1256260]. J. B. Sussman received support from the National Institutes of Health [Grants R01NS102715 and RF1AG068410], the U.S. Department of Veterans Affairs [Grants 1I01-HX003304 and 1I50-HX003251], and the Michigan Department of Health and Human Services. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2021.0373 .

Read full abstract

PurposeClinical trials have produced promising results for disease-modifying therapies (DMTs) for Alzheimer’s disease (AD); however, the evidence on their potential cost-effectiveness is limited. This study assesses the cost-effectiveness of a hypothetical DMT with a limited treatment duration in AD. MethodsWe developed a Markov state–transition model to estimate the cost-effectiveness of a hypothetical DMT plus best supportive care (BSC) versus BSC alone among Americans living with mild cognitive impairment (MCI) due to AD or mild AD. AD states included MCI due to AD, mild AD, moderate AD, severe AD, and death. A hypothetical DMT was assumed to confer a 30% reduction in progression from MCI and mild AD. The base case annual drug acquisition cost was assumed to be $56,000. Other medical and indirect costs were obtained from published literature or list prices. Utilities for patients and caregivers were obtained from the published literature and varied by AD state and care setting (community care or long-term care). We considered 3 DMT treatment strategies: (1) treatment administered until patients reached severe AD (continuous strategy), (2) treatment administered for a maximum duration of 18 months or when patients reached severe AD (fixed-duration strategy), and (3) 40% of patients discontinuing treatment at 6 months because of amyloid plaque clearance and the remaining patients continuing treatment until 18 months or until they reached severe AD (test-and-discontinue strategy). Incremental cost-effectiveness ratios (ICERs) were calculated as the incremental cost per quality-adjusted life-year (QALY) gained. FindingsFrom the health care sector perspective, continuous treatment with a hypothetical DMT versus BSC resulted in an ICER of $612,354 per QALY gained. The ICER decreased to $157,288 per QALY gained in the fixed-duration strategy, driven by large reductions in treatment costs. With 40% of patients discontinuing treatment at 6 months (test-and-discontinue strategy), the ICER was $125,631 per QALY gained. In sensitivity and scenario analyses, the ICER was the most sensitive to changes in treatment efficacy, treatment cost, and the initial population AD state distribution. From the modified societal perspective, ICERs were 6.3%, 20.4%, and 25.1% lower than those from the health care sector perspective for the continuous, fixed-duration, and test-and-discontinue strategies, respectively. ImplicationsUnder a set of assumptions for annual treatment costs and the magnitude and duration of treatment efficacy, DMTs used for a limited duration may deliver value consistent with accepted US cost-effectiveness thresholds.

Read full abstract

Initial State Distribution Research Articles

Related Topics

Articles published on Initial State Distribution

Global Optimality Guarantees for Policy Gradient Methods

A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots.

Interpretable Policies and the Price of Interpretability in Hypertension Treatment Planning

Distributed consensus-based multi-agent temporal-difference learning

Stochastic Magnetic Actuated Random Transducer Devices Based on Perpendicular Magnetic Tunnel Junctions

Impact of Salinity on the Energy Transfer between Pigment-Protein Complexes in Photosynthetic Apparatus, Functions of the Oxygen-Evolving Complex and Photochemical Activities of Photosystem II and Photosystem I in Two Paulownia Lines.

Softmax policy gradient methods can take exponential time to converge

A Neural Network Approach for High-Dimensional Optimal Control Applied to Multiagent Path Finding

A Data-Efficient Training Method for Deep Reinforcement Learning

Hybrid discrete-continuous truncated Wigner approximation for driven, dissipative spin systems

Assessing the Cost-effectiveness of a Hypothetical Disease-modifying Therapy With Limited Duration for the Treatment of Early Symptomatic Alzheimer Disease

Automating Reinforcement Learning With Example-Based Resets

Constrained Discounted Stochastic Games

Machine learning product state distributions from initial reactant states for a reactive atom-diatom collision system.

Formulation of temperature dependent effective Hartree potential incorporating quadratic over linear molecular DOFs-surface modes couplings and its effect on quantum dynamics of D2 (v = 0, j = 0)/D2 (v = 0, j = 2) on Cu(111) metal surface

Geant4 simulation of neutron displacement damage effect in InP

Discrete-time inverse linear quadratic optimal control over finite time-horizon under noisy output measurements

Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes

Brief Paper: Augmentation of Hidden Markov Chain for Complex Sequential Data in Context

Effect of surface temperature on quantum dynamics of H2 on Cu(111) using a chemically accurate potential energy surface.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Initial State Distribution Research Articles

Related Topics

Articles published on Initial State Distribution

Global Optimality Guarantees for Policy Gradient Methods

A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots.

Interpretable Policies and the Price of Interpretability in Hypertension Treatment Planning

Distributed consensus-based multi-agent temporal-difference learning

Stochastic Magnetic Actuated Random Transducer Devices Based on Perpendicular Magnetic Tunnel Junctions

Impact of Salinity on the Energy Transfer between Pigment-Protein Complexes in Photosynthetic Apparatus, Functions of the Oxygen-Evolving Complex and Photochemical Activities of Photosystem II and Photosystem I in Two Paulownia Lines.

Softmax policy gradient methods can take exponential time to converge

A Neural Network Approach for High-Dimensional Optimal Control Applied to Multiagent Path Finding

A Data-Efficient Training Method for Deep Reinforcement Learning

Hybrid discrete-continuous truncated Wigner approximation for driven, dissipative spin systems

Assessing the Cost-effectiveness of a Hypothetical Disease-modifying Therapy With Limited Duration for the Treatment of Early Symptomatic Alzheimer Disease

Automating Reinforcement Learning With Example-Based Resets

Constrained Discounted Stochastic Games

Machine learning product state distributions from initial reactant states for a reactive atom-diatom collision system.

Formulation of temperature dependent effective Hartree potential incorporating quadratic over linear molecular DOFs-surface modes couplings and its effect on quantum dynamics of D2 (v = 0, j = 0)/D2 (v = 0, j = 2) on Cu(111) metal surface

Geant4 simulation of neutron displacement damage effect in InP

Discrete-time inverse linear quadratic optimal control over finite time-horizon under noisy output measurements

Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes

Brief Paper: Augmentation of Hidden Markov Chain for Complex Sequential Data in Context

Effect of surface temperature on quantum dynamics of H2 on Cu(111) using a chemically accurate potential energy surface.