Discounted Markov Decision Processes Research Articles

This study explores the cost-effectiveness of telemedicine for the screening of diabetic retinopathy (DR) and identifies changes in the demographics of a patient population after telemedicine implementation. A retrospective medical chart review (cohort study) was conducted of 900 type 1 and type 2 diabetic patients enrolled in a medical system with a telemedicine screening program for DR. The cost-effectiveness of the DR telemedicine program was determined by using a finite-horizon, discrete-time, discounted Markov decision process model populated by parameters and testing frequency obtained from patient records. The model estimated the progression of DR and determined the average quality-adjusted life years (QALYs) saved and the average additional cost incurred by the telemedicine screening program. The main outcome measures were diabetic retinopathy, macular edema, blindness, and associated QALYs. The results indicate that telemedicine screening is cost-effective for DR under most conditions. On average, it is cost-effective for patient populations of >3500, patients aged <80 years, and all racial groups. Several trends were observed in the screening population since the implementation of telemedicine screening: the number of known DR cases has increased, the overall age of patients receiving screenings has decreased, the percentage of nonwhites receiving screenings has increased, the average number of miles traveled by a patient to receive a screening has decreased, and teleretinal screening participation is increasing. The current teleretinal screening program is cost-effective and increases population reach. Future screening policies should consider the age of patients receiving screenings and the size of the system's patient pool, because the results indicate that it is not cost-effective to screen patients older than 80 years or populations with <3500 patients.
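
For readers unfamiliar with the model class named in this abstract, the following is a minimal sketch of how a finite-horizon, discrete-time, discounted Markov decision process can be solved by backward induction. It is not the study's model: the three health states, the two actions, and every transition probability and reward below are hypothetical numbers chosen only for illustration.

```python
import numpy as np

def backward_induction(P, R, gamma, horizon):
    """Solve a finite-horizon discounted MDP by backward induction.

    P: array (A, S, S); P[a, s, t] is the probability of moving s -> t under action a.
    R: array (A, S); immediate reward for taking action a in state s.
    gamma: discount factor in [0, 1].
    horizon: number of decision epochs.
    Returns (V, policy) with V of shape (horizon + 1, S) and policy of shape (horizon, S).
    """
    n_actions, n_states, _ = P.shape
    V = np.zeros((horizon + 1, n_states))        # terminal values V[horizon] are zero
    policy = np.zeros((horizon, n_states), dtype=int)
    for t in range(horizon - 1, -1, -1):
        Q = R + gamma * (P @ V[t + 1])           # action values, shape (A, S)
        V[t] = Q.max(axis=0)                     # best achievable value per state
        policy[t] = Q.argmax(axis=0)             # action attaining it
    return V, policy

# Hypothetical toy model (all numbers are assumptions, not from the study).
# States: 0 = no retinopathy, 1 = retinopathy, 2 = blind (absorbing).
# Actions: 0 = do not screen, 1 = screen.
P = np.array([
    [[0.90, 0.10, 0.00],    # no screening
     [0.00, 0.80, 0.20],
     [0.00, 0.00, 1.00]],
    [[0.90, 0.10, 0.00],    # screening slows progression from state 1
     [0.00, 0.95, 0.05],
     [0.00, 0.00, 1.00]],
])
# Per-period reward: quality of life, minus a small screening cost for action 1.
R = np.array([
    [1.00, 0.80, 0.40],
    [0.98, 0.78, 0.38],
])

V, policy = backward_induction(P, R, gamma=0.97, horizon=10)
print(policy)   # rows are decision epochs, columns are states
```

With these made-up numbers, screening is selected in the retinopathy state at every epoch except the last, where no future benefit remains to offset the screening cost.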

How should agents bid in repeated sequential auctions when they are budget constrained? A motivating example is that of sponsored search auctions, where advertisers bid in a sequence of generalized second price (GSP) auctions. These auctions, specifically in the context of sponsored search, have many idiosyncratic features that distinguish them from other models of sequential auctions. First, each bidder competes in a large number of auctions, where each auction is worth very little. Second, the total bidder population is often large, which means it is unrealistic to assume that the bidders could optimize their strategy by modeling specific opponents. Third, the presence of a virtually unlimited supply of these auctions means bidders are necessarily expense constrained. Motivated by these three factors, we frame the generic problem as a discounted Markov Decision Process for which the environment is independent and identically distributed over time. We also allow the agents to receive income to augment their budget at a constant rate. We first provide a structural characterization of the associated value function and the optimal bidding strategy, which specifies the extent to which agents underbid from their true valuation due to long-term budget constraints. We then provide an explicit characterization of the optimal bid shading factor in the limiting regime where the discount rate tends to zero, by identifying the limit of the value function in terms of the solution to a differential equation that can be solved efficiently. Finally, we prove the existence of Mean Field Equilibria for both the repeated second price and GSP auctions with a large number of bidders.
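
The idea of framing budget-constrained bidding as a discounted Markov decision process with an i.i.d. environment can be illustrated with a small value-iteration sketch. This is a simplified toy and not the paper's model: it assumes an integer budget, a fixed valuation `v`, a highest competing bid drawn uniformly from `1..max_price`, a plain second-price rule, and no income. All names and parameter values are hypothetical.

```python
import numpy as np

def solve_bidding_mdp(v, max_budget, max_price, beta, tol=1e-9, max_iter=10_000):
    """Value iteration for a toy budget-constrained bidder in repeated second-price auctions.

    State: remaining integer budget b in 0..max_budget.
    Each round the highest competing bid d is uniform on 1..max_price (i.i.d.).
    Bidding x wins whenever d <= x; the winner pays d and earns v - d.
    Returns (V, bids): the value function and a greedy bid for each budget level.
    """
    prices = np.arange(1, max_price + 1)
    V = np.zeros(max_budget + 1)
    bids = np.zeros(max_budget + 1, dtype=int)
    for _ in range(max_iter):
        V_new = np.empty_like(V)
        for b in range(max_budget + 1):
            best_val, best_bid = -np.inf, 0
            for x in range(min(b, max_price) + 1):       # bids cannot exceed the budget
                won = prices[prices <= x]                # competing bids that x beats
                win_payoff = np.sum(v - won + beta * V[b - won])
                lose_payoff = (max_price - len(won)) * beta * V[b]
                q = (win_payoff + lose_payoff) / max_price
                if q > best_val:                         # ties keep the smaller bid
                    best_val, best_bid = q, x
            V_new[b], bids[b] = best_val, best_bid
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    return V, bids

V, bids = solve_bidding_mdp(v=6, max_budget=30, max_price=10, beta=0.9)
print(bids)   # greedy bid at each budget level 0..30
```

In this toy setting the computed bid never exceeds the valuation, because winning at a price above `v` loses money now and also leaves less budget for later rounds. How far below `v` the bid falls at each budget level is the "bid shading" the abstract refers to, here only in a discretized, income-free approximation.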

Articles published on Discounted Markov Decision Processes

Uniqueness of optimal policies as a generic property of discounted Markov decision processes: Ekeland's variational principle approach

Regularized policy iteration with nonparametric function spaces

Hamiltonian cycle curves in the space of discounted occupational measures

Conditions for the Solvability of the Linear Programming Formulation for Constrained Discounted Markov Decision Processes

A Modified Value Iteration Algorithm for Discounted Markov Decision Processes

Feature selection and feature learning for high-dimensional batch reinforcement learning: A survey

Data-Driven Stochastic Models and Policies for Energy Harvesting Sensor Communications

Near-optimal PAC bounds for discounted MDPs

Value set iteration for Markov decision processes

Stochastic approximations of constrained discounted Markov decision processes

Blackwell Optimality in Stochastic Games

Policy set iteration for Markov decision processes

Evaluation of Telemedicine for Screening of Diabetic Retinopathy in the Veterans Health Administration

Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor

Nonuniqueness versus Uniqueness of Optimal Policies in Convex Discounted Markov Decision Processes

Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes

Optimal Bidding Strategies and Equilibria in Dynamic Auctions with Budget Constraints

A mean–variance optimization problem for discounted Markov decision processes

A Version of the Euler Equation in Discounted Markov Decision Processes

Convex analytic approach to constrained discounted Markov decision processes with non-constant discount factors
