Robust Markov Decision Process Research Articles

Industrial manufacturing and social services are becoming increasingly automated and intelligent, driven by the wide application of robots and artificial intelligence (AI). Robots free humans from tedious and dangerous work in hazardous environments, while AI simplifies robot programming by automatically inferring patterns and models from the interaction between robots and their environment. Nevertheless, the application of robots and AI to more general manufacturing and social tasks is still limited by a lack of flexibility and adaptability to changes in the task and the environment. A new concept, adaptive robotics, has therefore been proposed to capture the requirement that an AI-powered robot should be able to reprogram itself in response to these changes without human intervention. However, this concept is still too abstract to give specific guidance for developing robot programs. In this paper, we attempt to provide a methodical redefinition and reformulation of adaptive robotics, both conceptually and mathematically, based on a study of previous results. First, we introduce the essential motivation and the conceptual origin of adaptability in mechanical systems. Then, we review the literature and explore work related to adaptive robotics. On this basis, we provide a uniform mathematical formulation of adaptive robotics based on adaptive and robust Markov decision processes (MDPs). Through this work, we hope to inspire a generic framework for adaptive robotics that incorporates the existing, still immature paradigms, and thereby to establish a clear and well-defined context for future research in related domains.
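
For readers unfamiliar with the robust MDP setting the abstract refers to, the following is a minimal sketch of robust value iteration. It is not the formulation proposed in the paper. It assumes a finite state and action space and, for each state-action pair, a finite list of candidate next-state distributions (a simple rectangular uncertainty set). The function name `robust_value_iteration` and the arguments `rewards`, `kernels`, and `gamma` are hypothetical names chosen for illustration.

```python
def robust_value_iteration(rewards, kernels, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Robust value iteration over a finite uncertainty set (illustrative sketch).

    rewards[s][a] -- immediate reward for action a in state s
    kernels[s][a] -- list of candidate next-state distributions for (s, a);
                     each candidate is a list of probabilities over states
    The agent maximizes over actions while an adversary picks, for each
    (s, a), the candidate distribution with the lowest expected value.
    """
    n = len(rewards)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v = []
        for s in range(n):
            best = max(
                rewards[s][a]
                + gamma * min(
                    sum(p[t] * v[t] for t in range(n)) for p in kernels[s][a]
                )
                for a in range(len(rewards[s]))
            )
            new_v.append(best)
        # Stop once successive value estimates agree to within tol.
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


# Two states, one action each. State 1 is absorbing with zero reward.
# In state 0 the transition is uncertain: stay in 0, or move to 1.
rewards = [[1.0], [0.0]]
nominal = [[[[1.0, 0.0]]], [[[0.0, 1.0]]]]
uncertain = [[[[1.0, 0.0], [0.0, 1.0]]], [[[0.0, 1.0]]]]

v_nominal = robust_value_iteration(rewards, nominal)
v_robust = robust_value_iteration(rewards, uncertain)
```

In this toy instance the nominal value of state 0 is the discounted sum 1 / (1 - 0.9) = 10, while the robust value is 1, because the adversary always selects the transition to the zero-reward absorbing state.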

Demand forecasting plays an important role in many inventory control problems. To mitigate the potential harms of model misspecification in this context, various forms of distributionally robust optimization have been applied. Although many of these methodologies suffer from the problem of time inconsistency, the work of Klabjan et al. established a general time-consistent framework for such problems by connecting to the literature on robust Markov decision processes. Motivated by the fact that many forecasting models exhibit very special structure as well as a desire to understand the impact of positing different dependency structures in distributionally robust multistage optimization, we formulate and solve a time-consistent distributionally robust multistage newsvendor model, which naturally robustifies some of the simplest inventory models with demand forecasting. In particular, in some of the simplest such models, demand evolves as a martingale (i.e., expected demand tomorrow equals realized demand today). We consider a robust variant of such models in which the sequence of future demands may be any martingale with given mean and support. Under such a model, past realizations of demand are naturally incorporated into the structure of the uncertainty set going forward. We explicitly compute the minimax optimal policy (and worst-case distribution) in closed form by combining ideas from convex analysis, probability, and dynamic programming. We prove that, at optimality, the worst-case demand distribution corresponds to the setting in which inventory may become obsolete at a random time. To gain further insight, we prove weak convergence (as the time horizon grows large) to a simple and intuitive process. We also compare with the analogous setting in which demand is independent across periods (analyzed previously by Shapiro) and identify several differences between these models in the spirit of the price of correlations studied by Agrawal et al. Finally, we complement our results by providing both numerical experiments that illustrate the potential benefits and limitations of our approach as well as additional theoretical analyses exploring what happens when our modeling assumptions do not hold.
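
As a rough illustration of the "given mean and support" ambiguity set, the sketch below treats only a single period, not the multistage martingale model or the closed-form policy derived in the paper. It assumes demand lies in [0, U] with mean mu, a per-unit holding cost `h`, and a per-unit backorder cost `b`. These symbols and the function names `worst_case_cost` and `robust_order` are assumptions introduced here. Because the newsvendor cost is convex in demand, its expectation over all distributions on [0, U] with mean mu is maximized by the two-point distribution on the endpoints 0 and U.

```python
def worst_case_cost(x, mu, U, h, b):
    """Worst-case expected single-period cost for order quantity x in [0, U].

    The adversary may choose any demand distribution on [0, U] with mean mu.
    For a convex cost the worst case is the two-point distribution that puts
    mass mu / U on U and the remaining mass on 0.
    """
    p = mu / U                 # probability that demand equals U
    cost_if_zero = h * x       # demand 0: all x units are left over
    cost_if_max = b * (U - x)  # demand U: shortage of U - x units
    return (1 - p) * cost_if_zero + p * cost_if_max


def robust_order(mu, U, h, b):
    """Order quantity minimizing the worst-case cost.

    The worst-case cost is linear in x on [0, U], so an endpoint is optimal.
    """
    return min((0.0, U), key=lambda x: worst_case_cost(x, mu, U, h, b))


# Hypothetical parameters: mean demand 5, support [0, 10], h = 1, b = 3.
x_star = robust_order(mu=5.0, U=10.0, h=1.0, b=3.0)
cost_star = worst_case_cost(x_star, mu=5.0, U=10.0, h=1.0, b=3.0)
```

With these hypothetical numbers, ordering nothing has a worst-case cost of 15 and ordering the full 10 units has a worst-case cost of 5, so the sketch selects the upper endpoint.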

Articles published on Robust Markov Decision Process

Bounding the difference between the values of robust and non-robust Markov decision problems

On the Convex Formulations of Robust Markov Decision Processes

Robust Average-Reward Reinforcement Learning

Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization

Robust Average-Reward Markov Decision Processes

Decision-Dependent Distributionally Robust Markov Decision Process Method in Dynamic Epidemic Control

Data-driven remanufacturing planning with parameter uncertainty

Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics

A methodical interpretation of adaptive robotics: Study and reformulation

Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

Robust Markov Decision Processes with Data-Driven, Distance-Based Ambiguity Sets

Distributionally Robust Inventory Control When Demand Is a Martingale

Distributionally Robust Markov Decision Processes and Their Connection to Risk Measures

Scalable First-Order Methods for Robust MDPs

Markov decision processes with recursive risk measures

Robust Deep Reinforcement Learning for Quadcopter Control

Stochastic Energy Management of Electric Bus Charging Stations With Renewable Energy Integration and B2G Capabilities

Stochastic and Distributionally Robust Load Ensemble Control

Distributionally robust optimization for sequential decision-making

An active-set strategy to solve Markov decision processes with good-deal risk measure
