Abstract

Reinforcement learning (RL) has recently become one of the most popular methods for intelligent control and decision making in robotics. The goal of RL is to learn an optimal policy for the agent by interacting with the environment through trial and error. RL algorithms fall into two main categories: model-free and model-based methods. Model-free RL optimizes the policy from the agent's historical trajectories and empirical data; it must take actions in the environment to collect trajectory data, which may damage the robot when training in a real environment. The main difference between model-based and model-free RL is that a model of the transition probability of the interaction environment is employed, so the agent can search for the optimal policy through internal simulation. However, the model of the transition probability is usually estimated from historical data in a single environment and is subject to statistical errors. The agent therefore faces the issue that the optimal policy is sensitive to perturbations in the model of the environment, which can lead to serious degradation in performance. Robust RL aims to learn a robust optimal policy that accounts for model uncertainty in the transition probability, systematically mitigating the sensitivity of the optimal policy in perturbed environments. In this overview, we begin with an introduction to RL algorithms and then focus on the model uncertainty of the transition probability in robust RL. In parallel, we highlight current research and challenges of robust RL for robot control. To conclude, we describe several research areas in robust RL and look ahead to future work on robot control in complex environments.
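
For context, the model uncertainty discussed above is commonly formalized as a robust Markov decision process, in which the transition kernel is only known to lie in an uncertainty set and the policy is optimized against the worst case. The equation below is a standard robust Bellman equation of this kind (the notation, including the uncertainty set \(\mathcal{P}(s,a)\), reward \(r\), and discount factor \(\gamma\), is introduced here for illustration and does not appear in the abstract itself):

\[
V^{*}(s) \;=\; \max_{a \in \mathcal{A}} \; \min_{P \in \mathcal{P}(s,a)} \Big[\, r(s,a) \;+\; \gamma \sum_{s'} P(s' \mid s, a)\, V^{*}(s') \,\Big].
\]

The inner minimization over \(\mathcal{P}(s,a)\) is what distinguishes robust RL from standard model-based RL, which would use a single estimated transition probability in place of the worst-case choice.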
