Abstract

Model-free deep reinforcement learning algorithms have been shown to be capable of learning a wide range of robotic skills, but typically require a very large number of samples to achieve good performance. Model-based algorithms, in principle, can provide for much more efficient learning, but have proven difficult to extend to expressive, high-capacity models such as deep neural networks. In this work, we demonstrate that neural network dynamics models can in fact be combined with model predictive control (MPC) to achieve excellent sample complexity in a model-based reinforcement learning algorithm, producing stable and plausible gaits that accomplish various complex locomotion tasks. We further propose using deep neural network dynamics models to initialize a model-free learner, in order to combine the sample efficiency of model-based approaches with the high task-specific performance of model-free methods. We empirically demonstrate on MuJoCo locomotion tasks that our pure model-based approach trained on just random action data can follow arbitrary trajectories with excellent sample efficiency, and that our hybrid algorithm can accelerate model-free learning on high-speed benchmark tasks, achieving sample efficiency gains of $3-5\times$ on swimmer, cheetah, hopper, and ant agents. Videos can be found at https://sites.google.com/view/mbmf
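
For illustration only, here is a minimal sketch of how a learned dynamics model might be combined with MPC as summarized above, using a simple random-shooting planner. The names `dynamics_fn` and `reward_fn` are hypothetical placeholders for a trained neural network dynamics model and a task reward function; they are not taken from the paper or its code release.

```python
import numpy as np

def random_shooting_mpc(dynamics_fn, reward_fn, state, action_dim,
                        horizon=10, num_candidates=1000,
                        action_low=-1.0, action_high=1.0):
    """Return the first action of the best sampled action sequence
    under the learned dynamics model (one MPC step, then replan)."""
    # Sample candidate action sequences uniformly at random.
    candidates = np.random.uniform(action_low, action_high,
                                   size=(num_candidates, horizon, action_dim))
    returns = np.zeros(num_candidates)
    states = np.tile(state, (num_candidates, 1))  # roll out all candidates in parallel
    for t in range(horizon):
        actions = candidates[:, t, :]
        next_states = dynamics_fn(states, actions)   # learned model predicts next states
        returns += reward_fn(states, actions, next_states)
        states = next_states
    best = np.argmax(returns)
    return candidates[best, 0, :]  # execute only the first action, then replan
```

At each control step the agent would call this planner with its current state, apply the returned action, and replan from the resulting state, so that model errors are corrected by frequent replanning rather than by long open-loop rollouts.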
