Abstract

Deep reinforcement learning is widely used to solve high-dimensional, complex sequential decision-making problems. However, one of its biggest challenges is sample efficiency, especially on such high-dimensional problems. Model-based reinforcement learning can improve sample efficiency with a learned world model, but its performance is limited by the imperfections of that model, so its asymptotic performance is usually worse than that of model-free reinforcement learning. In this paper, we propose a novel model-based reinforcement learning algorithm called World Model with Trajectory Discrimination (WMTD). We learn a representation of temporal dynamics by adding a trajectory discriminator to the world model, and then weight state-value estimates by the discriminator's output to optimize the policy. Specifically, we augment trajectories to generate negative samples and train a trajectory discriminator that shares its feature extractor with the world model. Experimental results demonstrate that our method improves sample efficiency and achieves state-of-the-art performance on DeepMind Control tasks.
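The core mechanism described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the encoder, discriminator, dimensions, and the shuffle-based augmentation are all simplifying assumptions standing in for the (unspecified) architecture; the paper only states that negatives are generated by augmenting trajectories and that the discriminator shares the world model's feature extractor.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes; the paper does not specify these.
STATE_DIM, FEAT_DIM, T = 4, 8, 10

# Shared feature extractor: one linear layer + tanh stands in for the
# world-model encoder that WMTD shares with the discriminator.
W_enc = rng.normal(scale=0.1, size=(STATE_DIM, FEAT_DIM))

def encode(traj):
    """Map a (T, STATE_DIM) trajectory to (T, FEAT_DIM) features."""
    return np.tanh(traj @ W_enc)

def augment_negative(traj):
    """Negative sample: shuffle time steps to destroy temporal dynamics
    (one plausible augmentation; the paper's exact scheme may differ)."""
    return rng.permutation(traj, axis=0)

# Trajectory discriminator: logistic regression over the flattened
# feature sequence (flattening keeps it sensitive to temporal order).
w_d = np.zeros(T * FEAT_DIM)

def discriminate(traj):
    """Probability that a trajectory has realistic temporal dynamics."""
    z = encode(traj).reshape(-1)
    return 1.0 / (1.0 + np.exp(-(z @ w_d)))

def train_step(real_traj, lr=0.1):
    """One binary cross-entropy step: real trajectory vs. shuffled negative."""
    global w_d
    for traj, label in ((real_traj, 1.0), (augment_negative(real_traj), 0.0)):
        z = encode(traj).reshape(-1)
        p = 1.0 / (1.0 + np.exp(-(z @ w_d)))
        w_d -= lr * (p - label) * z  # gradient of BCE w.r.t. w_d

def weighted_value(traj, value_estimate):
    """Down-weight value estimates on trajectories the discriminator
    deems unrealistic (the weighting idea, heavily simplified)."""
    return discriminate(traj) * value_estimate

# Usage: a smooth random walk serves as the "real" trajectory.
real = np.cumsum(rng.normal(scale=0.1, size=(T, STATE_DIM)), axis=0)
for _ in range(50):
    train_step(real)
```

After training, `weighted_value` scales an imagined rollout's value estimate by the discriminator's confidence, so policy optimization trusts model-generated trajectories less when their dynamics look implausible.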
