RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

Yuanying Cai,Xuyun Zhang,Wei Shen,Chuheng Zhang,Longbo Huang,Wenjie Ruan

doi:10.1609/aaai.v37i6.25842

Abstract

Inspired by the recent success of sequence modeling in RL and the use of masked language model for pre-training, we propose a masked model for pre-training in RL, RePreM (Representation Pre-training with Masked Model), which trains the encoder combined with transformer blocks to predict the masked states or actions in a trajectory. RePreM is simple but effective compared to existing representation pre-training methods in RL. It avoids algorithmic sophistication (such as data augmentation or estimating multiple models) with sequence modeling and generates a representation that captures long-term dynamics well. Empirically, we demonstrate the effectiveness of RePreM in various tasks, including dynamic prediction, transfer learning, and sample-efficient RL with both value-based and actor-critic methods. Moreover, we show that RePreM scales well with dataset size, dataset quality, and the scale of the encoder, which indicates its potential towards big RL models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Conditional BERT Contextual Augmentation
Xing Wu ... Shangwen Lv
-
Xing Wu, et. al.Xing Wu ... Shangwen Lv
01 Jan 2019
01 Jan 2019

The impact of data augmentation and transfer learning on the performance of deep learning models for the segmentation of the hip on 3D magnetic resonance images
Eros Montin ... Riccardo Lattanzi
Informatics in medicine unlocked | VOL. 45
Eros Montin, et. al.Eros Montin ... Riccardo Lattanzi
01 Jan 2024
Informatics in medicine unlocked | VOL. 45

A novel masking model for Buddhist literature understanding by using Generative Adversarial Networks
Chaowen Yan ... Tao He
Expert Systems With Applications | VOL. 258
Chaowen Yan, et. al.Chaowen Yan ... Tao He
28 Aug 2024
Expert Systems With Applications | VOL. 258

Learning models in interdependence situations
...
-
, et. al. ...
18 Nov 2015
18 Nov 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence