Recovering Permuted Sequential Features for effective Reinforcement Learning.

Yi Jiang, Mingxiao Feng, Wengang Zhou, Houqiang Li

doi:10.1016/j.neunet.2024.106795

Abstract

Applying Reinforcement Learning (RL) to real-world visual tasks raises two major challenges: sample inefficiency and limited generalization. To improve sample efficiency, previous works focus primarily on learning semantic information from the visual state, but they do not explicitly learn other valuable aspects, such as spatial information. To improve generalization, they learn representations that are invariant to changes in task-irrelevant variables, without considering task-relevant variables. To enhance the sample efficiency and generalization of the base RL algorithm in visual tasks, we propose an auxiliary task called Recovering Permuted Sequential Features (RPSF). Our method enhances generalization by learning the spatial structure information of the agent, which can mitigate the effects of changes in both task-relevant and task-irrelevant variables. Moreover, it explicitly learns both semantic and spatial information from the visual state by disordering and subsequently recovering a sequence of features to generate more holistic representations, thereby improving sample efficiency. Extensive experiments demonstrate that our method significantly improves the sample efficiency and generalization of the base RL algorithm and outperforms various state-of-the-art baselines across diverse tasks in unseen environments. Furthermore, our method is compatible with both CNN and Transformer architectures.
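The abstract does not say how the feature sequence is disordered or what the recovery objective looks like. As a rough illustration only, the sketch below assumes the sequence is a (T, D) NumPy array and that "disordering" means a random permutation along the sequence axis. The function names `permute_sequence` and `recover_sequence` and the array sizes are hypothetical and are not taken from the paper. The sketch undoes the shuffle with the known permutation; in a learned auxiliary task, a model would presumably have to infer the original order instead.

```python
import numpy as np


def permute_sequence(features, rng):
    """Shuffle a (T, D) sequence of feature vectors along the sequence axis.

    Returns the shuffled sequence and the permutation that was applied,
    so that shuffled[i] == features[perm[i]].
    """
    perm = rng.permutation(features.shape[0])
    return features[perm], perm


def recover_sequence(shuffled, perm):
    """Undo the shuffle using the inverse permutation (argsort of perm)."""
    inverse = np.argsort(perm)
    return shuffled[inverse]


rng = np.random.default_rng(0)
# Hypothetical sizes: 6 feature vectors of dimension 4.
features = rng.normal(size=(6, 4))
shuffled, perm = permute_sequence(features, rng)
restored = recover_sequence(shuffled, perm)
```

Here `restored` matches `features` exactly because the permutation is known. The inverse permutation, `np.argsort(perm)`, is the kind of ordering target that a recovery objective could be trained against, under the assumptions stated above.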

Journal: Neural Networks: the official journal of the International Neural Network Society

Similar Papers

Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen ... Tung M Luu
27 Sep 2021

Foreground Object Detection in Visual Surveillance With Spatio-Temporal Fusion Network
Jae-Yeul Kim ... Jong-Eun Ha
IEEE Access | VOL. 10
01 Jan 2021

Sample Efficient Deep Reinforcement Learning With Online State Abstraction and Causal Transformer Model Prediction.
Yixing Lan ... Qiang Fang
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
01 Jan 2024

SPNet: Dual-Branch Network with Spatial Supplementary Information for Building and Water Segmentation of Remote Sensing Images
Wenyu Zhao ... Youke Zhang
Remote Sensing | VOL. 16
27 Aug 2024
