Abstract
Scaling end-to-end learning to control robots from vision inputs is a challenging problem in deep reinforcement learning (DRL). While achieving remarkable success in complex sequential tasks, vision-based DRL remains extremely data-inefficient, especially when dealing with high-dimensional pixel inputs. Many recent studies have tried to leverage state representation learning (SRL) to overcome this barrier; some of them even help the agent learn from pixels as efficiently as from states. Reproducing existing work, accurately judging the improvements offered by novel methods, and applying these approaches to new tasks are vital for sustaining this progress, yet meeting these three demands is seldom straightforward. Without meaningful criteria and tighter standardization of experimental reporting, it is difficult to determine whether improvements over previous methods are meaningful. For this reason, we conducted ablation studies on hyperparameters, embedding network architecture, embedding dimension, regularization methods, sample quality, and SRL methods to systematically compare and analyze their effects on representation learning and reinforcement learning. Three evaluation metrics are summarized, and five baseline algorithms (covering both value-based and policy-based methods) and eight tasks are adopted to avoid the particularity of any single experimental setting. Based on this wide range of experimental analyses, we highlight the variability in reported methods and suggest guidelines to make future SRL results more reproducible and stable. We aim to spur discussion about how to ensure continued progress in the field by minimizing wasted effort stemming from results that are non-reproducible and easily misinterpreted.
Highlights
Deep Reinforcement Learning is an emerging subfield of Reinforcement Learning (RL) that relies on deep neural networks as function approximators, enabling RL algorithms to work in complex environments
The performance of learnt latent feature representations is related to the selected deep reinforcement learning (DRL) algorithm
Although the ℓ1 regularization technique has a computational benefit, since features with zero coefficients can be dropped, it is found that ℓ1-norm based state representation learning (SRL) does not necessarily produce the expected positive results (a minimal sketch of such a penalty is given below)
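To make the ℓ1 highlight concrete, the following is a minimal sketch of an autoencoder-style SRL objective with an ℓ1 penalty on the latent state. The architecture, input size (3x64x64), loss weighting, and variable names are illustrative assumptions, not the paper's exact setup.

```python
# Minimal PyTorch sketch: reconstruction-based SRL with an L1 sparsity penalty.
# Shapes, coefficients, and names are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F

LATENT_DIM = 50

encoder = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, stride=2), nn.ReLU(),
    nn.Conv2d(32, 32, kernel_size=3, stride=2), nn.ReLU(),
    nn.Flatten(),
    nn.Linear(32 * 15 * 15, LATENT_DIM),   # 64x64 input -> 15x15 feature map
)
decoder = nn.Linear(LATENT_DIM, 3 * 64 * 64)

def srl_loss(obs, l1_coef=1e-3):
    """Reconstruction loss plus an L1 sparsity penalty on the latent features."""
    z = encoder(obs)                        # learned state representation
    recon = decoder(z).view_as(obs)         # reconstruct the raw observation
    recon_loss = F.mse_loss(recon, obs)
    l1_penalty = z.abs().mean()             # drives latent features toward zero
    return recon_loss + l1_coef * l1_penalty

loss = srl_loss(torch.rand(8, 3, 64, 64))   # a batch of raw pixel observations
```

The ℓ1 term encourages sparse latent codes, which is where the computational benefit comes from; whether it improves downstream control is exactly what the ablation studies examine.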
Summary
Deep Reinforcement Learning is an emerging subfield of Reinforcement Learning (RL) that relies on deep neural networks as function approximators, enabling RL algorithms to work in complex environments. Unlike classic reinforcement learning, where human-crafted representations are used, vision-based DRL has to learn features directly from raw observations in addition to policy learning; on the other hand, most RL approaches assume a fully observable state space, i.e., fully observable Markov Decision Processes (MDPs). This assumption is unworkable in real-world robotics due to factors such as limited sensor sensitivity, sensor noise, and uncertainty about whether the observation design is complete. Vision-based DRL typically suffers from slow learning and frequently requires an excessive amount of training time and data to attain the desired performance, making it unsuitable for real-world situations where data collection is difficult and expensive.
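To illustrate what "learning features in addition to policy learning" means in practice, the sketch below shares a pixel encoder between a policy head and an auxiliary reconstruction loss and updates everything jointly. The encoder/actor/decoder shapes, the auxiliary loss, and the REINFORCE-style surrogate are assumptions for illustration, not the paper's method.

```python
# Minimal PyTorch sketch: joint feature and policy learning from raw pixels.
# All architecture choices and loss weights below are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

LATENT_DIM, N_ACTIONS = 50, 4

encoder = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, stride=2), nn.ReLU(),
    nn.Conv2d(32, 32, kernel_size=3, stride=2), nn.ReLU(),
    nn.Flatten(),
    nn.Linear(32 * 15 * 15, LATENT_DIM),          # for 3x64x64 observations
)
actor = nn.Linear(LATENT_DIM, N_ACTIONS)          # policy head on the latent state
aux_decoder = nn.Linear(LATENT_DIM, 3 * 64 * 64)  # auxiliary reconstruction head

optimizer = torch.optim.Adam(
    list(encoder.parameters()) + list(actor.parameters()) + list(aux_decoder.parameters()),
    lr=3e-4,
)

obs = torch.rand(8, 3, 64, 64)                    # batch of raw pixel observations
actions = torch.randint(0, N_ACTIONS, (8,))       # placeholder actions taken in those states
returns = torch.rand(8)                           # placeholder returns for the surrogate loss

z = encoder(obs)                                  # representation learning and ...
dist = torch.distributions.Categorical(logits=actor(z))    # ... policy learning share the encoder
policy_loss = -(dist.log_prob(actions) * returns).mean()   # REINFORCE-style surrogate
recon_loss = F.mse_loss(aux_decoder(z).view_as(obs), obs)  # auxiliary SRL objective

loss = policy_loss + 0.1 * recon_loss             # joint update of encoder, actor, decoder
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Because the encoder receives gradients from both objectives, the quality of the learned representation and the choice of DRL algorithm are coupled, which is why the paper studies them together.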