Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

Fabio Pardo, Petar Kormushev, Vitaly Levdik

doi:10.1609/aaai.v34i04.5983

Abstract

Being able to reach any desired location in the environment can be a valuable asset for an agent. Learning a policy to navigate between all pairs of states individually is often not feasible. An all-goals updating algorithm uses each transition to learn Q-values towards all goals simultaneously and off-policy. However, the large number of costly parallel updates has so far limited the approach to small tabular cases. To tackle this problem, we propose to use convolutional network architectures to generate Q-values and updates for a large number of goals at once. We demonstrate the accuracy and generalization qualities of the proposed method on randomly generated mazes and Sokoban puzzles. When goals are on-screen coordinates, the resulting mapping from frames to distance-maps directly informs the agent about which places are reachable and in how many steps. As an example application, we show that replacing the random actions in ε-greedy exploration with several actions towards feasible goals generates better exploratory trajectories on Montezuma's Revenge and Super Mario All-Stars games.
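
To make the all-goals idea concrete, the following is a minimal tabular sketch, not the convolutional method the paper proposes. It applies one off-policy Q-learning update per goal from a single transition. The corridor environment, the reward of -1 per step, the undiscounted setting, and all names (`step`, `all_goals_update`, `learn_distances`) are assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

N_STATES, N_ACTIONS = 6, 2  # hypothetical corridor: action 0 = left, 1 = right


def step(state, action):
    """Deterministic corridor dynamics; walls clamp the position."""
    move = -1 if action == 0 else 1
    return min(max(state + move, 0), N_STATES - 1)


def all_goals_update(q, state, action, next_state, alpha=1.0, gamma=1.0):
    """Apply one Q-learning update for every goal from a single transition.

    q has shape (state, goal, action). Assumed reward: -1 per step, with the
    episode for goal g ending as soon as next_state == g.
    """
    goals = np.arange(q.shape[1])
    reached = goals == next_state            # per-goal termination mask
    bootstrap = q[next_state].max(axis=1)    # max over a' of Q(s', g, a'), per goal
    target = -1.0 + gamma * np.where(reached, 0.0, bootstrap)
    q[state, :, action] += alpha * (target - q[state, :, action])


def learn_distances(n_sweeps=2 * N_STATES):
    """Sweep every (state, action) pair repeatedly, updating all goals each time."""
    q = np.zeros((N_STATES, N_STATES, N_ACTIONS))
    for _ in range(n_sweeps):
        for s in range(N_STATES):
            for a in range(N_ACTIONS):
                all_goals_update(q, s, a, step(s, a))
    return -q.max(axis=2)  # estimated number of steps from each state to each goal


distance_map = learn_distances()
```

Under these assumptions, `distance_map[s, g]` for `s != g` approximates the number of steps from state `s` to goal `g`, which is the tabular analogue of the distance-maps mentioned in the abstract. The Q-table grows with the product of states, goals, and actions, which illustrates why the abstract describes the tabular approach as limited to small cases.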

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 3

Similar Papers

Semantic segmentation of bioimages using convolutional neural networks
Stiaan Wiehman ... Hendrik De Villiers
01 Jul 2016

Convolutional and Recurrent Neural Networks
Umberto Michelucci
01 Jan 2018

Research on improved convolutional wavelet neural network
Jingwei Liu ... Peixuan Li
Scientific Reports | VOL. 11
09 Sep 2021

Research on Medical Data Feature Extraction and Intelligent Recognition Technology Based on Convolutional Neural Network
Weidong Liu ... Zuen Qin
IEEE access : practical innovations, open solutions | VOL. 7
01 Jan 2019
