Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels

Wonil Song,Sangryul Jeon,Hyesong Choi,Kwanghoon Sohn,Dongbo Min

doi:10.1016/j.eswa.2023.120625

Abstract

Typically, hierarchical reinforcement learning (RL) requires skills that are applicable to various downstream tasks. Although several recent studies have proposed the supervised and unsupervised learning of such skills, the learned skills are often entangled, which hinders their interpretation. To alleviate this, we propose a novel method to use weak labels for learning disentangled skills from the continuous latent representations of trajectories. To this end, we extended a trajectory variational autoencoder (VAE) to impose an inductive bias using weak labels, which explicitly enforces the disentangling of the trajectory representations into factors of interest intended for the model to learn. Using the latent representations as skills, a skill-based policy network is trained to generate trajectories similar to the learned decoder of the trajectory VAE. Furthermore, using the disentangled skill, we propose a skill repetition that can expand the entire trajectories generated by the policy at test time, resulting in an effective planning strategy. Experiments were performed on several challenging navigation tasks in mazes, and the results demonstrate the effectiveness of our method at solving hierarchical RL problems even with a long horizon and sparse rewards.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Journal: Expert Systems With Applications	Publication Date: Jun 2, 2023
Citations: 1

Similar Papers

Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification
Chenghao Liu ... Quan Liu
IEEE/CAA Journal of Automatica Sinica | VOL. 8
Chenghao Liu, et. al.Chenghao Liu ... Quan Liu
01 Oct 2021
IEEE/CAA Journal of Automatica Sinica | VOL. 8

Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh ... Rajbala Makar
Autonomous Agents and Multi-Agent Systems | VOL. 13
Mohammad Ghavamzadeh, et. al.Mohammad Ghavamzadeh ... Rajbala Makar
04 Apr 2006
Autonomous Agents and Multi-Agent Systems | VOL. 13

Hierarchical and Non-Hierarchical Multi-Agent Interactions Based on Unity Reinforcement Learning
...
-
, et. al. ...
10 May 2020
10 May 2020

Studies on Hierarchical Reinforcement Learning in Multi-Agent Environment
Yu Lasheng ... Lin Jian
-
Yu Lasheng, et. al.Yu Lasheng ... Lin Jian
01 Apr 2008
01 Apr 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications