SCORE: Skill-Conditioned Online Reinforcement Learning

Sara Karimi, Sahar Asadi, Amir H Payberah

doi:10.1609/aiide.v20i1.31879

Abstract

Solving complex long-horizon tasks through Reinforcement Learning (RL) from scratch presents challenges related to efficient exploration. Two common approaches to reduce complexity and enhance exploration efficiency are (i) integrating learning-from-demonstration techniques with online RL, where the prior knowledge acquired from demonstrations is used to guide exploration, refine representations, or tailor reward functions, and (ii) using representation learning to facilitate state abstraction. In this study, we present Skill-Conditioned Online REinforcement Learning (SCORE), a novel approach that leverages these two strategies and utilizes skills acquired from an unstructured demonstrations dataset in a policy gradient RL algorithm. This integration enriches the algorithm with informative input representations, improving downstream task learning and exploration efficiency. We evaluate our method on long-horizon robotic and navigation tasks and game environments, demonstrating enhancements in online RL performance compared to the baselines. Furthermore, we show our approach’s generalization capabilities and analyze its effectiveness through an ablation study.


Published in: Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment


Similar Papers

A Cooperation Online Reinforcement Learning Approach in Ant-Q
Seunggwan Lee
01 Jan 2006

EXPERIMENTS WITH ONLINE REINFORCEMENT LEARNING IN REAL-TIME STRATEGY GAMES
Kresten Toftgaard Andersen ... Dung Tran
Applied Artificial Intelligence | Vol. 23
22 Oct 2009

Reinforcement Learning for Energy-Storage Systems in Grid-Connected Microgrids: An Investigation of Online vs. Offline Implementation
Khawaja Haider Ali ... Mohammad Abusara
Energies | Vol. 14
09 Sep 2021

Dual-Layer Q-Learning Strategy for Energy Management of Battery Storage in Grid-Connected Microgrids
Khawaja Haider Ali ... Mohammad Abusara
Energies | Vol. 16
27 Jan 2023
