C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

Vieri Giuliano Santucci,Gianluca Baldassarre,Davide Montella

doi:10.1109/tcds.2022.3152081

Vieri Giuliano Santucci, Gianluca Baldassarre + Show 1 more

Open Access

https://doi.org/10.1109/tcds.2022.3152081

Copy DOI

Abstract

When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.

Highlights

I N recent years the development of autonomous agents able to choose their own tasks solely on the basis of their interaction with the environment and of the motivations generated by this interaction, has gained increasing interest within artificial intelligence, robotics, and machine learning
Despite the simplicity of the experiments and the limitations of the specific implementation of C-GRAIL, the results show that the approach and the mechanisms introduced in this new architecture constitute a viable proposal for dealing with this type of problem
The case where the same goal might need different policies given different environmental conditions. This is common in realworld scenarios, where achieving the same goal may require a robot to use different strategies according to the context: for example, certain objects might be obstacles impairing the reaching of certain targets, the agent should use different policies to reach for the same location in different contexts

Summary

INTRODUCTION

I N recent years the development of autonomous agents able to choose their own tasks solely on the basis of their interaction with the environment and of the motivations generated by this interaction, has gained increasing interest within artificial intelligence, robotics, and machine learning. The paper is structured as follows: Sec. II introduces the general problem of autonomous, open-ended learning of multiple goals (Sec. II-A), the specific problem of autonomous goal learning under varying contexts (Sec. II-B), and our proposed solution (Sec. II-C); Sec. III describes C-GRAIL general functioning (Sec. III-A) and its mechanisms and components (Sec. III-B), while specific implementation details can be found in the Appendix; Sec. IV describes the experimental scenario and the compared systems, while results are presented in Sec. V; in Sec. VI we draw conclusions, highlight limitations, and describe possible future extensions of this work

Autonomous learning of multiple goals

Introducing context dependency

Proposed solution

General functioning

Implementation

Environment and task

Compared systems

RESULTS

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE transactions on autonomous mental development	Publication Date: Mar 1, 2023
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on autonomous mental development

Lead the way for us

Similar Papers

Autonomous Representation Learning in a Developing Agent
Jonathan Mugan ... Benjamin Kuipers
-
Jonathan Mugan, et. al.Jonathan Mugan ... Benjamin Kuipers
01 Jan 2013
01 Jan 2013

Unsupervised Visuomotor Control through Distributional Planning Networks
Tianhe Yu ... Gleb Shevchuk
-
Tianhe Yu, et. al.Tianhe Yu ... Gleb Shevchuk
22 Jun 2019
22 Jun 2019

Analysing autonomous open-ended learning of skills with different interdependent subgoals in robots
Alejandro Romero ... Vieri Giuliano Santucci
-
Alejandro Romero, et. al.Alejandro Romero ... Vieri Giuliano Santucci
06 Dec 2021
06 Dec 2021

Autonomous Reinforcement Learning of Multiple Interrelated Tasks
Vieri Giuliano Santucci ... Emilio Cartoni
-
Vieri Giuliano Santucci, et. al.Vieri Giuliano Santucci ... Emilio Cartoni
01 Aug 2019
01 Aug 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on autonomous mental development