Abstract

When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.

Highlights

  • I N recent years the development of autonomous agents able to choose their own tasks solely on the basis of their interaction with the environment and of the motivations generated by this interaction, has gained increasing interest within artificial intelligence, robotics, and machine learning

  • Despite the simplicity of the experiments and the limitations of the specific implementation of C-GRAIL, the results show that the approach and the mechanisms introduced in this new architecture constitute a viable proposal for dealing with this type of problem

  • The case where the same goal might need different policies given different environmental conditions. This is common in realworld scenarios, where achieving the same goal may require a robot to use different strategies according to the context: for example, certain objects might be obstacles impairing the reaching of certain targets, the agent should use different policies to reach for the same location in different contexts

Read more

Summary

INTRODUCTION

I N recent years the development of autonomous agents able to choose their own tasks solely on the basis of their interaction with the environment and of the motivations generated by this interaction, has gained increasing interest within artificial intelligence, robotics, and machine learning. The paper is structured as follows: Sec. II introduces the general problem of autonomous, open-ended learning of multiple goals (Sec. II-A), the specific problem of autonomous goal learning under varying contexts (Sec. II-B), and our proposed solution (Sec. II-C); Sec. III describes C-GRAIL general functioning (Sec. III-A) and its mechanisms and components (Sec. III-B), while specific implementation details can be found in the Appendix; Sec. IV describes the experimental scenario and the compared systems, while results are presented in Sec. V; in Sec. VI we draw conclusions, highlight limitations, and describe possible future extensions of this work

Autonomous learning of multiple goals
Introducing context dependency
Proposed solution
General functioning
Implementation
Environment and task
Compared systems
RESULTS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call