Abstract

We propose an architecture for the open-ended learning and control of embodied agents. The architecture learns action affordances and forward models based on intrinsic motivations and can later use the acquired knowledge to solve extrinsic tasks by decomposing them into sub-tasks, each solved with one-step planning. An affordance is here operationalized as the agent's estimate of the probability of success of an action performed on a given object. The focus of the work is on the overall architecture, while single sensorimotor components are simplified. A key element of the architecture is the use of "active vision", which plays two functions: focusing on single objects, and factorizing visual information into object appearance and object position. These processes serve both the acquisition and use of object-related affordances, and the decomposition of extrinsic goals (tasks) into multiple sub-goals (sub-tasks). The architecture makes novel contributions to three problems: (a) the learning of affordances based on intrinsic motivations; (b) the use of active vision to decompose complex extrinsic tasks; (c) the possible role of affordances within planning systems endowed with models of the world. The architecture is tested in a simulated, stylized 2D scenario in which objects need to be moved or "manipulated" in order to accomplish new desired overall configurations of the objects (extrinsic goals). The results show the utility of intrinsic motivations to support affordance learning, the utility of active vision to solve composite tasks, and the possible utility of affordances within planning systems.
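The operationalization of an affordance as an estimated probability of action success can be illustrated with a minimal sketch. The class below maintains running success counts per (object, action) pair; all names (`AffordancePredictor`, `"red_ball"`, `"move"`, `"change_color"`) and the uninformed 0.5 prior are illustrative assumptions, not details from the paper, which uses learned predictors rather than counts.

```python
import random
from collections import defaultdict

class AffordancePredictor:
    """Estimates P(success | object, action) from experience counts.
    Object and action names are illustrative placeholders."""

    def __init__(self):
        self.successes = defaultdict(int)
        self.trials = defaultdict(int)

    def update(self, obj, action, succeeded):
        # Record one intrinsic-phase interaction with an object.
        key = (obj, action)
        self.trials[key] += 1
        self.successes[key] += int(succeeded)

    def affordance(self, obj, action):
        # Return the estimated probability that the action succeeds.
        key = (obj, action)
        if self.trials[key] == 0:
            return 0.5  # uninformed prior for unseen pairs
        return self.successes[key] / self.trials[key]

# Intrinsic phase: free exploration updates the estimates.
pred = AffordancePredictor()
for _ in range(100):
    pred.update("red_ball", "move", succeeded=True)
for _ in range(100):
    pred.update("red_ball", "change_color", succeeded=random.random() < 0.2)

print(pred.affordance("red_ball", "move"))  # 1.0
```

In the extrinsic phase such estimates could be used to gate planning, i.e., only sub-tasks whose affordance estimate is high are attempted.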

Highlights

  • The architecture has been developed within an open-ended learning context

  • This work proposes an architecture for the control and learning of embodied agents

  • Performance in the intrinsic phase was measured by evaluating the quality of the output of the predictors when receiving as input each one of the nine focused images corresponding to the nine possible objects (Figure 1)


Introduction

The architecture has been developed within an open-ended learning context. The general structure of the scenario involves two phases (Baldassarre, 2011; Seepanomwan et al., 2017): (a) a first intrinsic motivation phase, where the agent is not given any task and freely explores the environment to autonomously acquire as much general-purpose knowledge as possible; (b) a second extrinsic motivation phase, where the agent has to solve one or more tasks assigned externally within the same environment (extrinsic tasks). In the intrinsic phase of the specific scenario used here, the agent can perceive objects and explore and learn the effects of certain pre-wired actions (e.g., "move in space" or "change object color"). In the extrinsic phase, the agent is required to use the knowledge acquired in the intrinsic phase to solve extrinsic tasks: first, the agent memorizes the state of some objects set in a certain configuration (the goal; this is a convenient way to let the agent store the goal in a format suitable for its processes); then the objects are "shuffled" into a different state (the "initial state"); finally, the agent has to bring the objects back to the goal state.
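The memorize-shuffle-restore structure of the extrinsic phase can be sketched as follows. Each mismatched object yields one sub-task solved with a one-step plan; the dictionary state representation, the `solve_extrinsic_task` and `apply_action` functions, and the object names are illustrative assumptions, not the paper's actual mechanisms (which rely on active vision and learned forward models).

```python
# Hedged sketch: decompose an extrinsic goal into per-object sub-tasks,
# each solved by a single "move" action (one-step planning).

def solve_extrinsic_task(goal_state, current_state, apply_action):
    """goal_state / current_state: dict mapping object -> position."""
    plan = []
    for obj, goal_pos in goal_state.items():
        if current_state.get(obj) != goal_pos:
            # One sub-task: bring this object back to its goal position.
            plan.append(("move", obj, goal_pos))
    for action, obj, target in plan:
        current_state = apply_action(current_state, action, obj, target)
    return current_state

def apply_action(state, action, obj, target):
    # Toy world model: "move" relocates the object deterministically.
    new_state = dict(state)
    if action == "move":
        new_state[obj] = target
    return new_state

goal = {"cube": (0, 0), "ball": (3, 2)}       # memorized goal configuration
shuffled = {"cube": (5, 5), "ball": (3, 2)}   # state after shuffling
final = solve_extrinsic_task(goal, shuffled, apply_action)
print(final == goal)  # True
```

Note that only the mismatched object (`cube`) generates a sub-task; objects already in their goal state are skipped, which is what makes the decomposition into independent one-step plans possible in this toy setting.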
