Abstract

Reinforcement learning (RL) aims to build a policy that maximizes a task-related reward within a given domain. When the domain is known, i.e., when its states, actions, and rewards are defined, Markov Decision Processes (MDPs) provide a convenient theoretical framework to formalize RL. But in an open-ended learning process, an agent or robot must solve an unbounded sequence of tasks that are not known in advance, and the corresponding MDPs cannot be built at design time. This defines the main challenge of open-ended learning: how can the agent learn to behave appropriately when adequate state, action, and reward representations are not given? In this paper, we propose a conceptual framework to address this question. We assume an agent endowed with low-level perception and action capabilities. This agent receives an external reward when it faces a task. It must discover the state and action representations that will let it cast the tasks as MDPs in order to solve them by RL. The relevance of the state and action representations is critical to how efficiently the agent learns. Considering that the agent starts with low-level, task-agnostic state and action spaces grounded in its perception and action capabilities, we describe open-ended learning as the challenge of building adequate representations of states and actions, i.e., of redescribing the available representations. We suggest an iterative approach to this problem based on successive representational redescription processes, and highlight the corresponding challenges, in which intrinsic motivations play a key role.
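
For reference, the MDP formalization invoked above can be stated in standard notation (a conventional textbook definition, not quoted from the paper):

```latex
% Standard MDP definition (conventional notation, not from the paper):
% a task is cast as a tuple
\[
  \mathcal{M} = \langle S, A, T, R, \gamma \rangle,
\]
% where $S$ is the state space, $A$ the action space,
% $T(s' \mid s, a)$ the transition function, $R(s, a)$ the reward
% function, and $\gamma \in [0, 1)$ the discount factor. Solving the
% task by RL means finding a policy $\pi : S \to A$ maximizing the
% expected discounted return:
\[
  \pi^{*} = \arg\max_{\pi} \; \mathbb{E}\!\left[\sum_{t=0}^{\infty} \gamma^{t}\, R(s_t, \pi(s_t))\right].
\]
```

In the open-ended setting described above, the agent must construct $S$ and $A$ itself before any of this machinery applies.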

Highlights

  • Robots need world representations in terms of objects, actions, plans, etc.

  • We assume the agent is endowed with reinforcement learning (RL) capabilities efficient enough to let it learn to solve a task when cast as a Markov Decision Process (MDP)

  • To take a step toward open-ended learning, we propose a conceptual framework for representational redescription processes based on a formal definition of states and actions

Summary

INTRODUCTION

Robots need world representations in terms of objects, actions, plans, etc. Currently, such representations are carefully designed and adapted to the robot’s task (Kober et al., 2013). Representational redescription is the ability to discover new representations based on existing ones. It is a key ability of human intelligence (Karmiloff-Smith, 1995) that remains a challenge in robotics (Oudeyer, 2015). We assume the agent is endowed with reinforcement learning (RL) capabilities efficient enough to let it learn to solve a task once it is cast as a Markov Decision Process (MDP). From these assumptions, the main challenge in our framework is determining how an agent can discover the state and action representations that let it cast tasks as MDPs before solving them by RL (Zimmer and Doncieux, 2018). We highlight the challenges this raises, notably in terms of intrinsic motivations.
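
To make the assumed pipeline concrete, here is a minimal Python sketch (all names, including `redescribe`, `env`, and `q_learning`, are hypothetical and not from the paper): a task-agnostic low-level observation is redescribed into a compact discrete state, and the assumed RL capability is instantiated as tabular Q-learning on the resulting MDP.

```python
# Minimal sketch (hypothetical names): tabular Q-learning on a task cast
# as an MDP through a state redescription. Here redescribe() is a fixed
# discretization; in the paper's setting, building such a mapping is
# itself the object of open-ended learning.
import random
from collections import defaultdict

def redescribe(observation):
    # Hypothetical redescription: compress a low-level continuous
    # observation (assumed to be a sequence of values in [0, 1])
    # into a small discrete state usable by tabular RL.
    return tuple(int(x * 10) for x in observation)

def q_learning(env, actions, episodes=500, alpha=0.1, gamma=0.99, eps=0.1):
    # Standard tabular Q-learning; `env` is assumed to expose reset(),
    # returning a low-level observation, and step(a), returning
    # (observation, reward, done), as in common RL interfaces.
    Q = defaultdict(float)
    for _ in range(episodes):
        s, done = redescribe(env.reset()), False
        while not done:
            # Epsilon-greedy action selection over the given action set.
            if random.random() < eps:
                a = random.choice(actions)
            else:
                a = max(actions, key=lambda act: Q[(s, act)])
            obs, r, done = env.step(a)
            s2 = redescribe(obs)
            # One-step temporal-difference update.
            best_next = max(Q[(s2, a2)] for a2 in actions)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
    return Q
```

The quality of `redescribe()` determines whether the induced state space is Markovian and compact enough for the learner to succeed, which is exactly the sense in which the relevance of the representation is critical.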

THE REPRESENTATIONAL REDESCRIPTION APPROACH
  • Markov Decision Processes
  • States
  • Reward Functions and Goals
  • Actions
  • Representational Redescription
  • Motor Skills
  • Open-Ended Learning
  • Intrinsic Motivations
CHALLENGES
DISCUSSION