Abstract

State-space and action representations form the building blocks of decision-making processes in the brain: states map external cues onto the agent's current situation, whereas actions provide the set of motor commands from which the agent can choose to achieve specific goals. Although these representations differ across environments, it is currently unknown whether, or how accurately, they are acquired by the agent, because previous experiments have typically provided this information a priori through instruction or pre-training. Here we studied how state and action representations adapt to reflect the structure of the world when such a priori knowledge is not available. We trained rats on a sequential decision-making task in which they were required to pass through multiple states before reaching the goal, and in which the number of states and their mapping onto external cues were unknown a priori. We found that, early in training, animals selected actions as if the task were not sequential and outcomes were the immediate consequence of the most proximal action. Over the course of training, however, rats recovered the true structure of the environment and made decisions based on the expanded state-space, reflecting the multiple stages of the task. Similarly, the set of actions expanded with training, although the emergence of new action sequences was sensitive to the experimental parameters and the specifics of the training procedure. We conclude that the profile of choices shows a gradual shift from simple representations to more complex structures compatible with the structure of the world.

Highlights

  • In sequential decision-making tasks, an agent makes a series of choices and passes through several states before earning rewards

  • Rats trained on a two-stage decision-making task gradually expanded their state and action representations

  • The rats received training on a two-stage decision-making task: they first made a binary choice at stage 1 (S0), after which they transitioned to one of the stage-2 states, where they made another binary choice that could lead to either reward delivery or no reward (Fig 2a); a schematic sketch of this task structure is given below
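
To make the trial structure concrete, here is a minimal sketch of such a two-stage task in Python. The state labels, the deterministic stage-1 transitions, and the reward probabilities are illustrative assumptions; the paper's actual transition and reward parameters are not reproduced here.

    import random

    # Hypothetical stage-2 state labels and reward probabilities,
    # chosen for illustration only (not the paper's parameters).
    STAGE2_STATES = ["S1", "S2"]
    REWARD_PROB = {("S1", 0): 0.8, ("S1", 1): 0.2,
                   ("S2", 0): 0.2, ("S2", 1): 0.8}

    def run_trial(choose):
        """One trial: binary choice at S0, transition, binary choice at stage 2."""
        a0 = choose("S0")                 # first binary choice (0 or 1)
        s1 = STAGE2_STATES[a0]            # assumed deterministic transition
        a1 = choose(s1)                   # second binary choice
        reward = 1 if random.random() < REWARD_PROB[(s1, a1)] else 0
        return s1, a1, reward

For example, run_trial(lambda s: random.choice([0, 1])) simulates one trial under a random policy.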

Introduction

In sequential decision-making tasks, an agent makes a series of choices and passes through several states before earning rewards. Learning the state-space of the task is crucial in allowing the agent to navigate the environment, and it provides the building blocks for various forms of reinforcement-learning algorithms in the brain [1, 2]. This process involves considering the different events and cues that occur after each action, and integrating them in order to recover how many states the task has and how they relate to external cues. At present, there is no direct evidence for such adaptive state-space representations in decision-making situations.
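
To illustrate why the choice of state-space matters, the following sketch contrasts tabular Q-learning under two candidate representations: a collapsed mapping that treats every cue as the same state (the non-sequential view described above) and an expanded mapping that keeps the stages distinct. The function names, learning rate, and discount factor are illustrative assumptions, not a model from the paper.

    from collections import defaultdict

    ALPHA, GAMMA = 0.1, 0.9  # assumed learning rate and discount factor

    def make_agent(state_of):
        """Tabular Q-learner whose state-space is defined by `state_of`."""
        Q = defaultdict(float)
        def update(s, a, r, s_next, next_actions):
            s, s_next = state_of(s), state_of(s_next)
            best_next = max((Q[(s_next, a2)] for a2 in next_actions), default=0.0)
            Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])
        return Q, update

    # Collapsed view: every cue maps to one state, so rewards are credited
    # to whichever action was taken last, as if the task were not sequential.
    collapsed_Q, collapsed_update = make_agent(lambda s: "S0")

    # Expanded view: stages stay distinct, so stage-2 value propagates back
    # to the stage-1 choice through the discounted best_next term.
    expanded_Q, expanded_update = make_agent(lambda s: s)

The two agents receive identical experience but assign credit differently: only the expanded representation lets the value of a rewarded stage-2 choice shape the preceding stage-1 decision.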
