The Computational Development of Reinforcement Learning during Adolescence.

Stefano Palminteri, Emma J Kilford, Giorgio Coricelli, Sarah-Jayne Blakemore

doi:10.1371/journal.pcbi.1004953

Abstract

Adolescence is a period of life characterised by changes in learning and decision-making. Learning and decision-making do not rely on a unitary system, but instead require the coordination of different cognitive processes that can be mathematically formalised as dissociable computational modules. Here, we aimed to trace the developmental time-course of the computational modules responsible for learning from reward or punishment, and learning from counterfactual feedback. Adolescents and adults carried out a novel reinforcement learning paradigm in which participants learned the association between cues and probabilistic outcomes, where the outcomes differed in valence (reward versus punishment) and feedback was either partial or complete (either the outcome of the chosen option only, or the outcomes of both the chosen and unchosen option, were displayed). Computational strategies changed during development: whereas adolescents’ behaviour was better explained by a basic reinforcement learning algorithm, adults’ behaviour integrated increasingly complex computational features, namely a counterfactual learning module (enabling enhanced performance in the presence of complete feedback) and a value contextualisation module (enabling symmetrical reward and punishment learning). Unlike adults, adolescent performance did not benefit from counterfactual (complete) feedback. In addition, while adults learned symmetrically from both reward and punishment, adolescents learned from reward but were less likely to learn from punishment. This tendency to rely on rewards and not to consider alternative consequences of actions might contribute to our understanding of decision-making in adolescence.
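The task structure described in the abstract (two valence conditions crossed with partial or complete feedback) can be illustrated with a minimal sketch. The function name `sample_outcomes`, the outcome probability `p_good=0.75`, the unit outcome magnitudes, and the convention that option 0 is the better option are all assumptions made for illustration; the abstract does not specify them.

```python
import random


def sample_outcomes(valence, feedback, chosen, rng, p_good=0.75):
    """Sample one trial of a two-option probabilistic task.

    Returns (chosen_outcome, unchosen_outcome). The unchosen outcome is
    None under partial feedback, because it is not shown to the learner.
    All numeric values here are illustrative assumptions.
    """
    if valence == "reward":
        magnitude = 1.0
        # Assumed: option 0 gains more often than option 1.
        p_event = [p_good, 1.0 - p_good]
    elif valence == "punishment":
        magnitude = -1.0
        # Assumed: option 0 loses less often than option 1.
        p_event = [1.0 - p_good, p_good]
    else:
        raise ValueError("valence must be 'reward' or 'punishment'")
    if feedback not in ("partial", "complete"):
        raise ValueError("feedback must be 'partial' or 'complete'")

    # Draw an outcome for both options; only some of it may be displayed.
    outcomes = [magnitude if rng.random() < p else 0.0 for p in p_event]
    unchosen = outcomes[1 - chosen] if feedback == "complete" else None
    return outcomes[chosen], unchosen


rng = random.Random(0)
for valence in ("reward", "punishment"):
    for feedback in ("partial", "complete"):
        print(valence, feedback, sample_outcomes(valence, feedback, 0, rng))
```

Crossing the two factors gives four choice contexts, which is what allows reward versus punishment learning and factual versus counterfactual learning to be compared separately.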

Highlights

Adolescence is defined as the period of life that starts with the biological changes of puberty and ends with the individual attainment of a stable, independent role in society [1].
Whereas simple reward learning has been largely and robustly associated with the striatum [17,18,19], punishment and counterfactual processing have been consistently associated with the dorsal prefrontal system and the insula, areas that are classically associated with cognitive control [13,20,21,22,23].
The model includes a factual learning module (Q-learning), which updates the value of the chosen option; a counterfactual learning module, which updates the value of the unchosen option; and a contextual learning module, which learns the average value of the choice context and uses this to move from an absolute to a relative encoding of option value.
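The three modules named in the last highlight can be sketched as delta-rule updates. This is a minimal illustration under stated assumptions, not the authors' fitted model: the class name `RelativeLearner`, the learning-rate values, and the exact update rules (in particular, using the mean of the displayed outcomes as the target for the context value) are hypothetical, since this page does not give the equations.

```python
from dataclasses import dataclass, field


@dataclass
class RelativeLearner:
    """Two-option learner for one choice context (illustrative sketch)."""

    alpha_factual: float = 0.3         # learning rate, chosen option
    alpha_counterfactual: float = 0.3  # learning rate, unchosen option
    alpha_context: float = 0.3         # learning rate, context value
    q: list = field(default_factory=lambda: [0.0, 0.0])
    context_value: float = 0.0

    def update(self, chosen, outcome_chosen, outcome_unchosen=None):
        """Apply one trial; outcome_unchosen is None under partial feedback."""
        unchosen = 1 - chosen

        # Contextual module: track the average outcome of this context.
        # Assumption: the target is the mean of whatever outcomes were shown.
        if outcome_unchosen is None:
            target = outcome_chosen
        else:
            target = 0.5 * (outcome_chosen + outcome_unchosen)
        self.context_value += self.alpha_context * (target - self.context_value)

        # Factual module (Q-learning): update the chosen option, with the
        # outcome expressed relative to the context value.
        relative_chosen = outcome_chosen - self.context_value
        self.q[chosen] += self.alpha_factual * (relative_chosen - self.q[chosen])

        # Counterfactual module: update the unchosen option when its
        # outcome was displayed (complete feedback only).
        if outcome_unchosen is not None:
            relative_unchosen = outcome_unchosen - self.context_value
            self.q[unchosen] += self.alpha_counterfactual * (
                relative_unchosen - self.q[unchosen]
            )


learner = RelativeLearner()
learner.update(chosen=0, outcome_chosen=1.0, outcome_unchosen=0.0)
print(learner.q, learner.context_value)
```

In this sketch, setting `alpha_counterfactual` and `alpha_context` to zero switches off the two extra modules and leaves a basic Q-learning rule on absolute outcomes, which makes the nesting of the simple and the more complex strategies explicit.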

Summary

Introduction

Adolescence is defined as the period of life that starts with the biological changes of puberty and ends with the individual attainment of a stable, independent role in society [1]. During this period, significant changes in value-based decision-making are observed [2]. Theories of adolescent brain development have pointed to differential functional and anatomical development of limbic regions, such as the striatum, and cognitive control regions, and there is some evidence to support this notion [1,2,6,24,25,26]. We hypothesise that this asymmetrical development might be translated into a difference in the computational strategies used by adolescents compared with adults. Differences in reinforcement learning strategies may in turn contribute to an explanation of features of adolescent value-directed behaviour.



Journal: PLOS Computational Biology	Publication Date: Jun 20, 2016
Citations: 107	License type: CC BY 4.0


