Accounting for negative automaintenance in pigeons: a dual learning systems approach and factored representations.

Florian Lesaint,Olivier Sigaud,Mehdi Khamassi

doi:10.1371/journal.pone.0111050

Abstract

Animals, including Humans, are prone to develop persistent maladaptive and suboptimal behaviours. Some of these behaviours have been suggested to arise from interactions between brain systems of Pavlovian conditioning, the acquisition of responses to initially neutral stimuli previously paired with rewards, and instrumental conditioning, the acquisition of active behaviours leading to rewards. However the mechanics of these systems and their interactions are still unclear. While extensively studied independently, few models have been developed to account for these interactions. On some experiment, pigeons have been observed to display a maladaptive behaviour that some suggest to involve conflicts between Pavlovian and instrumental conditioning. In a procedure referred as negative automaintenance, a key light is paired with the subsequent delivery of food, however any peck towards the key light results in the omission of the reward. Studies showed that in such procedure some pigeons persisted in pecking to a substantial level despite its negative consequence, while others learned to refrain from pecking and maximized their cumulative rewards. Furthermore, the pigeons that were unable to refrain from pecking could nevertheless shift their pecks towards a harmless alternative key light. We confronted a computational model that combines dual-learning systems and factored representations, recently developed to account for sign-tracking and goal-tracking behaviours in rats, to these negative automaintenance experimental data. We show that it can explain the variability of the observed behaviours and the capacity of alternative key lights to distract pigeons from their detrimental behaviours. These results confirm the proposed model as an interesting tool to reproduce experiments that could involve interactions between Pavlovian and instrumental conditioning. The model allows us to draw predictions that may be experimentally verified, which could help further investigate the neural mechanisms underlying theses interactions.

Highlights

Persistent maladaptive and suboptimal behaviours are commonly observed in animals, including Humans, and supposed to result from possible constraints solved by the interaction of neural mechanisms not clearly identified yet
Classical negative automaintenance The central phenomenon that we intend to replicate with the present computational model is the greater or lesser persistence in pigeons to peck a key light that, while predictive of reward delivery, leads to its omission in case of contact
This model provides a plausible explanation, maybe partial, for the conflictual observations between the studies of Williams and Williams [8] and Sanabria et al [19]. It suggests that negative automaintenance arises from the competition of two reinforcement learning systems, one of which relies on factored representations to use values over features rather than states

Summary

Introduction

Persistent maladaptive and suboptimal behaviours are commonly observed in animals, including Humans, and supposed to result from possible constraints (e.g. energy versus efficiency tradeoff) solved by the interaction of neural mechanisms not clearly identified yet. Breland and Breland [1] studied animals that learned to retrieve rewards given some action (e.g. drop an object to get food). Guitart-Masip et al [3] showed that many humans have difficulties to learn to withhold from acting to get rewarded in a go/no-go task. These maladaptive behaviours have been suggested to arise from the interactions between multiple decision systems in the brain [4,5,6,7], namely Pavlovian and instrumental systems

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PloS one	Publication Date: Oct 27, 2014
Citations: 64	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Accounting for negative automaintenance in pigeons: a dual learning systems approach and factored representations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Similar Papers

Modelling individual differences in the form of Pavlovian conditioned approach responses: a dual learning systems approach with factored representations.
Florian Lesaint ... Shelly B Flagel
PLoS computational biology | VOL. 10
Florian Lesaint, et. al.Florian Lesaint ... Shelly B Flagel
13 Feb 2014
PLoS computational biology | VOL. 10

A comparison of neuronal reactions during classical and instrumental conditioning under similar conditions
Lev Tsitolovsky ... Alexander Shvedov
Neurobiology of Learning and Memory | VOL. 81
Lev Tsitolovsky, et. al.Lev Tsitolovsky ... Alexander Shvedov
02 Oct 2003
Neurobiology of Learning and Memory | VOL. 81

Review of Learning and behavior: A contemporary synthesis.
Darlene Skinner
Canadian Psychology / Psychologie canadienne | VOL. 48
Darlene SkinnerDarlene Skinner
01 Nov 2007
Canadian Psychology / Psychologie canadienne | VOL. 48

Chapter 15 - Comparison of Operant and Classical Conditioning of Feeding Behavior in Aplysia
Riccardo Mozzachiodi ... Douglas A Baxter
Handbook of Behavioral Neuroscience | VOL. 22
Riccardo Mozzachiodi, et. al.Riccardo Mozzachiodi ... Douglas A Baxter
01 Jan 2013
Handbook of Behavioral Neuroscience | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accounting for negative automaintenance in pigeons: a dual learning systems approach and factored representations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one