Abstract

Extended training can induce a shift in behavioral control from goal-directed actions, which are governed by action-outcome contingencies and sensitive to changes in the expected value of the outcome, to habits which are less dependent on action-outcome relations and insensitive to changes in outcome value. Previous studies in rats have shown that interval schedules of reinforcement favor habit formation while ratio schedules favor goal-directed behavior. However, the molecular mechanisms underlying habit formation are not well understood. Endocannabinoids, which can function as retrograde messengers acting through presynaptic CB1 receptors, are highly expressed in the dorsolateral striatum, a key region involved in habit formation. Using a reversible devaluation paradigm, we confirmed that in mice random interval schedules also favor habit formation compared with random ratio schedules. We also found that training with interval schedules resulted in a preference for exploration of a novel lever, whereas training with ratio schedules resulted in less generalization and more exploitation of the reinforced lever. Furthermore, mice carrying either a heterozygous or a homozygous null mutation of the cannabinoid receptor type I (CB1) showed reduced habit formation and enhanced exploitation. The impaired habit formation in CB1 mutant mice cannot be attributed to chronic developmental or behavioral abnormalities because pharmacological blockade of CB1 receptors specifically during training also impairs habit formation. Taken together our data suggest that endocannabinoid signaling is critical for habit formation.

Highlights

  • We can learn to perform particular actions to obtain specific outcomes in our environments through a process of trial and error

  • The rate of reinforcements per lever press changed with training (F6,132 = 3716.66, p < 0.001), and there was a significant difference between the ratio and interval groups (F1,22 = 10.7, p = 0.003; post hoc analyses show a difference between schedules in training days 4 and 5), there was no interaction between training and schedule of reinforcement (F4,19 = 2.80, p = 0.06) (Figure 1D)

  • In this study, using genetic and pharmacological tools in mice we showed that endocannabinoid signaling through cannabinoid receptors type 1 (CB1) receptors is critical for habit formation

Read more

Summary

Introduction

We can learn to perform particular actions to obtain specific outcomes in our environments through a process of trial and error These actions are goal-directed, and their performance is highly sensitive to changes in the incentive value of the outcome, and to changes in the contingency between the action and the outcome. Previous studies in rats have shown that extensive training on an instrumental task where animals lever press for particular food reinforcements can lead to a shift from goal-directed responding, which is sensitive to changes in the value of the outcome, to habitual responding which is insensitive to outcome devaluation and can be elicited by antecedent stimuli (Adams, 1982; Adams and Dickinson, 1981b). Shifts from goal-directed to habitual responding can be produced by extended training, and by different schedules of reinforcement, with random interval schedules favoring the formation of habits compared with random ratio schedules (Adams and Dickinson, 1981b; Dickinson, 1985; Dickinson et al, 1983).

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call