Abstract

SummaryDeciding between stimuli requires combining their learned value with one’s sensory confidence. We trained mice in a visual task that probes this combination. Mouse choices reflected not only present confidence and past rewards but also past confidence. Their behavior conformed to a model that combines signal detection with reinforcement learning. In the model, the predicted value of the chosen option is the product of sensory confidence and learned value. We found precise correlates of this variable in the pre-outcome activity of midbrain dopamine neurons and of medial prefrontal cortical neurons. However, only the latter played a causal role: inactivating medial prefrontal cortex before outcome strengthened learning from the outcome. Dopamine neurons played a causal role only after outcome, when they encoded reward prediction errors graded by confidence, influencing subsequent choices. These results reveal neural signals that combine reward value with sensory confidence and guide subsequent learning.

Highlights

  • Making decisions often requires combining present sensory evidence with previous reward values and learning from the resulting outcome

  • Behavioral Signatures of Learning Guided by Sensory Confidence and Reward Value To study decisions guided by sensory signals and reward values, we developed a task for head-fixed mice (Figures 1A– 1C)

  • Learning Depends on Prediction Error, but Not Predicted Value, Signaled by Dopamine Neurons Having observed that dopamine signals encode predicted value QC before outcome and prediction error d after outcome, we investigated their impact on choices (Figure 5)

Read more

Summary

Introduction

Making decisions often requires combining present sensory evidence with previous reward values and learning from the resulting outcome. It is not known, how the brain performs these computations. Animals and humans efficiently combine these computations (Fan et al, 2018; Feng et al., 2009; Hirokawa et al, 2017; Whiteley and Sahani, 2008). It is not known what neuronal signals underlie this combination

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call