Abstract

A large body of evidence has indicated that the phasic responses of midbrain dopamine neurons show a remarkable similarity to a type of teaching signal (temporal difference (TD) error) used in machine learning. However, previous studies failed to observe a key prediction of this algorithm: that when an agent associates a cue and a reward that are separated in time, the timing of dopamine signals should gradually move backward in time from the time of the reward to the time of the cue over multiple trials. Here we demonstrate that such a gradual shift occurs both at the level of dopaminergic cellular activity and dopamine release in the ventral striatum in mice. Our results establish a long-sought link between dopaminergic activity and the TD learning algorithm, providing fundamental insights into how the brain associates cues and rewards that are separated in time.
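To make the tested prediction concrete, below is a minimal sketch of standard tabular TD(0) learning on a cue-delay-reward trial. It is not the authors' model or data; the delay length, learning rate, discount factor and trial counts are illustrative assumptions. It shows how the TD error is concentrated at the time of reward in early trials and shifts backward toward the time of the cue as learning proceeds.

```python
import numpy as np

# Minimal tabular TD(0) sketch of a cue -> delay -> reward trial.
# All parameter values are illustrative assumptions, not taken from the study.
D = 5            # number of time steps between cue and reward (assumed)
alpha = 0.1      # learning rate (assumed)
gamma = 0.95     # discount factor (assumed)
n_trials = 500

# States: 0 = pre-cue state (value held at 0, modelling an unpredictable cue
# time), 1..D = post-cue delay states, D+1 = terminal state.
V = np.zeros(D + 2)
rewards = np.zeros(D + 1)
rewards[D] = 1.0             # reward delivered on the final transition

for trial in range(n_trials):
    delta = np.zeros(D + 1)
    for t in range(D + 1):
        # TD error for the transition from state t to state t+1:
        # delta_t = r_t + gamma * V(t+1) - V(t)
        delta[t] = rewards[t] + gamma * V[t + 1] - V[t]
        if t > 0:            # keep the pre-cue state's value clamped at 0
            V[t] += alpha * delta[t]
    if trial in (0, 10, 25, 50, 100, n_trials - 1):
        # Step 0 is the cue transition, step D is the reward transition.
        print(f"trial {trial:3d}: TD errors per step = {np.round(delta, 2)}")
```

In early trials the printed error vector peaks at the reward step; over trials the peak travels backward and eventually sits at the cue step, which is the gradual temporal shift the recordings were designed to detect.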
