Statistical mechanics of structural and temporal credit assignment effects on learning in neural networks

Hiroshi Saito,Kazuo Okanoya,Masato Okada,Kentaro Katahira

doi:10.1103/physreve.83.051125

https://doi.org/10.1103/physreve.83.051125


Abstract

Neural networks can learn flexible input-output associations by changing their synaptic weights. Their representational performance and learning dynamics have been studied intensively in several fields. Neural networks face the "credit assignment problem" when only incomplete performance evaluations are available: the network must assign credit or blame to its behaviors according to their contribution to network performance. In reinforcement learning, only a scalar evaluation signal is delivered to the network. The two main types of credit assignment problem in reinforcement learning are structural and temporal, that is, determining which parameters of the network (structural) and which past network activities (temporal) are related to an evaluation signal given by the environment. In this study, we apply statistical mechanical analysis to the learning process in a simple neural network model to clarify the effects of the two kinds of credit assignment and their interaction. Our model is based on node perturbation learning with an eligibility trace. Node perturbation is a stochastic gradient learning method that can solve the structural credit assignment problem by introducing a perturbation into the system output. The eligibility trace preserves past network activities with a temporal credit to deal with the delay of an instruction signal. We show that the two credit assignment effects interact and that the optimal time constant of the eligibility trace depends not only on the evaluation delay but also on the network size.
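
The abstract does not give the model equations, so the following is only an illustrative sketch of node perturbation learning with an eligibility trace, not the authors' exact formulation. It assumes a linear student network learning from a linear teacher, Gaussian inputs, a squared-error evaluation, and an exponentially decaying trace; the names `train`, `eta`, `sigma`, `decay`, and `delay` are hypothetical and chosen for illustration. The scalar evaluation (the change in error caused by the output perturbation) arrives `delay` steps late and is paired with the current trace of past perturbation-input products.

```python
from collections import deque

import numpy as np


def train(n=20, steps=5000, eta=0.1, sigma=0.1, decay=0.0, delay=0, seed=0):
    """Node perturbation with an eligibility trace (illustrative sketch).

    Assumed setup: linear student and teacher, inputs ~ N(0, I/n),
    squared error, evaluation signal delayed by `delay` steps.
    Returns the generalization error measured before each update.
    """
    rng = np.random.default_rng(seed)
    teacher = rng.standard_normal(n)   # fixed target weights
    weights = np.zeros(n)              # student weights
    trace = np.zeros(n)                # eligibility trace
    pending = deque()                  # evaluations waiting to be delivered
    errors = []

    for _ in range(steps):
        # Generalization error for inputs with covariance I/n.
        errors.append(0.5 * np.sum((weights - teacher) ** 2) / n)

        x = rng.standard_normal(n) / np.sqrt(n)
        xi = sigma * rng.standard_normal()      # output perturbation
        target = teacher @ x
        y = weights @ x

        # Scalar evaluation: error with perturbation minus error without.
        delta_e = 0.5 * (y + xi - target) ** 2 - 0.5 * (y - target) ** 2
        pending.append(delta_e)

        # The trace keeps a decaying memory of perturbation * input.
        trace = decay * trace + xi * x

        # The evaluation from `delay` steps ago is delivered now.
        if len(pending) > delay:
            weights -= eta / sigma**2 * pending.popleft() * trace

    return np.array(errors)


if __name__ == "__main__":
    immediate = train(decay=0.0, delay=0)
    delayed = train(decay=0.5, delay=2)
    print("no delay:      ", immediate[0], "->", immediate[-1])
    print("delay 2 + trace:", delayed[0], "->", delayed[-1])
```

In this sketch, with `delay=0` and `decay=0.0` the update averages to ordinary gradient descent on the squared error. With a nonzero delay, only the trace component that is `delay` steps old is correlated with the delivered evaluation, so the expected update is scaled by `decay ** delay` while the other trace components add noise. That trade-off is one informal way to see why a best trace time constant could depend on both the delay and the network size.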

Journal: Physical Review E
Publication Date: May 20, 2011
Citations: 6

Similar Papers

Short-term memory traces for action bias in human reinforcement learning
Rafal Bogacz ... P Read Montague
Brain Research | VOL. 1153
24 Mar 2007

Solving the Credit Assignment Problem: The Interaction of Explicit and Implicit Learning with Internal and External State Information
17 Apr 2017

Expected Eligibility Traces
Hado Van Hasselt ... Matteo Hessel
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning.
Nathan F Parker ... Laura M Haetzel
Cell Reports | VOL. 39
01 May 2022
