Joint modeling of reaction times and choice improves parameter identifiability in reinforcement learning models

Ian C Ballard, Samuel M McClure

doi:10.1016/j.jneumeth.2019.01.006

Abstract

Background: Reinforcement learning models provide excellent descriptions of learning in multiple species across a variety of tasks. Many researchers are interested in relating parameters of reinforcement learning models to neural measures, psychological variables or experimental manipulations. We demonstrate that parameter identification is difficult because a range of parameter values provide approximately equal quality fits to data. This identification problem has a large impact on power: we show that a researcher who wants to detect a medium-sized correlation (r = .3) with 80% power between a variable and learning rate must collect 60% more subjects than specified by a typical power analysis in order to account for the noise introduced by model fitting.

New method: We derive a Bayesian optimal model fitting technique that takes advantage of information contained in choices and reaction times to constrain parameter estimates.

Results: We show using simulation and empirical data that this method substantially improves the ability to recover learning rates.

Comparison with existing methods: We compare this method against the use of Bayesian priors. We show in simulations that the combined use of Bayesian priors and reaction times confers the highest parameter identifiability. However, in real data where the priors may have been misspecified, the use of Bayesian priors interferes with the ability of reaction time data to improve parameter identifiability.

Conclusions: We present a simple technique that takes advantage of readily available data to substantially improve the quality of inferences that can be drawn from parameters of reinforcement learning models.
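To put the power claim in the abstract into concrete numbers, here is a minimal sketch. It assumes that the "typical power analysis" is the standard Fisher z approximation for a two-sided test of a Pearson correlation at alpha = .05; the abstract does not say which calculation the authors used, so the baseline figure is illustrative only. The function name `n_for_correlation` is hypothetical, and the 60% inflation is taken directly from the abstract.

```python
import math
from statistics import NormalDist


def n_for_correlation(r, power=0.80, alpha=0.05):
    """Approximate sample size needed to detect a correlation r.

    Assumption: two-sided test using the Fisher z transformation,
    n = ((z_{1-alpha/2} + z_{power}) / atanh(r))^2 + 3.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value of the test
    z_beta = NormalDist().inv_cdf(power)           # quantile for the target power
    return math.ceil(((z_alpha + z_beta) / math.atanh(r)) ** 2 + 3)


# Baseline sample size for r = .3 at 80% power under the assumption above.
baseline = n_for_correlation(0.3)

# The abstract reports that about 60% more subjects are needed once the
# noise introduced by model fitting is taken into account.
inflated = math.ceil(baseline * 1.6)

print(baseline, inflated)
```

Under these assumptions the baseline is 85 subjects, so the 60% inflation reported in the abstract corresponds to roughly 136 subjects.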

Journal: Journal of Neuroscience Methods	Publication Date: Jan 18, 2019
Citations: 46	License type: cc-by-nc

Similar Papers

Author response: Associability-modulated loss learning is increased in posttraumatic stress disorder
Vanessa M Brown ... John M Wang
19 Oct 2017

Author response: DYT1 dystonia increases risk taking in humans
David Arkadir ... Susan B Bressman
26 Apr 2016

Reward prediction error in reinforcement learning: psychosis, personality, and model exploration
01 Jan 2012

Stress, individual differences, and norepinephrine in reinforcement learning-based prediction of mouse behavior in conditioning and spatial learning
...
01 Jan 2009
