Abstract

There is significant evidence that in addition to reward-punishment based decision making, the Basal Ganglia (BG) contributes to risk-based decision making (Balasubramani et al., 2014). Despite this evidence, little is known about the computational principles and neural correlates of risk computation in this subcortical system. We have previously proposed a reinforcement learning (RL)-based model of the BG that simulates the interactions between dopamine (DA) and serotonin (5HT) in a diverse set of experimental studies including reward, punishment and risk based decision making (Balasubramani et al., 2014). Starting with the classical idea that the activity of mesencephalic DA represents reward prediction error, the model posits that serotoninergic activity in the striatum controls risk-prediction error. Our prior model of the BG was an abstract model that did not incorporate anatomical and cellular-level data. In this work, we expand the earlier model into a detailed network model of the BG and demonstrate the joint contributions of DA-5HT in risk and reward-punishment sensitivity. At the core of the proposed network model is the following insight regarding cellular correlates of value and risk computation. Just as DA D1 receptor (D1R) expressing medium spiny neurons (MSNs) of the striatum were thought to be the neural substrates for value computation, we propose that DA D1R and D2R co-expressing MSNs are capable of computing risk. Though the existence of MSNs that co-express D1R and D2R are reported by various experimental studies, prior existing computational models did not include them. Ours is the first model that accounts for the computational possibilities of these co-expressing D1R-D2R MSNs, and describes how DA and 5HT mediate activity in these classes of neurons (D1R-, D2R-, D1R-D2R- MSNs). Starting from the assumption that 5HT modulates all MSNs, our study predicts significant modulatory effects of 5HT on D2R and co-expressing D1R-D2R MSNs which in turn explains the multifarious functions of 5HT in the BG. The experiments simulated in the present study relates 5HT to risk sensitivity and reward-punishment learning. Furthermore, our model is shown to capture reward-punishment and risk based decision making impairment in Parkinson's Disease (PD). The model predicts that optimizing 5HT levels along with DA medications might be essential for improving the patients' reward-punishment learning deficits.

Highlights

  • Decision making is related to choosing an action from a set of potential alternatives

  • We apply the model of 5HT and DA in the Basal Ganglia (BG) (Section Modeling the BG Network in Healthy Control Subjects) to explain several reward/punishment/riskbased decision making phenomena pertaining to the BG function

  • The other parameters: gain functions of the D1R, Dopamine D2 receptor (D2R), Dopamine D1 and D2 receptors (D1R-D2R) Medium Spiny Neuron (MSN) in the striatum equations (2.3.1, 2.2.3, 2.2.6); the model neuromodulator correlates for 5HT viz., αD1, αD2, αD1D2 that affect D1R, D2R, and the D1R-D2R MSNs, respectively; and DA parameters that condition Parkinson’s Disease (PD), are optimized for each experiment

Read more

Summary

Introduction

Decision making is related to choosing an action from a set of potential alternatives. (2) Representing the temporal gradient of the utility function [:=δU Equation (2.3.6)], used for switching between DP and IP (Chakravarthy and Balasubramani, 2014) For such a DA signal (:=δU) from the SNc, those neurons might be using the information of the value component received due to the D1R MSN projections from striatum to SNc (Schultz et al, 1997; Doya, 2002; Houk et al, 2007), and the risk component from the projections of D1R-D2R MSNs to SNc (Surmeier et al, 1996; Perreault et al, 2010, 2011). The same is verified by the model parameters αD1, αD2, and αD1D2 in various medication cases of PD (Section Modeling the Reward-punishment Sensitivity in PD)

Experiments and Results
Results
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call