Reward Rate Research Articles

Objective:Combat exposure is associated with higher rates of depressive symptoms, including anhedonia (i.e., a reduced ability to seek and experience rewards) and feelings of social disconnectedness. While these symptoms are commonly documented in combat-exposed Veterans following deployment, the cognitive mechanisms underlying this pathology is less well understood. Computational modeling can provides detailed mechanistic insights into complex cognition, which may be particularly useful to understand how social reward processing is altered following combat exposure. Here, we use a Bayesian learning model framework to address this question.Participants and Methods:Thirty-three Operation Enduring Freedom (OEF)/ Operation Iraqi Freedom (OIF)/Operation New Dawn (OND) Veterans (25 Male, 8 Female) between the ages of 18-65 years old (M = 41.61, SD = 10.49) participated in this study. In both classic/monetary and social reward conditions, participants completed a 2-arm bandit task, in which they must choose on each trial between two options (i.e., slot machine vs social partner) with unknown reward rates. While they received monetary outcomes in the classic condition, participants received compliments from different fictitious partners in the social condition. We first compared a learning-independent Win-stay/Lose-shift (WSLS) heuristic and either a Rescorla-Wagner Q-learning or a Bayesian learning model (Dynamic Belief Model/DBM) paired with a Softmax reward maximization policy. DBM+Softmax provided the best fit of the data for most participants (31/33). Individual DBM parameters of prior reward expectation, reward learning (i.e., perceived stability of reward rates), and Softmax reward maximization were estimated and compared across conditions.Results:Participants did not differ in their reward learning parameters across monetary and social conditions (t(30)= -0.70, p = 0.490), suggesting similar perception of reward stability in both modalities. However, higher Bayesian prior mean (i.e., initial belief of reward rate; t(30)= -2.31, p = 0.028, d=0.42) and greater reward maximization (i.e., Softmax parameter; t(30)= -2.26, p = 0.031, d=0.41) were observed in response to social vs monetary rewards. In the social reward condition, higher self-reported social connectedness was associated with greater model fit of our DBM model (i.e., smaller Bayesian Information Criterion/BIC; r = -0.38, p = 0.041). In this condition, those expecting higher reward rates when initiating reward exploration (those with higher DBM prior mean) endorsed lower self-esteem (Spearman's ρ = -0.43, p = 0.078) and lower positive affect (ρ = -0.32, p = 0.078).Conclusions:A Bayesian learning modeling framework can characterize mechanistic differences in the processing of social vs non-social reward among combat-exposed Veterans. Individuals with higher social connectedness were more model-based in their performance, consistent with the notion that they are more likely to estimate and anticipate how much social peers have to offer. Combat-exposed individuals with lower self-esteem and positive affect appear to have higher initial expectations of reward from unknown partners, which could reflect greater need for mood and/or self-esteem repair in those individuals. Overall, Bayesian modeling of social reward behavior provides a useful quantitative framework to predict clinically relevant construct of functional outcomes in military populations.

Outcomes and feedbacks on performance may influence behavior beyond the context in which it was received, yet it remains unclear what neurobehavioral mechanisms may account for such lingering influences on behavior. The average reward rate (ARR) has been suggested to regulate motivated behavior, and was found to interact with dopamine-sensitive cognitive processes, such as vigilance and associative memory encoding. The ARR could therefore provide a bridge between independent tasks when these are performed in temporal proximity, such that the reward rate obtained in one task could influence performance in a second subsequent task. Reinforcement learning depends on the coding of prediction error signals by dopamine neurons and their downstream targets, in particular the nucleus accumbens. Because these brain regions also respond to changes in ARR, reinforcement learning may be vulnerable to changes in ARR. To test this hypothesis, we designed a novel paradigm in which participants (n = 245) performed two probabilistic reinforcement learning tasks presented in interleaved trials. The ARR was controlled by an "induction" task which provided feedback with a low (p = 0.58), a medium (p = 0.75), or a high probability of reward (p = 0.92), while the impact of ARR on reinforcement learning was tested by a second "reference" task with a constant reward probability (p = 0.75). We find that performance was significantly lower in the reference task when the induction task provided low reward probabilities (i.e., during low levels of ARR), as compared to the medium and high ARR conditions. Behavioral modeling further revealed that the influence of ARR is best described by models which accumulates average rewards (rather than average prediction errors), and where the ARR directly modulates the prediction error signal (rather than affecting learning rates or exploration). Our results demonstrate how affective information in one domain may transfer and affect motivated behavior in other domains. These findings are particularly relevant for understanding mood disorders, but may also inform abnormal behaviors attributed to dopamine dysfunction.

Reward Rate Research Articles

Related Topics

Articles published on Reward Rate

34 Neurocomputational Mechanisms of Social Reward Processing in Combat-Exposed Veterans

Real, potentially real, and hypothetical monetary rewards in probability discounting.

Reduced Sensitivity to Background Reward Underlies Apathy After Traumatic Brain Injury: Insights From an Ecological Foraging Framework.

Managing Customer Churn via Service Mode Control

Comparison of three participation modes on ride-hailing platforms

When Smaller Sooner Depletes a Pool of Resources Faster.

Experience-dependent evolution of odor mixture representations in piriform cortex.

Capacity investment of shore power berths for a container port: Environmental incentive and infrastructure subsidy policies

Impact of decision and action outcomes on subsequent decision and action behaviours in humans.

A stochastic time scale based framework for system reliability under a Markovian dynamic environment

Genetic variation in the dopamine system is associated with mixed-strategy decision-making in patients with Parkinson's disease.

Overconsumption as a function of how individuals make choices: A paper in honor of Howard Rachlin's contributions to psychology.

Average reward rates enable motivational transfer across independent reinforcement learning tasks

Zero-sum semi-Markov games with state-action-dependent discount factors

Normative decision rules in changing environments.

The Average Reward Rate Modulates Behavioral and Neural Indices of Effortful Control Allocation.

Incentive Framework for Cross-Device Federated Learning and Analytics With Multiple Tasks Based on a Multi-Leader-Follower Game

Mice are Near Optimal Timers

Critical periods when dopamine controls behavioral responding during Pavlovian learning.

Dopamine and reward-related vigor in younger and older adults

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Reward Rate Research Articles

Related Topics

Articles published on Reward Rate

34 Neurocomputational Mechanisms of Social Reward Processing in Combat-Exposed Veterans

Real, potentially real, and hypothetical monetary rewards in probability discounting.

Reduced Sensitivity to Background Reward Underlies Apathy After Traumatic Brain Injury: Insights From an Ecological Foraging Framework.

Managing Customer Churn via Service Mode Control

Comparison of three participation modes on ride-hailing platforms

When Smaller Sooner Depletes a Pool of Resources Faster.

Experience-dependent evolution of odor mixture representations in piriform cortex.

Capacity investment of shore power berths for a container port: Environmental incentive and infrastructure subsidy policies

Impact of decision and action outcomes on subsequent decision and action behaviours in humans.

A stochastic time scale based framework for system reliability under a Markovian dynamic environment

Genetic variation in the dopamine system is associated with mixed-strategy decision-making in patients with Parkinson's disease.

Overconsumption as a function of how individuals make choices: A paper in honor of Howard Rachlin's contributions to psychology.

Average reward rates enable motivational transfer across independent reinforcement learning tasks

Zero-sum semi-Markov games with state-action-dependent discount factors

Normative decision rules in changing environments.

The Average Reward Rate Modulates Behavioral and Neural Indices of Effortful Control Allocation.

Incentive Framework for Cross-Device Federated Learning and Analytics With Multiple Tasks Based on a Multi-Leader-Follower Game

Mice are Near Optimal Timers

Critical periods when dopamine controls behavioral responding during Pavlovian learning.

Dopamine and reward-related vigor in younger and older adults