Abstract
Finding rewards and avoiding punishments are powerful goals of behavior. To maximize reward and minimize punishment, it is beneficial to learn about the stimuli that predict their occurrence, and decades of research have provided insight into the brain processes underlying such associative reinforcement learning. In addition, it is well known in experimental psychology, yet often unacknowledged in neighboring scientific disciplines, that subjects also learn about the stimuli that predict the absence of reinforcement. Here we evaluate evidence for both these learning processes. We focus on two study cases that both provide a baseline level of behavior against which the effects of associative learning can be assessed. Firstly, we report pertinent evidence from Drosophila larvae. A re-analysis of the literature reveals that through paired presentations of an odor A and a sugar reward (A+) the animals learn that the reward can be found where the odor is, and therefore show an above-baseline preference for the odor. In contrast, through unpaired training (A/+) the animals learn that the reward can be found precisely where the odor is not, and accordingly these larvae show a below-baseline preference for it (the same is the case, with inverted signs, for learning through taste punishment). In addition, we present previously unpublished data demonstrating that also during a two-odor, differential conditioning protocol (A+/B) both these learning processes take place in larvae, i.e., learning about both the rewarded stimulus A and the non-rewarded stimulus B (again, this is likewise the case for differential conditioning with taste punishment). Secondly, after briefly discussing published evidence from adult Drosophila, honeybees, and rats, we report an unpublished data set showing that relative to baseline behavior after truly random presentations of a visual stimulus A and punishment, rats exhibit memories of opposite valence upon paired and unpaired training. Collectively, the evidence conforms to classical findings in experimental psychology and suggests that across species animals associatively learn both through paired and through unpaired presentations of stimuli with reinforcement – with opposite valence. While the brain mechanisms of unpaired learning for the most part still need to be uncovered, the immediate implication is that using unpaired procedures as a mnemonically neutral control for associative reinforcement learning may be leading analyses astray.
Highlights
Finding rewards and avoiding punishments are powerful goals of behavior in insects and vertebrates, including humans
The evidence presented from larval and adult Drosophila, honeybees, and rats confirms a general principle of classical experimental psychology: that animals learn through both paired and unpaired presentations of a stimulus A with reinforcement, and that the resulting associative memories are opposite in valence
This warns against using the unpairing of A with a reward or punishment as a control for the effects of associative learning
Summary
Finding rewards and avoiding punishments are powerful goals of behavior in insects and vertebrates, including humans. What the presence of the reward during the test does is to prevent the behavioral expression of associative memory, i.e., to abolish the difference in odor preference between paired-trained and unpaired-trained animals (Figure 1, green box plots). Given that innate olfactory behavior in experimentally naive animals is not likewise affected by the presence of punishing tastants (Figure 2), this makes it possible to measure odor preference after paired or unpaired punishment training, and to compare the levels of preference against baseline – which in this case is determined by testing the animals in the absence of the punishment. One suggested approach to measuring positively valenced memory after unpaired training takes advantage of the contextual learning capabilities of rodents (Ostroff et al, 2010; Pollak et al, 2010; Kong et al, 2014) In these experiments, the test takes place in a context in which the animals have previously received foot-shock punishment. From experiments using startle modulation as a bivalent behavioral read-out and the truly random procedure to determine baseline behavior, we conclude that paired and unpaired training establish oppositely valenced associative memories in rats
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.