Rare Neural Correlations Implement Robotic Conditioning with Delayed Rewards and Disturbances

Andrea Soltoggio,Felix Reinhart,Jochen J Steil,Andre Lemme

doi:10.3389/fnbot.2013.00006

Abstract

Neural conditioning associates cues and actions with following rewards. The environments in which robots operate, however, are pervaded by a variety of disturbing stimuli and uncertain timing. In particular, variable reward delays make it difficult to reconstruct which previous actions are responsible for following rewards. Such an uncertainty is handled by biological neural networks, but represents a challenge for computational models, suggesting the lack of a satisfactory theory for robotic neural conditioning. The present study demonstrates the use of rare neural correlations in making correct associations between rewards and previous cues or actions. Rare correlations are functional in selecting sparse synapses to be eligible for later weight updates if a reward occurs. The repetition of this process singles out the associating and reward-triggering pathways, and thereby copes with distal rewards. The neural network displays macro-level classical and operant conditioning, which is demonstrated in an interactive real-life human-robot interaction. The proposed mechanism models realistic conditioning in humans and animals and implements similar behaviors in neuro-robotic platforms.

Highlights

In reward learning, the results of actions, manifested as rewards or punishments, occur often seconds after the actions that caused them
The present study demonstrates the use of rare neural correlations in making correct associations between rewards and previous cues or actions
This study demonstrates neural robotic conditioning in humanrobot interactive scenarios with delayed rewards, disturbing stimuli, and uncertain timing

Summary

Introduction

The results of actions, manifested as rewards or punishments, occur often seconds after the actions that caused them For this reason, it is not always easy to determine which previous stimuli and actions are causally associated with following rewards. It is not always easy to determine which previous stimuli and actions are causally associated with following rewards This problem was named distal reward problem (Hull, 1943), or credit assignment problem (Sutton and Barto, 1998). This problem and the ability of animals to solve it emerged originally in behavioral psychology (Thorndike, 1911; Pavlov, 1927; Skinner, 1953). The ability of determining such relationships is distinctive of human and animal intelligence

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Neurorobotics	Publication Date: Jan 1, 2013
Citations: 54	License type: cc-by

R Discovery Prime

R Discovery Prime

Rare Neural Correlations Implement Robotic Conditioning with Delayed Rewards and Disturbances

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Neurorobotics

Lead the way for us

Similar Papers

Artificial brain. Biological and artificial neural networks, advantages, disadvantages, and prospects for development
V.O Yashchenko
Mathematical machines and systems | VOL. 2
V.O YashchenkoV.O Yashchenko
01 Jan 2023
Mathematical machines and systems | VOL. 2

Reconfigurable logic gates in biological crossbar neural networks using STDP learning.
Yonghee Bae ... Kyo-Seok Lee
Biophysical Journal | VOL. 122
Yonghee Bae, et. al.Yonghee Bae ... Kyo-Seok Lee
01 Feb 2023
Biophysical Journal | VOL. 122

A New Model of the Neuron for Biological Spiking Neural Network Suitable for Parallel Data Processing Realized in Hardware
Aleksandra Świetlicka ... Rafał Długosz
Solid State Phenomena | VOL. 199
Aleksandra Świetlicka, et. al.Aleksandra Świetlicka ... Rafał Długosz
01 Mar 2013
Solid State Phenomena | VOL. 199

Function of biological asymmetrical neural networks
Naohiro Ishii ... Ken-Ichi Naka
-
Naohiro Ishii, et. al.Naohiro Ishii ... Ken-Ichi Naka
01 Jan 1997
01 Jan 1997

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rare Neural Correlations Implement Robotic Conditioning with Delayed Rewards and Disturbances

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Neurorobotics