Abstract
A theoretical framework of reinforcement learning plays an important role in understanding action selection in animals. Spiking neural networks provide a theoretically grounded means to test computational hypotheses on neurally plausible algorithms of reinforcement learning through numerical simulation. However, most of these models cannot handle observations that are noisy or that occurred in the past, even though these are inevitable and constraining features of learning in real environments. This class of problem is formally known as the partially observable reinforcement learning (PORL) problem: a generalization of reinforcement learning to partially observable domains. In addition, observations in the real world tend to be rich and high-dimensional. In this work, we use a spiking neural network model to approximate the free energy of a restricted Boltzmann machine and apply it to solving PORL problems with high-dimensional observations. Our spiking network model solves maze tasks with perceptually ambiguous high-dimensional observations without knowledge of the true environment. An extended model with working memory also solves history-dependent tasks. The way spiking neural networks handle PORL problems may provide a glimpse into the underlying laws of neural information processing which can only be discovered through such a top-down approach.
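The free-energy approximation mentioned above follows the free-energy-based reinforcement learning idea: for a restricted Boltzmann machine with binary hidden units, the negative free energy of a state-action configuration serves as the action-value estimate. A minimal sketch of that quantity, assuming a binary visible vector encoding the state-action pair (variable names are illustrative, not taken from the paper):

```python
import numpy as np

def rbm_free_energy(v, W, b, c):
    """Free energy of an RBM with binary hidden units.

    v : visible vector (binary state-action encoding)
    W : visible-to-hidden weight matrix
    b : visible biases; c : hidden biases
    """
    pre = c + v @ W  # hidden-unit pre-activations
    # F(v) = -b.v - sum_j log(1 + exp(pre_j)); logaddexp(0, x) is a
    # numerically stable log(1 + exp(x))
    return -(v @ b) - np.sum(np.logaddexp(0.0, pre))

def q_value(v, W, b, c):
    """In free-energy-based RL, Q(s, a) is the negative free energy."""
    return -rbm_free_energy(v, W, b, c)
```

With all-zero weights and biases, each hidden unit contributes log 2 to the negative free energy, which gives a quick sanity check on the formula.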
Highlights
When faced with a novel environment, animals learn which actions to take through trial and error
We constructed a spiking neural network model inspired by the free-energy-based reinforcement learning (FERL) framework
Our results show that FERL can be well approximated by a spiking neural network (SNN) model
Summary
When faced with a novel environment, animals learn which actions to take through trial and error. Such reward-driven learning with incomplete knowledge of the environment is called reinforcement learning (RL) [1]. Starting from prominent experimental findings showing that reward prediction errors are correlated with dopamine signals [2], many studies have investigated how reinforcement learning algorithms are implemented in the brain [3,4,5].
A Spiking Neural Network Model of Model-Free Reinforcement Learning
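The reward prediction error referenced above is, in temporal-difference terms, the quantity δ = r + γ max_a′ Q(s′, a′) − Q(s, a). A minimal tabular Q-learning sketch of that update (the state/action indices and learning-rate names are illustrative, not from the paper):

```python
import numpy as np

def td_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One Q-learning step.

    delta is the reward prediction error, the signal that phasic
    dopamine activity is hypothesized to carry [2].
    """
    delta = r + gamma * np.max(Q[s_next]) - Q[s, a]
    Q[s, a] += alpha * delta
    return delta
```

For example, starting from an all-zero table, receiving reward 1.0 yields δ = 1.0 and moves the corresponding Q-value a fraction α toward it.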