The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.

Friedemann Zenke,Tim P Vogels

doi:10.1162/neco_a_01367

Abstract

Brains process information in spiking neural networks. Their intricate connections shape the diverse functions these networks perform. Yet how network connectivity relates to function is poorly understood, and the functional capabilities of models of spiking networks are still rudimentary. The lack of both theoretical insight and practical algorithms to find the necessary connectivity poses a major impediment to both studying information processing in the brain and building efficient neuromorphic hardware systems. The training algorithms that solve this problem for artificial neural networks typically rely on gradient descent. But doing so in spiking networks has remained challenging due to the nondifferentiable nonlinearity of spikes. To avoid this issue, one can employ surrogate gradients to discover the required connectivity. However, the choice of a surrogate is not unique, raising the question of how its implementation influences the effectiveness of the method. Here, we use numerical simulations to systematically study how essential design parameters of surrogate gradients affect learning performance on a range of classification problems. We show that surrogate gradient learning is robust to different shapes of underlying surrogate derivatives, but the choice of the derivative's scale can substantially affect learning performance. When we combine surrogate gradients with suitable activity regularization techniques, spiking networks perform robust information processing at the sparse activity limit. Our study provides a systematic account of the remarkable robustness of surrogate gradient learning and serves as a practical guide to model functional spiking neural networks.

Highlights

The computational power of deep neural networks (LeCun, Bengio, & Hinton, 2015; Schmidhuber, 2015) has reinvigorated interest in using in-silico systems to study information processing in the brain (Barrett, Morcos, & Macke, 2019; Richards et al, 2019)
Previous studies did not address this question because they solved different computational problems, precluding a direct comparison. We address this issue by providing benchmarks for comparing the trainability of spiking neural networks (SNNs) on a range of supervised learning tasks and systematically vary the shape and scale of the surrogate derivative used for training networks on the same task
Surrogate gradients offer a promising way to instill complex functions in artificial models of spiking networks. This step is imperative for developing brain-inspired neuromorphic hardware and using SNNs as in silico models to study information processing in the brain

Summary

Introduction

The computational power of deep neural networks (LeCun, Bengio, & Hinton, 2015; Schmidhuber, 2015) has reinvigorated interest in using in-silico systems to study information processing in the brain (Barrett, Morcos, & Macke, 2019; Richards et al, 2019). The activity of artificial recurrent neural networks optimized to solve cognitive tasks resembles cortical activity in prefrontal (Cueva et al, 2019; Mante, Sussillo, Shenoy, & Newsome, 2013), medial frontal (Wang, Narain, Hosseini, & Jazayeri, 2018), and motor areas (Michaels, Schaffelhofer, Agudelo-Toro, & Scherberger, 2019; Stroud, Porter, Hennequin, & Vogels, 2018), providing us with new vistas for understanding the dynamic properties of computation in recurrent neural networks (Barrett et al, 2019; Sussillo & Barak, 2012; Williamson, Doiron, Smith, & Yu, 2019). Deep neural networks differ from biological neural networks in important respects They lack cell type diversity and do not obey Dale’s law while ignoring the fact that the brain uses spiking neurons. This is not the case for spiking neural networks (SNNs)

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neural computation	Publication Date: Mar 26, 2021
Citations: 121	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Neural computation

Lead the way for us

Similar Papers

Fluctuation-driven initialization for spiking neural network training
Julian Rossbroich ... Julia Gygax
Neuromorphic Computing and Engineering | VOL. 2
Julian Rossbroich, et. al.Julian Rossbroich ... Julia Gygax
01 Dec 2022
Neuromorphic Computing and Engineering | VOL. 2

A surrogate gradient spiking baseline for speech command recognition.
Alexandre Bittar ... Philip N Garner
Frontiers in Neuroscience | VOL. 16
Alexandre Bittar, et. al.Alexandre Bittar ... Philip N Garner
22 Aug 2022
Frontiers in Neuroscience | VOL. 16

Meta-learning spiking neural networks with surrogate gradient descent
Kenneth M Stewart ... Emre O Neftci
Neuromorphic Computing and Engineering | VOL. 2
Kenneth M Stewart, et. al.Kenneth M Stewart ... Emre O Neftci
30 Sep 2022
Neuromorphic Computing and Engineering | VOL. 2

Skipper: Enabling efficient SNN training through activation-checkpointing and time-skipping
Sonali Singh ... Mahmut T Kandemir
-
Sonali Singh, et. al.Sonali Singh ... Mahmut T Kandemir
01 Oct 2022
01 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Neural computation