Surrogate Gradient Research Articles

As the representatives of brain-inspired models at the neuronal level, spiking neural networks (SNNs) have shown great promise in processing spatiotemporal information with intrinsic temporal dynamics. SNNs are expected to further improve their robustness and computing efficiency by introducing top-down attention at the architectural level, which is crucial for the human brain to support advanced intelligence. However, this attempt encounters difficulties in optimizing the attention in SNNs largely due to the lack of annotations. Here, we develop a hybrid network model with a top-down attention mechanism (HTDA) by incorporating an artificial neural network (ANN) to generate attention maps based on the features extracted by a feedforward SNN. The attention map is then used to modulate the encoding layer of the SNN so that it focuses on the most informative sensory input. To facilitate direct learning of attention maps and avoid labor-intensive annotations, we propose a general principle and a corresponding weakly-supervised objective, which promotes the HTDA model to utilize an integral and small subset of the input to give accurate predictions. On this basis, the ANN and the SNN can be jointly optimized by surrogate gradient descent in an end-to-end manner. We comprehensively evaluated the HTDA model on object recognition tasks, which demonstrates strong robustness to adversarial noise, high computing efficiency, and good interpretability. On the widely-adopted CIFAR-10, CIFAR-100, and MNIST benchmarks, the HTDA model reduces firing rates by up to 50% and improves adversarial robustness by up to 10% with comparable or better accuracy compared with the state-of-the-art SNNs. The HTDA model is also verified on dynamic neuromorphic datasets and achieves consistent improvements. This study provides a new way to boost the performance of SNNs by employing a hybrid top-down attention mechanism.

Read full abstract

The brain-inspired Spiking neural networks (SNN) claim to present advantages for visual classification tasks in terms of energy efficiency and inherent robustness. In this work, we explore the impact on network inter-layer sparsity through neural coding schemes and the intrinsic structural parameters of Leaky Integrate-and-Fire (LIF) neurons, which can be a candidate metric for performance evaluation. Towards this, we perform a comparative study of four critical neural coding schemes: rate coding (poisson coding), latency coding, phase coding, and direct coding, as well as 6 LIF neuron intrinsic parameter options for a total of 24 combined parameter schemes. Specifically, the models were trained using a supervised training algorithm with a surrogate gradient, and two adversarial attacks, Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) were applied on a CIFAR10 dataset. We identified the sources of interlayer sparsity in SNN, and quantitatively analyzed the differences in sparsity caused by coding schemes, neuron leakage factors and thresholds. Various aspects of network performance were thoroughly considered in this paper, including inference accuracy, adversarial robustness, and energy efficiency. Our results show that latency coding is the optimum choice in achieving the highest adversarial robustness and energy efficient against low intensity attacks, while rate coding offers the best adversarial robustness against medium and high intensity attacks. The maximum deviations of robustness and efficiency between different coding schemes are 9.35% in VGG5 and 13.59% in VGG9. Increasing the sparsity of spike activity by improving the threshold can bring a short-lived adversarial robustness sweet spot, while excessive sparsity due to changes in threshold and leakage can instead reduce the adversarial robustness. The study reveals the advantages and disadvantages, and design space of SNN in various dimensions, allowing researchers to frame their neuromorphic systems in terms of the coding methods, neuron inherent structure, and model learning capabilities.

Read full abstract

Surrogate Gradient Research Articles

Articles published on Surrogate Gradient

Enhancing spiking neural networks with hybrid top-down attention.

Bio-inspired computing with magnetic skyrmions using deep learning

Encrypted internet traffic classification using a supervised spiking neural network

Recognition of Electromagnetic Signals Based on the Spiking Convolutional Neural Network

Relaxation LIF: A gradient-based spiking neuron for direct training deep spiking neural networks

Surrogate gradients for analog neuromorphic computing

A Hybrid Spiking Neurons Embedded LSTM Network for Multivariate Time Series Learning under Concept-drift Environment

A Comparative Study on the Performance and Security Evaluation of Spiking Neural Networks

StereoSpike: Depth Learning With a Spiking Neural Network

Spiking Neural Networks Trained via Proxy

ConvSNN: A surrogate gradient spiking neural framework for radar gesture recognition

Accurate and efficient time-domain classification with adaptive spiking recurrent neural networks

Integration of Leaky-Integrate-and-Fire Neurons in Standard Machine Learning Architectures to Generate Hybrid Networks: A Surrogate Gradient Approach.

Few-Shot Learning in Spiking Neural Networks by Multi-Timescale Optimization.

Direct training of hardware-friendly weight binarized spiking neural network with surrogate gradient learning towards spatio-temporal event-based dynamic data recognition

Application of the surrogate gradient method for a multi-item single-machine dynamic lot size scheduling problem

Exploring Optimized Spiking Neural Network Architectures for Classification Tasks on Embedded Platforms.

An Efficient Learning Algorithm for Direct Training Deep Spiking Neural Networks

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.

Spiking Neural Networks—Part II: Detecting Spatio-Temporal Patterns

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Surrogate Gradient Research Articles

Articles published on Surrogate Gradient

Enhancing spiking neural networks with hybrid top-down attention.

Bio-inspired computing with magnetic skyrmions using deep learning

Encrypted internet traffic classification using a supervised spiking neural network

Recognition of Electromagnetic Signals Based on the Spiking Convolutional Neural Network

Relaxation LIF: A gradient-based spiking neuron for direct training deep spiking neural networks

Surrogate gradients for analog neuromorphic computing

A Hybrid Spiking Neurons Embedded LSTM Network for Multivariate Time Series Learning under Concept-drift Environment

A Comparative Study on the Performance and Security Evaluation of Spiking Neural Networks

StereoSpike: Depth Learning With a Spiking Neural Network

Spiking Neural Networks Trained via Proxy

ConvSNN: A surrogate gradient spiking neural framework for radar gesture recognition

Accurate and efficient time-domain classification with adaptive spiking recurrent neural networks

Integration of Leaky-Integrate-and-Fire Neurons in Standard Machine Learning Architectures to Generate Hybrid Networks: A Surrogate Gradient Approach.

Few-Shot Learning in Spiking Neural Networks by Multi-Timescale Optimization.

Direct training of hardware-friendly weight binarized spiking neural network with surrogate gradient learning towards spatio-temporal event-based dynamic data recognition

Application of the surrogate gradient method for a multi-item single-machine dynamic lot size scheduling problem

Exploring Optimized Spiking Neural Network Architectures for Classification Tasks on Embedded Platforms.

An Efficient Learning Algorithm for Direct Training Deep Spiking Neural Networks

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks.

Spiking Neural Networks—Part II: Detecting Spatio-Temporal Patterns