SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT

Qiaoyi Su,Shijie Mei,Xingrun Xing,Man Yao,Jiajun Zhang,Bo Xu,Guoqi Li

doi:10.1016/j.neunet.2024.106630

Abstract

Spiking Neural Networks (SNNs) are naturally suited to process sequence tasks such as NLP with low power, due to its brain-inspired spatio-temporal dynamics and spike-driven nature. Current SNNs employ ”repeat coding” that re-enter all input tokens at each timestep, which fails to fully exploit temporal relationships between the tokens and introduces memory overhead. In this work, we align the number of input tokens with the timestep and refer to this input coding as ”individual coding”. To cope with the increase in training time for individual encoded SNNs due to the dramatic increase in timesteps, we design a Bidirectional Parallel Spiking Neuron (BPSN) with following features: First, BPSN supports spike parallel computing and effectively avoids the issue of uninterrupted firing; Second, BPSN excels in handling adaptive sequence length tasks, which is a capability that existing work does not have; Third, the fusion of bidirectional information enhances the temporal information modeling capabilities of SNNs; To validate the effectiveness of our BPSN, we present the SNN-BERT, a deep direct training SNN architecture based on the BERT model in NLP. Compared to prior repeat 4-timestep coding baseline, our method achieves a 6.46× reduction in energy consumption and a significant 16.1% improvement, raising the performance upper bound of the SNN domain on the GLUE dataset to 74.4%. Additionally, our method achieves 3.5× training acceleration and 3.8× training memory optimization. Compared with artificial neural networks of similar architecture, we obtain comparable performance but up to 22.5× energy efficiency. We would provide the codes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Toward Reflective Spiking Neural Networks Exploiting Memristive Devices.
Valeri A Makarov ... Viktor B Kazantsev
Frontiers in computational neuroscience | VOL. 16
Valeri A Makarov, et. al.Valeri A Makarov ... Viktor B Kazantsev
16 Jun 2022
Frontiers in computational neuroscience | VOL. 16

Benchmarking Artificial Neural Network Architectures for High-Performance Spiking Neural Networks.
Riadul Islam ... Sri Ranga Sai Krishna Tummala
Sensors | VOL. 24
Riadul Islam, et. al.Riadul Islam ... Sri Ranga Sai Krishna Tummala
19 Feb 2024
Sensors | VOL. 24

High-performance deep spiking neural networks with 0.3 spikes per neuron
Ana Stanojevic ... Wulfram Gerstner
Nature Communications | VOL. 15
Ana Stanojevic, et. al.Ana Stanojevic ... Wulfram Gerstner
09 Aug 2024
Nature Communications | VOL. 15

SATA: Sparsity-Aware Training Accelerator for Spiking Neural Networks
Ruokai Yin ... Abhishek Moitra
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 42
Ruokai Yin, et. al.Ruokai Yin ... Abhishek Moitra
01 Jun 2023
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT

Abstract

Talk to us

Similar Papers

More From: Neural Networks