Abstract
Event-driven neuromorphic spiking sensors such as the silicon retina and the silicon cochlea encode external sensory stimuli as asynchronous streams of spikes across different channels or pixels. Combining state-of-the-art deep neural networks with the asynchronous outputs of these sensors has produced encouraging results on some datasets but remains challenging. One reason is the lack of effective spiking networks to process the spike streams directly; another is that the pre-processing methods required to convert the spike streams into the frame-based features needed by deep networks still require further investigation. This work investigates the effectiveness of synchronous and asynchronous frame-based features generated using spike-count and constant-event binning, in combination with a recurrent neural network, on a classification task using the N-TIDIGITS18 dataset. This spike-based dataset consists of recordings from the Dynamic Audio Sensor, a spiking silicon cochlea sensor, in response to the TIDIGITS audio dataset. We also propose a new pre-processing method that applies an exponential kernel to the output cochlea spikes so that interspike timing information is better preserved. The results on the N-TIDIGITS18 dataset show that the exponential features outperform the spike-count features, reaching over 91% accuracy on the digit classification task. This accuracy corresponds to an improvement of at least 2.5% over the use of spike-count features, establishing a new state of the art for this dataset.
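The two framing approaches contrasted in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the fixed window length, and the decay constant `tau` are illustrative assumptions, and one plausible reading of the exponential kernel is a per-channel decaying trace sampled at each frame boundary.

```python
import numpy as np

def spike_count_frames(times, channels, n_channels, frame_dt):
    """Spike-count binning: count spikes per channel in fixed,
    non-overlapping time windows of length frame_dt (seconds).
    All interspike timing inside a window is discarded."""
    n_frames = int(np.ceil((times.max() + 1e-9) / frame_dt))
    frames = np.zeros((n_frames, n_channels))
    idx = np.minimum((times / frame_dt).astype(int), n_frames - 1)
    np.add.at(frames, (idx, channels), 1.0)  # unbuffered scatter-add
    return frames

def exponential_frames(times, channels, n_channels, frame_dt, tau):
    """Exponential-kernel features (illustrative): each past spike on a
    channel contributes exp(-(t_frame - t_spike) / tau) to that channel's
    feature at the frame boundary t_frame, so recent spikes weigh more
    and interspike timing is partially preserved in the frame values."""
    n_frames = int(np.ceil((times.max() + 1e-9) / frame_dt))
    frames = np.zeros((n_frames, n_channels))
    for f in range(n_frames):
        t_end = (f + 1) * frame_dt
        mask = times <= t_end  # all spikes up to this frame boundary
        contrib = np.exp(-(t_end - times[mask]) / tau)
        np.add.at(frames[f], channels[mask], contrib)
    return frames
```

With spike times `[0.0, 0.5, 1.5]` on channels `[0, 0, 1]`, 1 s frames, and `tau = 1` s, the count frames are `[[2, 0], [0, 1]]`, while the exponential frames keep graded values that reflect how long before each frame boundary the spikes occurred.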
Highlights
The event processing methods for the asynchronous spikes of event-based sensors such as the Dynamic Vision Sensor (DVS) (Lichtsteiner et al., 2008; Berner et al., 2013; Posch et al., 2014; Yang et al., 2015) and the Dynamic Audio Sensor (DAS) (Liu et al., 2014; Yang et al., 2016) fall roughly into two categories: neural network methods and machine learning algorithms.
We present the network accuracy results of the different pre-processing methods on the audio classification task based on the N-TIDIGITS18 dataset when these features are presented to the different recurrent models.
We performed a comparative study of the classification accuracy of a gated recurrent neural network that processes either the raw audio spikes or frame-based features extracted by different spike processing methods.
Summary
The event processing methods for the asynchronous spikes of event-based sensors such as the Dynamic Vision Sensor (DVS) (Lichtsteiner et al., 2008; Berner et al., 2013; Posch et al., 2014; Yang et al., 2015) and the Dynamic Audio Sensor (DAS) (Liu et al., 2014; Yang et al., 2016) fall roughly into two categories: neural network methods and machine learning algorithms. By using conversion methods that transform pre-trained standard deep networks into spiking networks of equivalent accuracy (Diehl et al., 2015; Rueckauer et al., 2017), or by applying training methods from deep learning to networks that capture the underlying parameters of the spiking neuron (O'Connor et al., 2013; Stromatias et al., 2015), we are starting to see spiking deep networks that can be competitive with standard deep networks.