Abstract

Recent advances in deep neural networks (DNNs) owe their success to training algorithms that use backpropagation and gradient descent. Backpropagation, while highly effective on von Neumann architectures, becomes inefficient when scaling to large networks. Each neuron's dependence on the weights and errors located deeper in the network, commonly referred to as the weight transport problem, requires exhaustive data movement and presents a key obstacle to improving the performance and energy efficiency of machine-learning hardware. In this work, we propose a bio-plausible alternative to backpropagation, drawing from advances in feedback alignment algorithms, in which the error computation at a single synapse reduces to the product of three scalar values. Using a sparse feedback matrix, we show that a neuron needs only a fraction of the information previously used by feedback alignment algorithms. Consequently, memory and compute can be partitioned and distributed in whichever way produces the most efficient forward pass, so long as a single error can be delivered to each neuron. We evaluate our algorithm on standard datasets, including ImageNet, to address concerns about scaling to challenging problems. Our results show orders-of-magnitude improvement in data movement and a 2× improvement in multiply-and-accumulate operations over backpropagation. Like previous work, we observe that all variants of feedback alignment suffer significant losses in classification accuracy on deep convolutional neural networks. By transferring trained convolutional layers and training the fully connected layers with direct feedback alignment, we demonstrate that direct feedback alignment can obtain results competitive with backpropagation. Furthermore, we observe that using an extremely sparse feedback matrix, rather than a dense one, yields hardware advantages at the cost of only a small drop in accuracy. All code and results are available at https://github.com/bcrafton/ssdfa.
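
To make the data-movement contrast concrete, below is a minimal NumPy sketch (our own illustration, not the released implementation; the layer sizes, the feedback matrix B, and all function names are assumptions) of how the hidden-layer error is formed under backpropagation versus direct feedback alignment: backpropagation reads the transposed downstream weights W2, whereas direct feedback alignment projects the output error through a fixed random matrix B.

import numpy as np

# Illustrative one-hidden-layer network: x -> h = relu(W1 @ x) -> y = W2 @ h.
# Sizes are arbitrary choices for the sketch.
n_in, n_hid, n_out = 784, 256, 10
rng = np.random.default_rng(0)

W1 = rng.normal(0.0, 0.01, (n_hid, n_in))
W2 = rng.normal(0.0, 0.01, (n_out, n_hid))
B  = rng.normal(0.0, 0.01, (n_hid, n_out))  # fixed random feedback matrix

def relu(z):
    return np.maximum(z, 0.0)

def drelu(z):
    return (z > 0).astype(z.dtype)

def hidden_delta_backprop(e, z1):
    # Backpropagation: the hidden error depends on the transpose of the
    # downstream weights W2 -- the weight transport problem.
    return (W2.T @ e) * drelu(z1)

def hidden_delta_dfa(e, z1):
    # Direct feedback alignment: the output error e is delivered through a
    # fixed random matrix B, so no downstream weights need to be moved.
    return (B @ e) * drelu(z1)

# Illustrative usage: forward pass, output error, and both hidden deltas.
x  = rng.normal(size=n_in)
z1 = W1 @ x
e  = (W2 @ relu(z1)) - np.eye(n_out)[3]  # error against an arbitrary one-hot target
delta_bp  = hidden_delta_backprop(e, z1)
delta_dfa = hidden_delta_dfa(e, z1)

When B is sparse, each row of the product B @ e touches only the few error components that neuron is wired to, which is the property exploited by the sparse variant summarized in the Highlights.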

Highlights

  • The demise of Dennard scaling (Dennard et al., 1974) and the decline of Moore's Law (Moore, 1965) have exposed the fundamental scaling limitations of von Neumann models of computing

  • We present a modified version of Direct Feedback Alignment (DFA), sparse direct feedback alignment (SDFA), in which sparse feedback of the error signals results in only a small drop in the network's performance while significantly reducing the computational complexity of learning

  • In an extreme version of SDFA, we demonstrate that even a single error feedback signal per neuron can enable the network to learn with a small performance loss (see the sketch following this list)
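
As a rough sketch of that extreme case (again an illustration under assumed names and sizes, not the paper's code), the feedback matrix can be reduced to a single fixed connection per hidden neuron, so each neuron's error signal is just a product of scalar terms: its one feedback weight, the single output error it is wired to, and its local derivative.

import numpy as np

rng = np.random.default_rng(1)
n_hid, n_out = 256, 10

# Extremely sparse feedback: each hidden neuron is wired to exactly one
# output error component through one fixed random feedback weight.
assigned_output = rng.integers(0, n_out, size=n_hid)   # which error each neuron sees
feedback_weight = rng.normal(0.0, 0.01, size=n_hid)    # one scalar weight per neuron

def hidden_delta_sdfa(e, dz1):
    # e: output error (length n_out); dz1: local activation derivative (length n_hid).
    # Each neuron's error is feedback_weight[j] * e[assigned_output[j]] * dz1[j],
    # i.e. a product of scalars; the per-synapse weight update then only
    # multiplies this by the presynaptic activation.
    return feedback_weight * e[assigned_output] * dz1

Because only one error value per neuron has to be delivered, memory and compute can be partitioned around the most efficient forward pass, as described in the Abstract.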


Introduction

The demise of Dennard scaling (Dennard et al., 1974) and the decline of Moore's Law (Moore, 1965) have exposed the fundamental scaling limitations of von Neumann models of computing. This transition is accompanied by the realization that, in a fast-evolving, socially interconnected world, we are observing a seismic shift in the amount of unstructured data that needs to be processed in real-time (Najafabadi et al., 2015), which has heralded the third wave of Artificial Intelligence and the exponential growth of Machine Learning in data analytics, real-time control, computer vision, robotics, and beyond. Moving forward, computing technology will heavily penalize the separation of data and compute, and we need to marry the two in better ways to handle emergent applications.
