Abstract

Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally intensive, and this has motivated the search for novel computing architectures targeting this application. A computational memory unit with nanoscale resistive memory devices organized in crossbar arrays could store the synaptic weights in their conductance states and perform the expensive weighted summations in place in a non-von Neumann manner. However, updating the conductance states in a reliable manner during the weight update process is a fundamental challenge that limits the training accuracy of such an implementation. Here, we propose a mixed-precision architecture that combines a computational memory unit performing the weighted summations and imprecise conductance updates with a digital processing unit that accumulates the weight updates in high precision. A combined hardware/software training experiment of a multilayer perceptron based on the proposed architecture using a phase-change memory (PCM) array achieves 97.73% test accuracy on the task of classifying handwritten digits (based on the MNIST dataset), within 0.6% of the software baseline. The architecture is further evaluated using accurate behavioral models of PCM on a wide class of networks, namely convolutional neural networks, long short-term memory networks, and generative adversarial networks. Accuracies comparable to those of floating-point implementations are achieved without being constrained by the non-idealities associated with the PCM devices. A system-level study demonstrates a 172× improvement in energy efficiency of the architecture when used for training a multilayer perceptron compared with a dedicated fully digital 32-bit implementation.
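
To make the architecture described above concrete, the following is a minimal sketch of the accumulate-and-transfer idea: the crossbar performs the weighted summations with imprecise analog conductances, while a digital unit accumulates the weight updates in high precision and transfers them to the devices only in coarse, granularity-sized increments. The names (chi for the accumulator, epsilon for the device update granularity, pulse_step) and the simple additive conductance model are our own illustrative assumptions, not the paper's implementation.

```python
import numpy as np

# Minimal sketch of the accumulate-and-transfer weight update, assuming a
# simple additive conductance model. Names (chi, epsilon, pulse_step) are
# illustrative and not taken from the paper's implementation.

def forward(x, g_plus, g_minus, noise_std=0.0):
    """Weighted summation performed on the crossbar: the effective weight is
    the difference of two conductances, read with optional analog noise."""
    w_eff = g_plus - g_minus
    if noise_std > 0.0:
        w_eff = w_eff + np.random.normal(0.0, noise_std, size=w_eff.shape)
    return x @ w_eff

def mixed_precision_update(chi, delta_w, g_plus, g_minus, epsilon, pulse_step):
    """Accumulate weight updates in high precision (chi) and transfer them to
    the analog conductances only in multiples of the device granularity."""
    chi = chi + delta_w                  # high-precision digital accumulation
    n_pulses = np.trunc(chi / epsilon)   # whole granularity-sized steps
    chi = chi - n_pulses * epsilon       # keep the residue for future updates
    # Imprecise device programming: positive steps increase G+, negative G-.
    g_plus = g_plus + np.where(n_pulses > 0, n_pulses, 0.0) * pulse_step
    g_minus = g_minus + np.where(n_pulses < 0, -n_pulses, 0.0) * pulse_step
    return chi, g_plus, g_minus
```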

Highlights

  • Inspired by the adaptive parallel computing architecture of the brain, deep neural networks (DNNs) consist of layers of neurons and weighted interconnections called synapses

  • A training simulator incorporating accurate PCM behavioral models is used to evaluate the efficacy of the mixed-precision computational memory architecture (MCA) on three networks: a convolutional neural network (CNN) for classifying CIFAR-10 images, an LSTM network for character-level language modeling, and a generative adversarial network (GAN) for image synthesis based on the MNIST dataset

  • We demonstrated via experiments and simulations that the MCA can train phase-change memory (PCM)-based analog synapses in DNNs to achieve accuracies comparable to those of the floating-point software training baselines

Summary

INTRODUCTION

Inspired by the adaptive parallel computing architecture of the brain, deep neural networks (DNNs) consist of layers of neurons and weighted interconnections called synapses. In fully analog in-memory training schemes, every weight update is applied directly to the device conductance, which has significant ramifications for device endurance and for the number of conductance states required to achieve accurate training (Gokmen and Vlasov, 2016; Yu, 2018). Such a weight update scheme is best suited for fully connected networks trained one sample at a time and is limited to training with stochastic gradient descent without momentum, a severe constraint on its applicability to a wide range of DNNs. The use of convolution layers, weight updates based on a mini-batch of samples as opposed to a single example, optimizers such as ADAM (Kingma and Ba, 2015), and techniques such as batch normalization (Ioffe and Szegedy, 2015) have been crucial for achieving high learning accuracy in recent DNNs. There is also a significant body of work in the conventional digital domain on reduced-precision arithmetic for accelerating DNN training (Courbariaux et al., 2015; Gupta et al., 2015; Merolla et al., 2016; Hubara et al., 2017; Zhang et al., 2017). Here, we propose a mixed-precision computational memory architecture (MCA) in which a computational memory unit performs the weighted summations and imprecise conductance updates, while a digital processing unit accumulates the weight updates in high precision. We validate the approach through simulations to train a convolutional neural network (CNN) on the CIFAR-10 dataset, a long short-term memory (LSTM) network on the Penn Treebank dataset, and a generative adversarial network (GAN) to generate MNIST digits; a sketch of how an arbitrary optimizer fits into this scheme follows below.
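
Because the high-precision digital accumulator decouples gradient computation from device programming, the updates fed into it can come from any optimizer and any mini-batch size. The sketch below illustrates this with a mini-batch Adam step computed in the digital unit, whose result would then feed the accumulate-and-transfer routine sketched after the abstract; hyperparameters and function names are illustrative assumptions, not the paper's code.

```python
import numpy as np

# Illustrative sketch: the digital unit can run any optimizer (here Adam over
# a mini-batch gradient) before the result is accumulated and transferred to
# the devices. Hyperparameters and names are assumptions, not the paper's code.

def adam_delta_w(grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam step computed in high precision in the digital unit."""
    m = beta1 * m + (1.0 - beta1) * grad
    v = beta2 * v + (1.0 - beta2) * grad**2
    m_hat = m / (1.0 - beta1**t)            # bias-corrected first moment
    v_hat = v / (1.0 - beta2**t)            # bias-corrected second moment
    delta_w = -lr * m_hat / (np.sqrt(v_hat) + eps)
    return delta_w, m, v

# Usage (hypothetical): `grad` is a mini-batch gradient obtained with
# backpropagation whose forward/backward passes use crossbar weighted sums;
# the resulting delta_w then feeds the accumulate-and-transfer routine, e.g.
#   delta_w, m, v = adam_delta_w(grad, m, v, t)
#   chi, g_plus, g_minus = mixed_precision_update(chi, delta_w,
#                                                 g_plus, g_minus,
#                                                 epsilon, pulse_step)
```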

Mixed-Precision Computational Memory Architecture
Characterization and Modeling of PCM Devices
Training Experiment for Handwritten Digit Classification
Training Simulations of Larger Networks
DISCUSSION
DATA AVAILABILITY STATEMENT
PCM-Based Hardware Platform
Mixed-Precision Training Experiment
Inference After Training
Training Simulations of Larger Networks With PCM Model
Energy Estimation of MCA and Comparison