Abstract

In a previous work, we detailed the requirements for obtaining maximal deep learning performance benefit by implementing fully connected deep neural networks (DNNs) in the form of arrays of resistive devices. Here we extend the concept of Resistive Processing Unit (RPU) devices to convolutional neural networks (CNNs). We show how to map the convolutional layers to fully connected RPU arrays such that the parallelism of the hardware can be fully utilized in all three cycles of the backpropagation algorithm. We find that the noise and bound limitations imposed by the analog nature of the computations performed on the arrays significantly affect the training accuracy of the CNNs. Noise and bound management techniques are presented that mitigate these problems without adding any complexity to the analog circuits; they can instead be implemented in the digital circuits. In addition, we discuss digitally programmable update management and device variability reduction techniques that can be applied selectively to some of the layers in a CNN. We show that a combination of all of these techniques enables the successful application of the RPU concept for training CNNs. The techniques discussed here are more general and can be applied beyond CNN architectures, thereby extending the applicability of the RPU approach to a large class of neural network architectures.
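To make the mapping of a convolutional layer onto a fully connected RPU array concrete, a standard approach is the im2col unrolling: every local patch of the input is flattened into a column, so the convolution becomes a single matrix-matrix multiplication that the array can perform column by column in parallel. The sketch below is a minimal, illustrative NumPy version assuming a single-channel input and stride 1; the function name, shapes, and sizes are our own choices, not taken from the paper.

```python
import numpy as np

def im2col(x, k):
    """Unroll all k x k patches of a single-channel image x into columns."""
    h, w = x.shape
    out_h, out_w = h - k + 1, w - k + 1
    cols = np.empty((k * k, out_h * out_w))
    for i in range(out_h):
        for j in range(out_w):
            cols[:, i * out_w + j] = x[i:i + k, j:j + k].ravel()
    return cols

# A convolutional layer as one matrix-matrix product:
# rows of W are the flattened kernels, columns of X are input patches.
x = np.random.randn(28, 28)     # input feature map (illustrative size)
W = np.random.randn(16, 5 * 5)  # 16 kernels of size 5x5, flattened
X = im2col(x, 5)                # shape (25, 576)
y = W @ X                       # forward cycle: one parallel MVM per column
```

In this view, each column of X is presented to the array as one analog matrix-vector multiplication, so the same fully connected hardware serves the forward, backward, and update cycles of the convolutional layer.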

Highlights

  • Deep neural network (DNN) (LeCun et al., 2015) based models have demonstrated unprecedented accuracy, in some cases exceeding human-level performance, in cognitive tasks such as object recognition (Krizhevsky et al., 2012; He et al., 2015; Simonyan and Zisserman, 2015; Szegedy et al., 2015), speech recognition (Hinton et al., 2012), and natural language processing (Collobert et al., 2012).

  • Our analysis shows that the larger test error is mainly due to analog noise introduced during the backward cycle and to the signal bounds imposed in the forward cycle on the final Resistive Processing Unit (RPU) array, W4 (see the sketch after this list).

  • The combination of all of the management techniques with the 13-device mapping on the second convolutional layer (K2) brings the model's test error to 0.8%. The performance of this final RPU model is almost indistinguishable from the floating-point (FP) baseline model and demonstrates the successful application of the RPU approach for training convolutional neural networks (CNNs).
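To illustrate the noise and bound management idea referenced above, the following sketch shows one digital-domain strategy consistent with the paper's description: rescale the input vector so that its largest element uses the full signal range (noise management), and if the analog output still saturates at the bound, shrink the input further and retry (bound management). The `analog_mvm` stand-in, the bound value, and the scaling policy here are illustrative assumptions, not the paper's exact circuit behavior.

```python
import numpy as np

BOUND = 12.0  # assumed saturation bound of the analog output (illustrative)

def analog_mvm(W, x, noise_std=0.06):
    """Stand-in for the analog array: noisy MVM clipped at the signal bound."""
    y = W @ x + noise_std * np.random.randn(W.shape[0])
    return np.clip(y, -BOUND, BOUND)

def managed_mvm(W, x):
    """Noise management: scale input so its max |element| is 1, rescale output.
    Bound management: reduce the input amplitude and retry while outputs saturate."""
    alpha = np.max(np.abs(x)) or 1.0   # avoid dividing by zero for all-zero inputs
    while True:
        y = analog_mvm(W, x / alpha)
        if np.max(np.abs(y)) < BOUND:  # no saturation: result is valid
            return alpha * y           # undo the input scaling digitally
        alpha *= 2.0                   # outputs clipped: shrink input and retry
```

Because both rescalings happen in the digital periphery, the analog array itself is unchanged, matching the abstract's point that these management techniques add no analog-circuit complexity.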


Summary

Introduction

Deep neural network (DNN) (LeCun et al., 2015) based models have demonstrated unprecedented accuracy, in some cases exceeding human-level performance, in cognitive tasks such as object recognition (Krizhevsky et al., 2012; He et al., 2015; Simonyan and Zisserman, 2015; Szegedy et al., 2015), speech recognition (Hinton et al., 2012), and natural language processing (Collobert et al., 2012). These accomplishments are made possible by advances in computing architectures and the availability of large amounts of labeled training data. In this work, we show that the RPU concept is applicable to CNNs.

