Stochastic learning in deep neural networks based on nanoscale PCMO device characteristics

Anakha V Babu,Udayan Ganguly,Sandip Lashkare,Bipin Rajendran

doi:10.1016/j.neucom.2018.09.019

Abstract

Abstract Deep Neural Networks (DNN) have proven to be highly effective in extracting high level abstractions of input data using multiple neural network layers. However, the huge training times for DNNs in traditional von-Neumann machines have hindered their ubiquitous adoption in IoT and other mobile computing platforms. Recently, acceleration of DNN with a time complexity of O ( 1 ) was proposed using the idea of stochastic weight update with resistive processing units (RPU). However, it has been projected that RPU devices require more than 1000 reliable conductance levels, which is a stringent requirement to realize in memristive devices. Here, we study the optimization of stochastic learning for DNNs for the hand-written digit classification benchmark using the characteristics of non-filamentary Pr0.7Ca0.3MnO3 (PCMO) devices that are fabricated using a standard lithography process. The electrical characteristics of these devices exhibit a linear conductance response with an on-off ratio of 1.8 with 26 levels and significant programming variability. We captured these non-ideal behaviors of experimental PCMO device in the simulations to demonstrate stochastic learning with O ( 1 ) time complexity, achieving a test accuracy of 88.1% for the hand-written digit recognition benchmark. While the linearity, dynamic range, bit resolution, programming variability and the reset rate of the device conductance to account for its limited dynamic range have to be co-optimized for improving the training efficiency, we show that programming variability has the paramount role in determining the network performance. We also show that if devices with reduced programming variability (5x smaller compared to our experimental device) can be developed keeping all other parameters constant, it is possible to boost the network performance as high as 95%. We also observe that the performance of stochastic DNNs with memristive synapses is independent of the on-off ratio of the devices for a fixed programming variability. Thus, programming variability represents a new optimization corner for on-chip learning of stochastic DNNs. Further, we also evaluate the performance of stochastic inference engines to noise corrupted input test data as a function of the variability in the memristive devices. We demonstrate that noise-resilient inference engines can be achieved if 100 bits are used for stochastic encoding during inference even though the expensive network training can be done with as few as 10 bits. Thus, our studies emphasize the need for optimization of learning strategies for DNNs based on the non-ideal characteristics of experimental nanoscale devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic learning in deep neural networks based on nanoscale PCMO device characteristics

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Sep 20, 2018
Citations: 14

Similar Papers

An Error Compensation Technique for Low-Voltage DNN Accelerators
Daehan Ji ... Jongsun Park
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 29
Daehan Ji, et. al.Daehan Ji ... Jongsun Park
15 Dec 2020
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 29

A Reconfigurable Deep Neural Network on Chip Design with Flexible Convolutional Operations
Kun-Chih Chen ... Yi-Sheng Liao
-
Kun-Chih Chen, et. al.Kun-Chih Chen ... Yi-Sheng Liao
02 Oct 2022
02 Oct 2022

Fast FPGA-Based Emulation for ReRAM-Enabled Deep Neural Network Accelerator
Yongquan Shi ... Guanghui He
-
Yongquan Shi, et. al.Yongquan Shi ... Guanghui He
01 May 2021
01 May 2021

Stochastic deep learning in memristive networks
Anakha V Babu ... Bipin Rajendran
-
Anakha V Babu, et. al.Anakha V Babu ... Bipin Rajendran
01 Dec 2017
01 Dec 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic learning in deep neural networks based on nanoscale PCMO device characteristics

Abstract

Talk to us

Similar Papers

More From: Neurocomputing