Adaptive weight-bit inversion for state error reduction for robust and efficient deep neural network inference using MLC NAND Flash

Jaehun Jang,Jong Hwan Ko

doi:10.1016/j.sysarc.2022.102400

Abstract

When Flash memory is used to store the weights of a deep neural network (DNN), the inference accuracy can degrade owing to the state errors of the Flash memory. To protect the weights from state errors, the existing methods rely on an error correction code (ECC) or parity, which can incur power/storage overhead. We propose a weight-bit inversion method that minimizes accuracy loss caused by state errors without using ECC or parity. First, the method applies weight-bit inversion for state elimination (WISE), which removes the most error-prone state from MLC NAND, thereby improving the error robustness and the most significant bit (MSB) page read speed. If the initial accuracy loss caused by the WISE is unacceptable, we apply weight-bit inversion for state error reduction (WISER), which reduces weight mapping to error-prone states with minimum changes in weight value. To further improve the read speed with minimum accuracy loss, we propose an adaptive weight-bit inversion scheme that selectively applies WISE or WISER to the unit of a weight group. The simulation results imply that after 16K program-erase cycles in NAND Flash, WISER reduces the CIFAR-100 accuracy loss by 1.33X for LeNet-5, 2.92X for VGG-16, and 2.74X for Resnet-20 compared with the existing methods. In addition, the adaptive inversion technique improves the read speed by 48.6% without accuracy loss, compared with the WISER-only scheme.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adaptive weight-bit inversion for state error reduction for robust and efficient deep neural network inference using MLC NAND Flash

Abstract

Talk to us

Similar Papers

More From: Journal of Systems Architecture

Lead the way for us

Journal: Journal of Systems Architecture	Publication Date: Jan 15, 2022
Citations: 2

Similar Papers

Subthreshold operation of SONOS analog memory to enable accurate low-power neural network inference
V Agrawal ... L Hinh
-
V Agrawal, et. al.V Agrawal ... L Hinh
03 Dec 2022
03 Dec 2022

WISER: Deep Neural Network Weight-bit Inversion for State Error Reduction in MLC NAND Flash
Jaehun Jang ... Jong Hwan Ko
-
Jaehun Jang, et. al.Jaehun Jang ... Jong Hwan Ko
01 Feb 2021
01 Feb 2021

Dynamic Vpass Controlled Program Scheme and Optimized Erase Vth Control for High Program Inhibition in MLC NAND Flash Memories
Ki-Tae Park ... Myounggon Kang
IEEE Journal of Solid-State Circuits | VOL. 45
Ki-Tae Park, et. al.Ki-Tae Park ... Myounggon Kang
01 Oct 2010
IEEE Journal of Solid-State Circuits | VOL. 45

Median-Pi artificial neural network for forecasting
Erol Egrioglu ... Ali Zafer Dalar
Neural Computing and Applications | VOL. 31
Erol Egrioglu, et. al.Erol Egrioglu ... Ali Zafer Dalar
13 May 2017
Neural Computing and Applications | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive weight-bit inversion for state error reduction for robust and efficient deep neural network inference using MLC NAND Flash

Abstract

Talk to us

Similar Papers

More From: Journal of Systems Architecture