Abstract

Due to its ultrahigh density and commercially mature fabrication technology, 3-D NAND flash memory has been proposed as an attractive candidate for an inference engine targeting deep neural network (DNN) workloads. However, the peripheral circuits of conventional 3-D NAND flash must be modified to enable compute-in-memory (CIM), and the chip architecture needs to be redesigned for an optimized dataflow. In this work, we present a design of a 3-D NAND-CIM accelerator based on macro parameters from an industry-grade prototype chip. The DNN inference performance is evaluated using the DNN+NeuroSim framework. To exploit the ultrahigh density of 3-D NAND flash, both input and weight mapping strategies are introduced to improve the throughput. Benchmarking on the VGG network was performed across the technology candidates for CIM, including SRAM, resistive random access memory (RRAM), and 3-D NAND. Compared with similar designs using SRAM or RRAM, the results show that the 3-D NAND-based CIM design achieves not only 17%–24% of the chip size but also 1.9–2.7 times higher energy efficiency for 8-bit precision inference. The inference accuracy drop induced by 3-D NAND string current drift and variation is also investigated. No accuracy degradation due to current variation was observed with the proposed input mapping scheme, whereas the accuracy is sensitive to current drift, implying that compensation schemes are needed to maintain inference accuracy.
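To make the drift and variation study concrete, the following is a minimal sketch, not taken from the paper or the DNN+NeuroSim framework, of how string current drift (a systematic shift) and cell-to-cell variation (random noise) could be injected into conductance-encoded weights to probe their effect on inference outputs. All function names and parameter values (e.g., the drift and sigma settings) are illustrative assumptions.

```python
# Illustrative sketch: inject 3-D NAND string current drift and variation
# into quantized, conductance-encoded weights and measure the resulting
# output error of a toy fully connected layer.
import numpy as np

rng = np.random.default_rng(0)

def quantize_weights(w, bits=8):
    """Uniformly quantize weights to signed integer levels (illustrative)."""
    levels = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(w)) / levels
    return np.round(w / scale).astype(np.int32), scale

def perturb_currents(w_int, scale, drift=0.0, sigma=0.0, rng=rng):
    """Map integer weight levels to nominal string currents, then apply a
    systematic fractional drift and a random relative variation (sigma)."""
    i_cell = w_int.astype(np.float64)                              # nominal current per level
    i_cell = i_cell * (1.0 + drift)                                # systematic current drift
    i_cell = i_cell * (1.0 + sigma * rng.standard_normal(i_cell.shape))  # cell-to-cell variation
    return i_cell * scale                                          # back to weight domain

# Toy layer: relative output error serves as a proxy for accuracy impact.
w = rng.standard_normal((256, 128)) * 0.05
x = rng.standard_normal((32, 256))
w_int, scale = quantize_weights(w, bits=8)

y_ref = x @ (w_int * scale)
for drift, sigma in [(0.0, 0.02), (0.05, 0.0), (0.05, 0.02)]:
    y = x @ perturb_currents(w_int, scale, drift=drift, sigma=sigma)
    err = np.linalg.norm(y - y_ref) / np.linalg.norm(y_ref)
    print(f"drift={drift:.2f}, sigma={sigma:.2f} -> relative output error {err:.3f}")
```

In such a toy setup, zero-mean random variation tends to average out over long NAND strings and large dot products, while a systematic drift biases every product in the same direction, which is consistent with the abstract's observation that accuracy is robust to variation but sensitive to drift.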
