Two-Way Transpose Multibit 6T SRAM Computing-in-Memory Macro for Inference-Training AI Edge Chips

Jian‐Wei Su ,Yung-Ning Tu,Fu-Chun Chang,Yen-Lin Chung,Yuan Wu,Ren-Shuo Liu,Jin-Sheng Ren,Hongwu Jiang,Chih-Cheng Hsieh,Sih-Han Li,Pei-Jung Lu,Ruhui Liu,Ting-Wei Chang,Xin Si,Shanshi Huang,Chung-Chuan Lo,Shyh-Shyuan Sheu,Yen-Chi Chou,Shimeng Yu,Wei-Hsing Huang,Chih-I Wu,Kea-Tiong Tang,Chun–Jen Liu ,Jinghong Wang ,Meng‐Fan Chang

doi:10.1109/jssc.2021.3108344

Abstract

Computing-in-memory (CIM) based on SRAM is a promising approach to achieving energy-efficient multiply-and-accumulate (MAC) operations in artificial intelligence (AI) edge devices; however, existing SRAM-CIM chips support only DNN inference. The flow of training data requires that CIM arrays perform convolutional computation using transposed weight matrices. This article presents a two-way transpose (TWT) multiply cell with high resistance to process variation and a novel read scheme that uses input-aware zone prediction of maximum partial MAC values to enhance the signal margin for robust readout. A 28-nm 64-kb TWT CIM macro fabricated using foundry-provided compact 6T-SRAM cells achieved <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$T_{\text {AC}}$ </tex-math></inline-formula> of 3.8–21 ns and energy efficiency of 7–61.1 TOPS/W in performing MAC operations using 2–8-b inputs, 4–8-b weights, and 10–20-b outputs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two-Way Transpose Multibit 6T SRAM Computing-in-Memory Macro for Inference-Training AI Edge Chips

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Solid-State Circuits

Lead the way for us

Journal: IEEE Journal of Solid-State Circuits	Publication Date: Feb 1, 2022
Citations: 29

Similar Papers

24.1 A 1Mb Multibit ReRAM Computing-In-Memory Macro with 14.6ns Parallel MAC Computing Time for CNN Based AI Edge Processors
Cheng-Xin Xue ...
-
Cheng-Xin Xue, et. al.Cheng-Xin Xue ...
01 Feb 2019
24.1 A 1Mb Multibit ReRAM Computing-In-Memory Macro with 14.6ns Parallel MAC Computing Time for CNN Based AI Edge Processors
Cheng-Xin Xue ...

Embedded 1-Mb ReRAM-Based Computing-in- Memory Macro With Multibit Input and Weight for CNN-Based AI Edge Processors
Cheng-Xin Xue ...
IEEE Journal of Solid-State Circuits | VOL. 55
Cheng-Xin Xue, et. al.Cheng-Xin Xue ...
05 Dec 2019
IEEE Journal of Solid-State Circuits | VOL. 55

AND8T SRAM Macro with Improved Linearity for Multi-Bit In-Memory Computing
Vishal Sharma ... Ju Eon Kim
-
Vishal Sharma, et. al.Vishal Sharma ... Ju Eon Kim
01 May 2021
01 May 2021

A Reconfigurable 16Kb AND8T SRAM Macro With Improved Linearity for Multibit Compute-In Memory of Artificial Intelligence Edge Devices
Vishal Sharma ... Hyunjoon Kim
IEEE Journal on Emerging and Selected Topics in Circuits and Systems | VOL. 12
Vishal Sharma, et. al.Vishal Sharma ... Hyunjoon Kim
01 Jun 2022
IEEE Journal on Emerging and Selected Topics in Circuits and Systems | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Way Transpose Multibit 6T SRAM Computing-in-Memory Macro for Inference-Training AI Edge Chips

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Solid-State Circuits