Abstract
Deploying neural network (NN) models on Internet-of-Things (IoT) devices is important for enabling artificial intelligence (AI) at the edge and realizing the AI-of-Things (AIoT). However, the high energy consumption and bandwidth requirements of NN models restrict AI applications on battery-limited equipment. Compute-In-Memory (CIM), featuring high energy efficiency, provides new opportunities for IoT deployment of NNs. However, the design of full CIM-based systems is still at an early stage, lacking system-level demonstrations and vertical optimization for running end-to-end AI applications. In this paper, we demonstrate a low-power heterogeneous microprocessor System-on-Chip (SoC) with an all-digital SRAM CIM accelerator and rich data-acquisition interfaces for end-to-end AIoT NN inference. A dedicated reconfigurable dataflow controller for CIM computation greatly lowers the bandwidth requirement on the system bus and improves execution efficiency. The all-digital SRAM CIM array embeds NAND-based bit-serial multiplication within the readout sense amplifiers, balancing storage density and system-level throughput. Our chip achieves a throughput of 12.8 GOPS with an energy efficiency of 10 TOPS/W. Benchmarked on the four tasks in MLPerf Tiny, experimental results show a 1.8x to 2.9x inference speedup over a baseline CIM processor.
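For readers unfamiliar with the bit-serial scheme mentioned above, the following minimal C sketch models the general idea of bit-serial multiply-accumulate: activation bits are streamed one position per cycle, partial products are formed by gating each stored weight with the current activation bit, and the results are shifted and accumulated. The function name, bit widths, and unsigned encoding are illustrative assumptions for a software model, not the paper's actual NAND-based sense-amplifier circuit.

```c
#include <stdint.h>
#include <stdio.h>

/* Illustrative software model of bit-serial multiply-accumulate.
 * Assumptions: 8-bit unsigned activations streamed LSB-first and 8-bit
 * unsigned weights; the real chip forms the partial products inside the
 * SRAM readout sense amplifiers rather than in software. */
static uint32_t bit_serial_mac(const uint8_t *weights,
                               const uint8_t *activations,
                               int n)
{
    uint32_t acc = 0;
    for (int bit = 0; bit < 8; bit++) {           /* one activation bit per cycle */
        uint32_t partial = 0;
        for (int i = 0; i < n; i++) {
            /* Partial product: the weight gated by the current activation bit.
             * A NAND-based array would produce the complemented product and
             * correct for it later; the resulting dot-product term is the same. */
            uint8_t a_bit = (activations[i] >> bit) & 1u;
            partial += (uint32_t)(weights[i] & (a_bit ? 0xFFu : 0x00u));
        }
        acc += partial << bit;                    /* shift-and-accumulate */
    }
    return acc;
}

int main(void)
{
    uint8_t w[4] = {3, 5, 7, 9};
    uint8_t a[4] = {2, 4, 6, 8};
    /* Expected: 3*2 + 5*4 + 7*6 + 9*8 = 140 */
    printf("dot product = %u\n", bit_serial_mac(w, a, 4));
    return 0;
}
```

Accumulating one bit plane at a time is what lets the array trade latency for density: the same sense-amplifier logic is reused across bit positions instead of instantiating a full multiplier per column.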