A 4.4–75-TOPS/W 14-nm Programmable, Performance- and Precision-Tunable All-Digital Stochastic Computing Neural Network Inference Accelerator

Wojciech Romaszkan,Rahul Garg,Jiyue Yang,Sudhakar Pamarti,Tianmu Li,Puneet Gupta

doi:10.1109/lssc.2022.3200064

Abstract

We present the first programmable and precision-tunable Stochastic Computing (SC) neural network (NN) inference accelerator. The use of SC makes it possible to achieve multiply-accumulate (MAC) density of 38.4k MAC/mm2, enabling a level of spatial data reuse unachievable to conventional, fixed-point architectures. This extensive reuse amortizes the cost of SC conversion and reduces the number of memory accesses, which can otherwise consume significant energy and latency. Our accelerator is a stand-alone architecture, with a custom instruction set architecture (ISA), and support for end-to-end model inference with convolutional and fully-connected layers of variable input and filter sizes. Further, it demonstrates extensive accuracy-latency trade-offs by varying the stream length. The 14nm demonstration chip achieves 2.4 TOPS and 75 TOPS/W peak throughput and energy efficiency, outperforming comparable fixed-point accelerators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A 4.4–75-TOPS/W 14-nm Programmable, Performance- and Precision-Tunable All-Digital Stochastic Computing Neural Network Inference Accelerator

Abstract

Talk to us

Similar Papers

More From: IEEE Solid-State Circuits Letters

Lead the way for us

Journal: IEEE Solid-State Circuits Letters	Publication Date: Jan 1, 2022
Citations: 5

Similar Papers

Optimizing Neural Network Inference in Edge Robotics by Harnessing FPGA Hardware Acceleration
Kolli Himantha Rao
Journal of Electrical Systems | VOL. 20
Kolli Himantha Rao Kolli Himantha Rao
13 Apr 2024
Journal of Electrical Systems | VOL. 20

Math Doesn't Have to be Hard
Andrew Boutros ... Vaughn Betz
-
Andrew Boutros, et. al.Andrew Boutros ... Vaughn Betz
20 Feb 2019
20 Feb 2019

Memory System Designed for Multiply-Accumulate (MAC) Engine Based on Stochastic Computing
Xinyue Zhang ... Runsheng Wang
-
Xinyue Zhang, et. al.Xinyue Zhang ... Runsheng Wang
01 Jun 2019
01 Jun 2019

GEO: Generation and Execution Optimized Stochastic Computing Accelerator for Neural Networks
Tianmu Li ... Wojciech Romaszkan
-
Tianmu Li, et. al.Tianmu Li ... Wojciech Romaszkan
01 Feb 2021
01 Feb 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A 4.4–75-TOPS/W 14-nm Programmable, Performance- and Precision-Tunable All-Digital Stochastic Computing Neural Network Inference Accelerator

Abstract

Talk to us

Similar Papers

More From: IEEE Solid-State Circuits Letters