Abstract
This paper presents the Eidetic architecture, an SRAM-based ASIC neural network accelerator that eliminates the need to continuously load weights from off-chip memory while also minimizing off-chip traffic for intermediate results. Using in-situ arithmetic in the SRAM arrays, the architecture supports a variety of precision types, enabling effective inference. We also present several data-mapping policies for matrix-vector-based networks (RNNs and MLPs) on the Eidetic architecture and describe the tradeoffs involved. With this architecture, multiple layers of a network can be mapped concurrently, storing both the layer weights and the intermediate results on-chip and removing the energy and latency penalties of off-chip memory accesses. We evaluate Eidetic on the encoder of Google's Neural Machine Translation system (GNMT) and demonstrate a 17.20× increase in throughput and a 7.77× reduction in average latency over a single TPUv2 chip.
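To make the "in-situ arithmetic in the SRAM arrays" idea concrete, the sketch below models the bit-serial compute style commonly used by in-SRAM accelerators: bit-planes of stored weights are ANDed with bit-planes of the input vector, reduced along each row, and combined by shift-and-add. This is an illustrative model only; the function name and the `w_bits`/`x_bits` precision parameters are assumptions for exposition, not Eidetic's actual circuit or interface. It does show why per-bit operation makes variable precision a simple loop-count knob.

```python
import numpy as np

def bit_serial_matvec(weights, x, w_bits=8, x_bits=8):
    """Illustrative bit-serial evaluation of y = W @ x.

    Models an in-SRAM compute array: instead of full-width
    multipliers, iterate over bit positions of inputs and weights.
    Precision (w_bits, x_bits) is just the number of bit-plane
    passes. Hypothetical sketch; unsigned integers assumed.
    """
    rows, _ = weights.shape
    acc = np.zeros(rows, dtype=np.int64)
    for i in range(x_bits):                  # one pass per input bit-plane
        x_plane = (x >> i) & 1               # bit i of every input element
        for j in range(w_bits):              # one pass per weight bit-plane
            w_plane = (weights >> j) & 1     # bit j of every stored weight
            # AND of two bit-planes is a 1-bit multiply; the row-wise
            # reduction stands in for the array's popcount/adder tree.
            partial = w_plane @ x_plane
            acc += partial << (i + j)        # shift-and-add accumulation
    return acc

# Sanity check against a full-precision matrix-vector product.
rng = np.random.default_rng(0)
W = rng.integers(0, 256, size=(4, 16), dtype=np.int64)
x = rng.integers(0, 256, size=16, dtype=np.int64)
assert np.array_equal(bit_serial_matvec(W, x), W @ x)
```

Under this model, lowering precision (e.g., `w_bits=4`) cuts the number of bit-plane passes, and hence latency and energy, roughly linearly, which is one way an in-SRAM design can trade accuracy for throughput.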