ATRIA: A Bit-Parallel Stochastic Arithmetic Based Accelerator for In-DRAM CNN Processing

Supreeth Mysore Shivanandamurthy,Sayed Ahmad Salehi,Ishan G Thakkar

doi:10.1109/isvlsi51109.2021.00045

Supreeth Mysore Shivanandamurthy, Sayed Ahmad Salehi + Show 1 more

Open Access

PDF Available

https://doi.org/10.1109/isvlsi51109.2021.00045

Copy DOI

Export

Save

Cite

Publication Date: Jul 1, 2021

Citations: 2

Affiliation: University of Kentucky

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

With the rapidly growing use of Convolutional Neural Networks (CNNs) in real-world applications related to machine learning and Artificial Intelligence (Al), several hardware accelerator designs for CNN inference and training have been proposed recently. In this paper, we present ATRIA, a novel bit-pArallel sTochastic aRithmetic based In-DRAM Accelerator for energy-efficient and high-speed inference of CNNs. ATRIA employs light-weight modifications in DRAM cell arrays to implement bit-parallel stochastic arithmetic based acceleration of multiply-accumulate (MAC) operations inside DRAM. ATRIA significantly improves the latency, throughput, and efficiency of processing CNN inferences by performing 16 MAC operations in only five consecutive memory operation cycles. We mapped the inference tasks of four benchmark CNNs on ATRIA to compare its performance with five state-of-the-art in-DRAM CNN accelerators from prior work. The results of our analysis show that ATRIA exhibits only 3.5% drop in CNN inference accuracy and still achieves improvements of up to 3.2× in frames-per-second (FPS) and up to 10× in efficiency (FPS/W/mm <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ), compared to the best-performing in-DRAM accelerator from prior work.

Full Text