CASH-RAM: Enabling In-Memory Computations for Edge Inference Using Charge Accumulation and Sharing in Standard 8T-SRAM Arrays

Amogh Agrawal, Adarsh Kosta, Sangamesh Kodge, Kaushik Roy, Dong Eun Kim

doi:10.1109/jetcas.2020.3014250

Abstract

Machine learning (ML) workloads, being memory- and compute-intensive, consume large amounts of power when run on conventional computing systems, which restricts their implementation to large-scale data centers. Transferring large amounts of data from edge devices to data centers is not only energy-expensive but also sometimes undesirable in security-critical applications. Thus, there is a need for domain-specific hardware primitives for energy-efficient ML processing at the edge. One such approach, in-memory computing, eliminates frequent and unnecessary data transfers between the memory and the compute units by computing on the data directly where it is stored. However, the analog nature of the computations introduces non-idealities, which degrade the overall accuracy of neural networks. In this paper, we propose an in-memory computing primitive for accelerating dot products within standard 8T-SRAM caches using charge sharing. The inherent parasitic capacitance of the bitlines and sourcelines is used to accumulate analog voltages, which can be sensed to obtain an approximate dot product. The charge-sharing approach involves a self-compensation technique that reduces the effects of non-idealities, thereby reducing errors. Our results for ternary-weight neural networks show that, using the proposed compensation approaches, the accuracy degradation is within 1% and 5% of the baseline accuracy for the MNIST and CIFAR-10 datasets, respectively, with an energy-delay product improvement of $38\times$ over the standard von Neumann computing system. We believe that this work can be used in conjunction with existing mitigation techniques, such as re-training approaches, to further enhance system performance.
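As a rough illustration of the charge-sharing idea, the sketch below models an approximate ternary-weight dot product with ideal capacitors. It is not the circuit from the paper: the function names, the equal per-column capacitance, the split into a "positive" and a "negative" line, and the unit voltage `v_unit` are all assumptions made for this example, and non-idealities and the self-compensation technique are not modeled.

```python
def charge_share(voltages, capacitances):
    """Ideal charge sharing: connected capacitors settle at
    total charge divided by total capacitance."""
    total_charge = sum(c * v for c, v in zip(capacitances, voltages))
    return total_charge / sum(capacitances)


def ternary_dot_via_charge_sharing(inputs, weights, v_unit=1.0, cap=1.0):
    """Hypothetical model of a dot product with weights in {-1, 0, +1}.

    Each column charges its (assumed equal) capacitor to v_unit * x when
    its weight selects that line, and to 0 otherwise. Sharing charge
    across the n columns averages the voltages, so the dot product is
    recovered as n * (v_pos - v_neg) / v_unit.
    """
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    if any(w not in (-1, 0, 1) for w in weights):
        raise ValueError("weights must be ternary: -1, 0, or +1")

    n = len(inputs)
    caps = [cap] * n
    # Voltages accumulated on the line for +1 weights and the line for -1 weights.
    pos = [v_unit * x if w == 1 else 0.0 for x, w in zip(inputs, weights)]
    neg = [v_unit * x if w == -1 else 0.0 for x, w in zip(inputs, weights)]

    v_pos = charge_share(pos, caps)
    v_neg = charge_share(neg, caps)
    return n * (v_pos - v_neg) / v_unit


if __name__ == "__main__":
    x = [1.0, 0.5, 0.25, 0.0]
    w = [1, -1, 0, 1]
    print(ternary_dot_via_charge_sharing(x, w))
```

In this idealized model the result matches the exact dot product; in a real array, mismatched capacitances and device non-linearities would perturb the shared voltage, which is the kind of error the compensation approaches in the abstract aim to reduce.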


Journal: IEEE Journal on Emerging and Selected Topics in Circuits and Systems
Publication Date: Sep 1, 2020
Citations: 43
License type: publisher-specific-oa
