POD-RACING: Bulk-Bitwise to Floating-Point Compute in Racetrack Memory for Machine Learning at the Edge

Sébastien Ollivier, Yue Tang, Chayanika Choudhuri, Jingtong Hu, Alex K Jones, Xinyi Zhang

doi:10.1109/mm.2022.3195761

Abstract

Convolutional neural networks (CNNs) have become a ubiquitous algorithm with growing applications in mobile and edge settings. We describe a compute-in-memory (CIM) technique called POD-RACING using Racetrack memory (RM) to accelerate CNNs for edge systems. Using transverse read, a technique that can determine the number of “1”s in multiple adjacent domains, POD-RACING can efficiently implement multioperand bulk-bitwise and addition computations, and two-operand multiplication. We discuss how POD-RACING can implement both variable-precision integer and floating-point arithmetic using digital CIM. This allows both CNN inference and on-device training without expensive data movement to the cloud. Based on these functions, we demonstrate the implementation of several CNNs with backpropagation using RM CIM and compare these to the state-of-the-art implementations of CNN inference and training. During training, POD-RACING improves efficiency by 2×, reduces energy consumption by ≥27%, and increases throughput by ≥18% versus a state-of-the-art field-programmable gate array accelerator.
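To illustrate why a count of “1”s is enough for multioperand bulk-bitwise logic and addition, here is a minimal software sketch. It is not the paper's hardware design: `transverse_read`, `bulk_bitwise`, and `multi_operand_add` are hypothetical names, and the sketch assumes the operands are stacked so that each bit position sees one bit per operand.

```python
def transverse_read(bits):
    """Software stand-in for a transverse read: count the 1s among adjacent bits."""
    return sum(bits)


def bulk_bitwise(operands, width):
    """Derive multioperand AND, OR, and XOR per bit position from the ones count."""
    n = len(operands)
    result = {"and": 0, "or": 0, "xor": 0}
    for pos in range(width):
        count = transverse_read([(x >> pos) & 1 for x in operands])
        result["and"] |= int(count == n) << pos  # all operands are 1
        result["or"] |= int(count >= 1) << pos   # at least one operand is 1
        result["xor"] |= (count & 1) << pos      # odd number of 1s
    return result


def multi_operand_add(operands, width):
    """Add operands column by column: ones count plus carry-in gives sum bit and carry-out."""
    total, carry = 0, 0
    for pos in range(width):
        count = transverse_read([(x >> pos) & 1 for x in operands]) + carry
        total |= (count & 1) << pos  # low bit of the count is the sum bit
        carry = count >> 1           # remaining bits carry into the next column
    return total | (carry << width)  # leftover carry forms the high bits


if __name__ == "__main__":
    print(bulk_bitwise([0b1100, 0b1010, 0b1001], 4))
    print(multi_operand_add([5, 3, 6], 3))
```

In this model, every operation reduces to a threshold or parity test on the same count, which is one way to see how a single read primitive could serve both logic and arithmetic.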



Journal: IEEE Micro
Publication Date: Nov 1, 2022
Citations: 4

Similar Papers

Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems
Benjamin Chen Ming Choong ... Joey Tianyi Zhou
Journal of Systems Architecture | Vol. 128
05 May 2022

Aspects of programming for implementation of convolutional neural networks on multisystem HPC architectures
Sunil Pandey ... Shrish Verma
Journal of Physics: Conference Series | Vol. 2062
01 Nov 2021

Application of bit-serial arithmetic units for FPGA implementation of convolutional neural networks
G Csordas ... B Feher
01 May 2018

Strided Convolution Instead of Max Pooling for Memory Efficiency of Convolutional Neural Networks
Riadh Ayachi ... Mouna Afif
11 Jul 2019
