Stick Buffer Cache v2: Improved Input Feature Map Cache for Reducing off-chip Memory Traffic in CNN Accelerators

Rastislav Struharik,Vuk Vranjkovic

doi:10.1109/telfor48224.2019.8971049

Abstract

Data movement between the Convolutional Neural Network (CNN) accelerators and off-chip memory is critical with respect to the overall power consumption. Minimizing power consumption is particularly important for low power embedded applications. Specific CNN compute patterns offer a possibility of significant data reuse, leading to idea of using specialized on-chip cache memories which enable significant improvement in power consumption. However, due to unique caching pattern present within CNNs, standard cache memories would not be efficient. In this paper novel on-chip cache memory architecture, based on idea of input feature map striping, is proposed, which requires significantly less on-chip memory resources compared to previously proposed solutions. Experiment results show that the proposed cache architecture can reduce on-chip memory size by a factor of 16 or more, while increasing power consumption no more than 15%, compared to some of previously proposed solutions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stick Buffer Cache v2: Improved Input Feature Map Cache for Reducing off-chip Memory Traffic in CNN Accelerators

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Striping input feature map cache for reducing off-chip memory traffic in CNN accelerators
Rastislav Struharik ... Vuk Vranjković
Telfor Journal | VOL. 12
Rastislav Struharik, et. al.Rastislav Struharik ... Vuk Vranjković
01 Jan 2020
Telfor Journal | VOL. 12

Reconfigurable Network-on-Chip based Convolutional Neural Network Accelerator
Arash Firuzan ... Ahmad Khademzadeh
Journal of Systems Architecture | VOL. 129
Arash Firuzan, et. al.Arash Firuzan ... Ahmad Khademzadeh
23 May 2022
Journal of Systems Architecture | VOL. 129

POMMEL: Exploring Off-Chip Memory Energy & Power Consumption in Convolutional Neural Network Accelerators
Alexander Montgomerie-Corcoran ... Christos-Savvas Bouganis
-
Alexander Montgomerie-Corcoran, et. al.Alexander Montgomerie-Corcoran ... Christos-Savvas Bouganis
01 Sep 2021
01 Sep 2021

An Uninterrupted Processing Technique-Based High-Throughput and Energy-Efficient Hardware Accelerator for Convolutional Neural Networks
Md Najrul Islam ... Rahul Shrestha
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 30
Md Najrul Islam, et. al.Md Najrul Islam ... Rahul Shrestha
01 Dec 2022
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stick Buffer Cache v2: Improved Input Feature Map Cache for Reducing off-chip Memory Traffic in CNN Accelerators

Abstract

Talk to us

Similar Papers