Overcoming Data Transfer Bottlenecks in FPGA-based DNN Accelerators via Layer Conscious Memory Management

Xuechao Wei,Yun Liang,Jason Cong

doi:10.1145/3316781.3317875

Abstract

Deep Neural Networks (DNNs) are becoming more and more complex than before. Previous hardware accelerator designs neglect the layer diversity in terms of computation and communication behavior. On-chip memory resources are underutilized for the memory bounded layers, leading to suboptimal performance. In addition, the increasing complexity of DNN structures makes it difficult to do on-chip memory allocation. To address these issues, we propose a layer conscious memory management framework for FPGA-based DNN hardware accelerators. Our framework exploits the layer diversity and the disjoint lifespan information of memory buffers to efficiently utilize the on-chip memory to improve the performance of the layers bounded by memory and thus the entire performance of DNNs. It consists of four key techniques working coordinately with each other. We first devise a memory allocation algorithm to allocate on-chip buffers for the memory bound layers. In addition, buffer sharing between different layers is applied to improve on-chip memory utilization. Finally, buffer prefetching and splitting are used to further reduce latency. Experiments show that our techniques can achieve 1.36X performance improvement compared with previous designs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Overcoming Data Transfer Bottlenecks in FPGA-based DNN Accelerators via Layer Conscious Memory Management

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

OpenCL library of stream memory components targeting FPGAs
Jasmina Vasiljevic ... Fernando Martinez Vallina
-
Jasmina Vasiljevic, et. al.Jasmina Vasiljevic ... Fernando Martinez Vallina
01 Dec 2015
01 Dec 2015

Efficiency Versus Accuracy: A Review of Design Techniques for DNN Hardware Accelerators
Cecilia Latotzke ... Tobias Gemmeke
IEEE Access | VOL. 9
Cecilia Latotzke, et. al.Cecilia Latotzke ... Tobias Gemmeke
01 Jan 2020
IEEE Access | VOL. 9

HardCompress: A Novel Hardware-based Low-power Compression Scheme for DNN Accelerators
Ayush Arunachalam ... Kanad Basu
-
Ayush Arunachalam, et. al.Ayush Arunachalam ... Kanad Basu
07 Apr 2021
07 Apr 2021

Software-Defined Design Space Exploration for an Efficient DNN Accelerator Architecture
Ye Yu ... Niraj K Jha
IEEE Transactions on Computers | VOL. 70
Ye Yu, et. al.Ye Yu ... Niraj K Jha
01 Jan 2020
IEEE Transactions on Computers | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Overcoming Data Transfer Bottlenecks in FPGA-based DNN Accelerators via Layer Conscious Memory Management

Abstract

Talk to us

Similar Papers