Abstract

The energy cost of accessing off-chip memory, such as DRAM, is orders of magnitude higher than that of other operations such as multiply-and-accumulate, so DRAM access dominates system energy consumption. Optimizing off-chip memory access is therefore crucial to further improving the energy efficiency of deep neural network (DNN) accelerators. To this end, this brief proposes an adaptive scheduling algorithm that minimizes DRAM access. Compared with previous works, it not only dynamically determines the data partition and the data type to be reused, but also considers the constraint between adjacent layers: if the output feature map of the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$i$ </tex-math></inline-formula> th layer is divided into N parts, the output feature map of the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$i+1$ </tex-math></inline-formula> th layer can only be divided into N parts or written back to off-chip memory. A minimal and realizable memory-access solution can thus be obtained. Using three popular networks, UNet, VGG-16, and MobileNet, as benchmarks, the experimental results show that our scheduling algorithm achieves a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$34.46\% \sim 93.42\%$ </tex-math></inline-formula> reduction in the energy consumption of DRAM access and a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$34.34\% \sim 93.37\%$ </tex-math></inline-formula> reduction in DRAM access latency compared to previous works.
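The adjacent-layer constraint described above can be illustrated as a small dynamic program. The sketch below is not the paper's algorithm: the cost functions, candidate partition counts, and function names are all hypothetical placeholders. It only demonstrates the state transitions the abstract implies: a layer whose output stays on-chip in N parts forces the next layer to use the same N, while writing back to DRAM removes the constraint.

```python
def min_dram_cost(num_layers, partitions, dram_cost, onchip_cost):
    """Minimal sketch of scheduling under the adjacent-layer constraint.

    num_layers: number of layers to schedule.
    partitions: candidate partition counts N (hypothetical).
    dram_cost(i): DRAM traffic if layer i's output is written back (hypothetical).
    onchip_cost(i, n): traffic if layer i's output stays on-chip in n parts
                       (hypothetical).
    Returns the minimal total DRAM traffic over all feasible schedules.
    """
    INF = float("inf")
    # State after processing a layer:
    #   best[n]   - min cost so far, output resident on-chip in n parts
    #   best_dram - min cost so far, output written back to DRAM
    best = {n: INF for n in partitions}
    best_dram = 0.0  # the network input is assumed to start in DRAM

    for i in range(num_layers):
        new_best = {}
        for n in partitions:
            # Keep layer i's output on-chip in n parts: the predecessor must
            # either be DRAM-resident or use the SAME partition count n.
            new_best[n] = min(best_dram, best[n]) + onchip_cost(i, n)
        # Write layer i's output back to DRAM: any predecessor state is allowed.
        new_dram = min(best_dram, min(best.values())) + dram_cost(i)
        best, best_dram = new_best, new_dram

    return min(best_dram, min(best.values()))
```

With toy costs, e.g. `min_dram_cost(2, [1, 2], lambda i: 10, lambda i, n: n)`, the program keeps both layers on-chip with the same partition count, matching the constraint that a chain of on-chip layers shares one N.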
