Tailor: Altering Skip Connections for Resource-Efficient Inference

Olivia Weng, Javier Mauricio Duarte, Alireza Khodamoradi, Kristof Denolf, Abarajithan G, Ryan Kastner, Andres Meza, Nojan Sheybani, Vladimir Loncar, Farinaz Koushanfar, Gabriel Marcano

doi:10.1145/3624990

Abstract

Deep neural networks use skip connections to improve training convergence. However, these skip connections are costly in hardware, requiring extra buffers and increasing on- and off-chip memory utilization and bandwidth requirements. In this article, we show that skip connections can be optimized for hardware when tackled with a hardware-software codesign approach. We argue that while a network's skip connections are needed for the network to learn, they can later be removed or shortened to provide a more hardware-efficient implementation with minimal to no accuracy loss. We introduce Tailor, a codesign tool whose hardware-aware training algorithm gradually removes or shortens a fully trained network's skip connections to lower the hardware cost. For on-chip, dataflow-style architectures, Tailor improves resource utilization by up to 34% for block random access memories (BRAMs), 13% for flip-flops (FFs), and 16% for look-up tables (LUTs). For a two-dimensional processing element array architecture, Tailor increases performance by 30% and reduces memory bandwidth by 45%.

Journal: ACM Transactions on Reconfigurable Technology and Systems
Publication Date: Jan 27, 2024
Citations: 2
License type: public-domain

Similar Papers

HEVC ALF decode complexity analysis and reduction
Madhukar Budagavi ... Minhua Zhou
01 Sep 2011

REQ-YOLO
Caiwen Ding ... Yanzhi Wang
20 Feb 2019

Optimizing Video Coding for MPEG-4 and H.264 (最佳化MPEG-4和H.264的視訊編碼)
...
01 Jan 2009

Hybrid Fixed-Point/Binary Deep Neural Network Design Methodology for Low-Power Object Detection
Jiun-In Guo ... Jian-Lin Zeng
IEEE Journal on Emerging and Selected Topics in Circuits and Systems | VOL. 10
01 Sep 2020