Abstract

Sparsity, a widely recognized path to curbing the computational demands of deep neural networks (DNNs), still faces a number of practical roadblocks despite a decade of intensive research on sparse neural networks. Existing structured sparsity patterns often fail to attain significant model compression, while the hardware challenges posed by unstructured sparsity are yet to be fully overcome. Because algorithmic and hardware innovations individually deliver limited benefits, a synergistic approach is necessary to unleash the potential of sparse DNNs. This work proposes a tightly integrated design methodology for sparsity patterns and the associated hardware platforms that reaches the highest model compression goals while simultaneously facilitating efficient hardware processing. We demonstrate that novel complementary sparsity patterns can offer the highest levels of expressiveness with inherent, hardware-exploitable regularity. Our novel dynamic training method converts the expressiveness of such sparsity configurations into highly accurate and compact sparse neural networks. Complementary sparsity is represented in a dense format and, when coupled with minimal yet strategic hardware modifications, can be processed with a dataflow that closely follows that of conventional dense matrix operations. We thus demonstrate that there is ample room for innovation beyond conventional techniques and immense practical potential for sparse neural networks through the synergistic design of sparsity patterns and hardware architectures.
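To make the dense-format packing idea concrete, the sketch below shows one possible reading of complementary sparsity: it assumes K sparse weight matrices whose non-zero supports are mutually disjoint and together tile the dense weight grid, so all K can be stored in a single dense buffer alongside a per-element owner map, and each sparse product can then be evaluated with an essentially dense dataflow plus a mask. The names (`owner`, `packed`, `sparse_matvec`) and the partitioning scheme are illustrative assumptions, not the paper's implementation.

```python
# Illustrative sketch only -- NOT the paper's implementation.
# Assumption: "complementary sparsity" means K sparse weight matrices whose
# non-zero supports are mutually disjoint and jointly cover the dense grid,
# so all K can be packed into one dense buffer plus a per-element owner map.
import numpy as np

rng = np.random.default_rng(0)
K, rows, cols = 4, 8, 8                        # K complementary sub-matrices, each ~1/K dense

owner = rng.integers(0, K, size=(rows, cols))  # hypothetical owner map: which sub-matrix holds each entry
packed = rng.standard_normal((rows, cols))     # single dense buffer storing all K sparse matrices

x = rng.standard_normal(cols)

def sparse_matvec(k):
    # Sparse matvec for sub-matrix k, expressed as a dense-style operation:
    # a conventional dense multiply plus a selection mask (the only extra
    # step a modified processing element would need to apply).
    mask = (owner == k)
    return (packed * mask) @ x

# Because the K supports partition the grid, the K sparse products sum back
# to the ordinary dense product over the packed buffer.
dense_equivalent = packed @ x
assert np.allclose(sum(sparse_matvec(k) for k in range(K)), dense_equivalent)
```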
