DASS: Differentiable Architecture Search for Sparse Neural Networks

Hamid Mousavi, Mina Alibeigi, Mohammad Loni, Masoud Daneshtalab

doi:10.1145/3609385

Abstract

The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available computational power. While recent research has made significant strides in developing pruning methods that build sparse networks to reduce the computing overhead of DNNs, considerable accuracy loss remains, especially at high pruning ratios. We find that architectures designed for dense networks by differentiable architecture search methods are ineffective when pruning mechanisms are applied to them. The main reason is that current methods do not support sparse architectures in their search space and use a search objective that was designed for dense networks and does not account for sparsity. This paper proposes a new method to search for sparsity-friendly neural architectures by adding two new sparse operations to the search space and modifying the search objective. The two novel parametric operations, SparseConv and SparseLinear, are sparse parametric versions of the convolution and linear operations; they expand the search space to include sparse operations and make it more flexible. The proposed search objective lets us train the architecture based on the sparsity of the search space operations. Quantitative analyses demonstrate that architectures found through DASS outperform those used in state-of-the-art sparse networks on the CIFAR-10 and ImageNet datasets. In terms of performance and hardware effectiveness, DASS increases the accuracy of the sparse version of MobileNet-v2 from 73.44% to 81.35% (an improvement of 7.91 percentage points) with a 3.87× faster inference time.
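The abstract does not say how SparseConv and SparseLinear are parameterized. As a rough illustration of what a "sparse parametric version" of a linear operation could look like, the sketch below masks weights by magnitude using a single threshold parameter. The threshold-based mask, the function names (`magnitude_mask`, `sparse_linear`, `sparsity`), and the example values are all assumptions made for illustration, not the method used in DASS.

```python
from typing import List

Matrix = List[List[float]]


def magnitude_mask(weights: Matrix, threshold: float) -> Matrix:
    """Return a 0/1 mask keeping weights whose magnitude exceeds the threshold.

    Assumption: a magnitude threshold is one common way to parameterize
    sparsity; the abstract does not state that DASS uses it.
    """
    return [[1.0 if abs(w) > threshold else 0.0 for w in row] for row in weights]


def sparse_linear(x: List[float], weights: Matrix, bias: List[float],
                  threshold: float) -> List[float]:
    """Compute y = (W * mask) x + b, where the mask depends on the threshold."""
    mask = magnitude_mask(weights, threshold)
    return [
        sum(w * m * xi for w, m, xi in zip(w_row, m_row, x)) + b
        for w_row, m_row, b in zip(weights, mask, bias)
    ]


def sparsity(weights: Matrix, threshold: float) -> float:
    """Fraction of weights zeroed out by the mask."""
    mask = magnitude_mask(weights, threshold)
    total = sum(len(row) for row in mask)
    kept = sum(sum(row) for row in mask)
    return 1.0 - kept / total


# Hypothetical 2x3 layer: half of the weights fall at or below the threshold.
W = [[0.5, -0.25, 2.0], [-1.0, 0.125, 0.0625]]
b = [0.0, 1.0]
x = [1.0, 2.0, 4.0]

y = sparse_linear(x, W, b, threshold=0.25)
s = sparsity(W, threshold=0.25)
print(y, s)
```

In a differentiable search setting, the threshold would typically be a trainable quantity and the hard mask would need a differentiable surrogate; both are omitted here to keep the sketch minimal.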

Journal: ACM Transactions on Embedded Computing Systems
Publication Date: Sep 9, 2023
Citations: 6

