AdaNS: Adaptive Non-Uniform Sampling for Automated Design of Compact DNNs

Mojan Javaheripi,Farinaz Koushanfar,Mohammad Samragh,Tara Javidi

doi:10.1109/jstsp.2020.2992384

Abstract

This paper introduces an adaptive sampling methodology for automated compression of Deep Neural Networks (DNNs) for accelerated inference on resource-constrained platforms. Modern DNN compression techniques comprise various hyperparameters that require per-layer customization. Our objective is to locate an optimal hyperparameter configuration that leads to lowest model complexity while adhering to a desired inference accuracy. We design a score function that evaluates the aforementioned optimality. The optimization problem is then formulated as searching for the maximizers of this score function. To this end, we devise a non-uniform adaptive sampler that aims at reconstructing the band-limited score function. We reduce the total number of required objective function evaluations by realizing a targeted sampler. We propose three adaptive sampling methodologies, i.e., AdaNS-Zoom, AdaNS-Genetic, and AdaNS-Gaussian, where new batches of samples are chosen based on the history of previous evaluations. Our algorithms start sampling from a uniform distribution over the entire search-space and iteratively adapt the sampling distribution to achieve highest density around the function maxima. This, in turn, allows for a low-error reconstruction of the objective function around its maximizers. Our extensive evaluations corroborate AdaNS effectiveness by outperforming existing rule-based and Reinforcement Learning methods in terms of DNN compression rate and/or inference accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AdaNS: Adaptive Non-Uniform Sampling for Automated Design of Compact DNNs

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Selected Topics in Signal Processing

Lead the way for us

Journal: IEEE Journal of Selected Topics in Signal Processing	Publication Date: May 1, 2020
Citations: 50

Similar Papers

Smart-DNN+: A Memory-efficient Neural Networks Compression Framework for the Model Inference
Donglei Wu ... Zhenbo Hu
ACM Transactions on Architecture and Code Optimization | VOL. 20
Donglei Wu, et. al.Donglei Wu ... Zhenbo Hu
26 Oct 2023
ACM Transactions on Architecture and Code Optimization | VOL. 20

AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles
Sicong Liu ... Junzhao Du
IEEE Transactions on Mobile Computing | VOL. 20
Sicong Liu, et. al.Sicong Liu ... Junzhao Du
04 Jun 2020
IEEE Transactions on Mobile Computing | VOL. 20

Structured Compression of Deep Neural Networks with Debiased Elastic Group LASSO
Oyebade K Oyedotun ... Bjorn Ottersten
-
Oyebade K Oyedotun, et. al.Oyebade K Oyedotun ... Bjorn Ottersten
01 Mar 2020
01 Mar 2020

Dynamic and Adaptive Threshold for DNN Compression from Scratch
Chunhui Jiang ... Chao Qian
-
Chunhui Jiang, et. al.Chunhui Jiang ... Chao Qian
01 Jan 2017
01 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AdaNS: Adaptive Non-Uniform Sampling for Automated Design of Compact DNNs

Abstract

Talk to us

Similar Papers

More From: IEEE Journal of Selected Topics in Signal Processing