AirNN: A Featherweight Framework for Dynamic Input-Dependent Approximation of CNNs

Maedeh Hemmat, Azadeh Davoodi, Joshua San Miguel

doi:10.1109/tcad.2020.3033750

Abstract

In this work, we propose AirNN, a novel framework that enables dynamic approximation of an already-trained convolutional neural network (CNN) in hardware during inference. AirNN performs input-dependent approximation of the CNN at runtime to save energy without much degradation in classification accuracy. For each input, AirNN conducts the inference using only a fraction of the CNN's weights, selected based on that input, with the remaining weights left at 0. Energy is saved because, for the majority of inputs, fewer weights are fetched from off-chip memory and fewer multiplications are performed. To achieve per-input approximation, we propose a clustering algorithm that groups similar weights in the CNN based on their importance, and we design an iterative framework that decides dynamically how many clusters of weights should be fetched from off-chip memory for each individual input. We also propose new hardware structures to implement our framework on top of a recently proposed FPGA-based CNN accelerator. In our experiments with popular CNNs, we show, on average, 49% energy saving with less than 3% degradation in classification accuracy, achieved by performing inference with only a fraction of the weights for the majority of the inputs. We also propose a greedy interleaving scheme, implemented in hardware, to improve the performance of the iterative procedure and compensate for its latency overhead.
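The iterative, per-input idea described in the abstract can be illustrated with a small software sketch. This is not the authors' hardware implementation: the abstract does not state the importance metric, the clustering rule, or the stopping criterion, so the sketch assumes weight magnitude as the importance proxy, equal-sized clusters, and a softmax confidence threshold as the stopping rule. A toy linear classifier stands in for the CNN, and all function names (`cluster_by_importance`, `iterative_inference`) are hypothetical.

```python
import math


def cluster_by_importance(weights, num_clusters):
    """Rank weight indices by |w| (an ASSUMED importance proxy) and split the
    ranking into roughly equal groups, most important group first."""
    order = sorted(range(len(weights)), key=lambda i: -abs(weights[i]))
    size = math.ceil(len(order) / num_clusters)
    return [order[k:k + size] for k in range(0, len(order), size)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def iterative_inference(x, flat_w, num_classes, clusters, threshold=0.9):
    """Toy linear classifier standing in for a CNN.

    flat_w holds one row of len(x) weights per class, flattened row by row.
    Weights that have not been "fetched" stay 0. One more cluster is fetched
    per iteration until the top softmax probability reaches `threshold`
    (an ASSUMED stopping rule) or every cluster has been fetched.
    Returns (predicted_class, clusters_fetched).
    """
    n = len(x)
    active = [0.0] * len(flat_w)
    for fetched, cluster in enumerate(clusters, start=1):
        for i in cluster:  # simulate fetching one cluster from off-chip memory
            active[i] = flat_w[i]
        scores = [sum(active[c * n + j] * x[j] for j in range(n))
                  for c in range(num_classes)]
        probs = softmax(scores)
        if max(probs) >= threshold:
            break
    return probs.index(max(probs)), fetched


# Two classes, three input features; two weights dominate the rest.
flat_w = [5.0, 0.1, 0.1,
          0.1, 0.1, 5.0]
clusters = cluster_by_importance(flat_w, num_clusters=3)

easy_input = [2.0, 0.0, 0.0]   # decided after the first cluster
hard_input = [0.1, 0.0, 0.1]   # stays ambiguous, so all clusters are fetched
print(iterative_inference(easy_input, flat_w, 2, clusters))
print(iterative_inference(hard_input, flat_w, 2, clusters))
```

In this toy setting, an input that is classified confidently from the most important weights stops after one cluster, while an ambiguous input pulls in all of them. That difference in fetched clusters is the source of the per-input saving the abstract refers to.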

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Publication Date: Oct 30, 2020
Citations: 3
License type: publisher-specific-oa

Similar Papers

Dynamic Reconfiguration of CNNs for Input-Dependent Approximation
Maedeh Hemmat ... Azadeh Davoodi
01 Mar 2019

Clinically Relevant Vulnerabilities of Deep Machine Learning Systems for Skin Cancer Diagnosis
Xinyi Du-Harpur ... Magnus D Lynch
Journal of Investigative Dermatology | VOL. 141
12 Sep 2020

Ristretto: A Framework for Empirical Study of Resource-Efficient Inference in Convolutional Neural Networks.
Philipp Gysel ... Jon Pimentel
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29
16 Mar 2018

An Area Efficient Superconducting Unary CNN Accelerator
Patricia Gonzalez-Guerrero ... Thom Popovici
05 Apr 2023
