Abstract

We study robust quantization of deep neural networks (DNNs) for embedded devices. Existing compression techniques often generate DNNs that are sensitive to external errors. Because embedded devices may be affected by external light and outdoor weather, DNNs running on them must be robust to such errors. For robust quantization of DNNs, we formulate an optimization problem that finds the bit width for each layer so as to minimize the robustness loss. To solve this problem efficiently, we design a dynamic-programming-based algorithm called Qed. We also propose an incremental algorithm, Q*, which quickly finds a reasonably robust quantization and then gradually improves it. We evaluate Qed and Q* with three DNN models (LeNet, AlexNet, and VGG-16) under both Gaussian random errors and realistic errors. For comparison, we also evaluate universal quantization, which uses an equal bit width for all layers, and Deep Compression, a weight-sharing-based compression technique. When tested with errors of increasing magnitude, Qed gives correct inference output most robustly. Even if a DNN is optimized for robustness, its quantizations may not be robust unless Qed is used. Moreover, we evaluate Q* for its trade-off between execution time and robustness: in one tenth of Qed's execution time, Q* gives a quantization 98% as robust as the one found by Qed.
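To make the per-layer bit-width search concrete, the following is a minimal Python sketch of a dynamic-programming assignment of bit widths to layers. The abstract only states that Qed minimizes a robustness loss via dynamic programming; the additive per-layer loss model, the model-size budget constraint, and all names below (layer_loss, layer_params, budget_bits, choose_bit_widths) are illustrative assumptions, not the paper's actual formulation of Qed.

    # Hypothetical sketch: choose one bit width per layer to minimize a
    # summed robustness-loss estimate under a total-model-size budget.
    # All quantities here are toy values for illustration only.

    def choose_bit_widths(layer_loss, layer_params, bit_options, budget_bits):
        """Return a per-layer bit-width list minimizing total estimated loss
        while keeping the total number of weight bits within budget_bits."""
        dp = {0: (0.0, [])}  # used_bits -> (best loss so far, assignment)
        for layer in range(len(layer_loss)):
            next_dp = {}
            for used, (loss, assign) in dp.items():
                for b in bit_options:
                    new_used = used + layer_params[layer] * b
                    if new_used > budget_bits:
                        continue
                    new_loss = loss + layer_loss[layer][b]
                    if new_used not in next_dp or new_loss < next_dp[new_used][0]:
                        next_dp[new_used] = (new_loss, assign + [b])
            dp = next_dp
        if not dp:
            return None  # budget too tight for any assignment
        return min(dp.values(), key=lambda t: t[0])[1]

    # Toy example: 3 layers, candidate bit widths {2, 4, 8}.
    layer_params = [1000, 5000, 2000]          # weights per layer
    bit_options = [2, 4, 8]
    layer_loss = [                              # estimated robustness loss per choice
        {2: 0.30, 4: 0.10, 8: 0.02},
        {2: 0.50, 4: 0.15, 8: 0.03},
        {2: 0.20, 4: 0.08, 8: 0.01},
    ]
    print(choose_bit_widths(layer_loss, layer_params, bit_options, budget_bits=40000))

In this toy instance the search returns [8, 4, 4], spending the larger bit widths where the assumed loss table says they help most while staying within the 40,000-bit budget; the actual Qed algorithm and loss definition are given in the paper itself.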
