Abstract
Knowledge distillation is the process of transferring knowledge from a large, high-capacity model (the teacher) to enhance a smaller model (the student). Exploring properties of the teacher, such as its decision boundaries, is therefore key to improving student performance. One technique for exploring decision boundaries is to leverage adversarial attack methods, which add crafted perturbations, constrained to a ball around each clean input, to produce attack inputs for the teacher called adversarial examples. These adversarial examples are informative because they lie near decision boundaries. In this paper, we formulate the teacher adversarial local distribution: the set of all adversarial examples within the ball constraint around a given input. This distribution is used to explore the teacher's decision boundaries sufficiently by covering the full spectrum of possible perturbations of the teacher's inputs. The student model is then regularized by matching teacher and student losses on these adversarial inputs. We conduct experiments on the CIFAR-100 and ImageNet datasets to show that this teacher adversarial local distribution (TALD) regularization can be applied to improve the performance of many existing knowledge distillation methods (e.g., KD, FitNet, CRD, VID, and FT).
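The abstract does not spell out the training procedure, but a minimal PyTorch sketch of the general idea might look as follows: a PGD-style attack crafts adversarial examples of the teacher within an L-infinity ball, and random restarts approximate samples from the adversarial local distribution on which teacher and student outputs are matched. The function names (`pgd_attack`, `tald_regularizer`), the use of a KL-divergence matching loss, and all hyperparameter values are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def pgd_attack(teacher, x, y, eps=8/255, alpha=2/255, steps=10):
    """Craft adversarial examples of the teacher within an L-inf ball (assumed PGD attack)."""
    x_adv = x + torch.empty_like(x).uniform_(-eps, eps)  # random start inside the ball
    x_adv = x_adv.clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(teacher(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()       # gradient-ascent step on the teacher loss
        x_adv = x.detach() + (x_adv - x).clamp(-eps, eps)  # project back into the eps-ball
        x_adv = x_adv.clamp(0, 1)
    return x_adv.detach()

def tald_regularizer(teacher, student, x, y, eps=8/255, n_samples=4, T=4.0):
    """Match teacher/student predictions on samples from the adversarial local distribution."""
    reg = 0.0
    for _ in range(n_samples):  # random restarts approximate draws from the local distribution
        x_adv = pgd_attack(teacher, x, y, eps=eps)
        with torch.no_grad():
            p_t = F.softmax(teacher(x_adv) / T, dim=1)      # teacher targets (no grad)
        log_p_s = F.log_softmax(student(x_adv) / T, dim=1)  # student predictions
        reg = reg + F.kl_div(log_p_s, p_t, reduction="batchmean") * T * T
    return reg / n_samples
```

In practice this term would presumably be added, with some weight, to the base objective of whichever distillation method it augments (KD, FitNet, CRD, etc.), which is consistent with the abstract's claim that TALD is method-agnostic.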