Abstract

Knowledge distillation has attracted great attention from computer vision researchers in recent years. However, the performance of the student model suffers when the complete dataset used to train the teacher model is unavailable. In particular, when conducting knowledge distillation between heterogeneous models, it is difficult for the student model to learn from the teacher's guidance with only a few data samples. In this paper, a data augmentation method is proposed based on the attentional response of the teacher model. The proposed method utilizes the knowledge in the teacher model without requiring a homogeneous architecture between the teacher model and the student model. Experimental results demonstrate that, by combining the proposed data augmentation method with different knowledge distillation methods, the performance of the student model can be improved in knowledge distillation with few data.
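The abstract does not give implementation details, but the core idea of augmenting few-shot training data using the teacher's attentional response can be sketched as follows. This is a minimal illustrative sketch, not the paper's method: the teacher network (a torchvision ResNet-18 stand-in), the attention definition (channel-wise mean of squared activations from the last convolutional block), and the attention-guided cropping heuristic are all assumptions introduced here for illustration.

```python
# Hypothetical sketch of attention-guided data augmentation for knowledge
# distillation with few data. Teacher choice, attention definition, and the
# crop heuristic are illustrative assumptions, not the paper's exact method.
import torch
import torch.nn.functional as F
import torchvision.models as models
import torchvision.transforms.functional as TF

teacher = models.resnet18(weights=None)  # stand-in teacher; any CNN works
teacher.eval()

def teacher_attention_map(images: torch.Tensor) -> torch.Tensor:
    """Spatial attention from the teacher's last conv block: channel-wise
    mean of squared activations, upsampled and normalized to [0, 1]."""
    feats = {}
    handle = teacher.layer4.register_forward_hook(
        lambda m, i, o: feats.setdefault("a", o))
    with torch.no_grad():
        teacher(images)
    handle.remove()
    att = feats["a"].pow(2).mean(dim=1, keepdim=True)          # (N, 1, h, w)
    att = F.interpolate(att, size=images.shape[-2:],
                        mode="bilinear", align_corners=False)  # (N, 1, H, W)
    lo = att.amin(dim=(2, 3), keepdim=True)
    hi = att.amax(dim=(2, 3), keepdim=True)
    return (att - lo) / (hi - lo + 1e-8)

def attention_guided_crops(image: torch.Tensor, att: torch.Tensor,
                           num_crops: int = 4, crop_frac: float = 0.6):
    """Sample crops centred on high-attention locations so that the new
    training samples keep the regions the teacher responds to."""
    _, H, W = image.shape
    ch, cw = int(H * crop_frac), int(W * crop_frac)
    probs = att.flatten() / att.sum()                 # attention as sampling prob.
    idx = torch.multinomial(probs, num_crops, replacement=True)
    crops = []
    for i in idx:
        cy, cx = int(i) // W, int(i) % W
        top = min(max(cy - ch // 2, 0), H - ch)
        left = min(max(cx - cw // 2, 0), W - cw)
        crops.append(TF.resized_crop(image, top, left, ch, cw, [H, W]))
    return crops

# Usage: expand a small set of samples with teacher-attended crops, which can
# then be fed to any distillation loss between teacher and student.
batch = torch.randn(2, 3, 224, 224)                   # few original samples
att_maps = teacher_attention_map(batch)
augmented = [c for img, a in zip(batch, att_maps[:, 0])
             for c in attention_guided_crops(img, a)]
```

Because the augmentation only reads the teacher's feature responses and produces extra input images, it places no constraint on the student's architecture, which is consistent with the abstract's claim that no homogeneous teacher-student architecture is required.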
