Abstract

In knowledge distillation (KD), a lightweight student model achieves improved test accuracy by mimicking the behavior of a large pre-trained model (the teacher). However, the cumbersome teacher model often produces over-confident predictions, which generalize poorly to unseen data. Consequently, a student trained by such a teacher inherits this problem. To mitigate this issue, we present a new KD framework, dubbed coded knowledge distillation (CKD), in which the student is instead trained to mimic the behavior of a coded teacher. Compared to the teacher in KD, the coded teacher in CKD has an additional adaptive encoding layer at the front, which adaptively encodes an input image into a compressed version (using JPEG encoding, for instance) and then feeds the compressed image to the pre-trained teacher. Comprehensive experimental results demonstrate the effectiveness of CKD over KD. In addition, we extend the use of the coded teacher to other knowledge transfer methods, showing that it improves test accuracy across these methods as well.

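The sketch below illustrates the coded-teacher idea described in the abstract in a standard PyTorch KD setup: the input image is round-tripped through JPEG before being passed to the frozen teacher, and the student is trained against the coded teacher's softened outputs. The fixed JPEG quality, the temperature `T`, the weight `alpha`, and the helper names (`jpeg_compress`, `coded_teacher_logits`, `ckd_loss`) are assumptions for illustration; the paper's actual encoding layer is adaptive and its exact policy and loss are not specified in the abstract.

```python
# Minimal sketch of the CKD idea, assuming a conventional PyTorch KD pipeline.
# The adaptive quality-selection policy is replaced here by a fixed quality
# (an assumption, not the authors' method).
import io

import torch
import torch.nn.functional as F
from PIL import Image
from torchvision import transforms

to_pil = transforms.ToPILImage()
to_tensor = transforms.ToTensor()


def jpeg_compress(images: torch.Tensor, quality: int) -> torch.Tensor:
    """Round-trip a batch of images (N, C, H, W, values in [0, 1]) through JPEG."""
    out = []
    for img in images:
        buf = io.BytesIO()
        to_pil(img.cpu()).save(buf, format="JPEG", quality=quality)
        buf.seek(0)
        out.append(to_tensor(Image.open(buf).convert("RGB")))
    return torch.stack(out).to(images.device)


def coded_teacher_logits(teacher: torch.nn.Module, images: torch.Tensor,
                         quality: int = 50) -> torch.Tensor:
    """Coded teacher: encode the input (plain JPEG at a fixed quality, as a
    placeholder for the adaptive encoder) and feed it to the frozen teacher."""
    with torch.no_grad():
        return teacher(jpeg_compress(images, quality))


def ckd_loss(student_logits: torch.Tensor, teacher_logits: torch.Tensor,
             labels: torch.Tensor, T: float = 4.0, alpha: float = 0.9) -> torch.Tensor:
    """Standard KD objective, with the coded teacher supplying the soft targets."""
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                    F.softmax(teacher_logits / T, dim=1),
                    reduction="batchmean") * T * T
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

In a training loop, one would compute `coded_teacher_logits(teacher, images)` per batch and minimize `ckd_loss(student(images), teacher_out, labels)`; swapping the fixed-quality compression for an input-dependent quality choice recovers the adaptive encoding layer the abstract describes.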