Teacher or supervisor? Effective online knowledge distillation via guided collaborative learning

Diana Laura Borza,Tudor Alexandru Ileni,Alexandru Ion Marinescu,Sergiu Adrian Darabant

doi:10.1016/j.cviu.2023.103632

Abstract

Knowledge distillation is a widely-used and effective technique to boost the performance of a lightweight student network, by having it mimic the behavior of a more powerful teacher network. This paper presents an end-to-end online knowledge distillation strategy, in which several peer students are trained together and their predictions are aggregated into a powerful teacher ensemble via an effective ensembling technique that uses an online supervisor network to determine the optimal way of combining the student logits. Intuitively, this supervisor network learns the area of expertise of each student and assigns a weight to each student accordingly►it has knowledge of the input image, the ground truth data, and the predictions of each individual student, and tries to answer the following question: “how much can we rely on each student’s prediction, given the current input image with this ground truth class?”. The proposed technique can be thought of as an inference optimization mechanism as it improves the overall accuracy over the same number of parameters. The experiments we performed show that the proposed knowledge distillation consistently improves the performance of the knowledge-distilled students vs. the independently trained students.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Teacher or supervisor? Effective online knowledge distillation via guided collaborative learning

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding

Lead the way for us

Journal: Computer Vision and Image Understanding	Publication Date: Jan 18, 2023
Citations: 4

Similar Papers

Feature fusion-based collaborative learning for knowledge distillation
Yiting Li ... Weihua Ou
International Journal of Distributed Sensor Networks | VOL. 17
Yiting Li, et. al.Yiting Li ... Weihua Ou
01 Nov 2021
International Journal of Distributed Sensor Networks | VOL. 17

Robust cross-lingual knowledge base question answering via knowledge distillation
Shaofei Wang ... Depeng Dang
Data Technologies and Applications | VOL. 55
Shaofei Wang, et. al.Shaofei Wang ... Depeng Dang
30 Apr 2021
Data Technologies and Applications | VOL. 55

Adversarial Metric Knowledge Distillation
Zihe Dong ... Xin Sun
-
Zihe Dong, et. al.Zihe Dong ... Xin Sun
27 Nov 2020
27 Nov 2020

Relay knowledge distillation for efficiently boosting the performance of shallow networks
Shipeng Fu ... Xiaomin Yang
Neurocomputing | VOL. 514
Shipeng Fu, et. al.Shipeng Fu ... Xiaomin Yang
29 Sep 2022
Neurocomputing | VOL. 514

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Teacher or supervisor? Effective online knowledge distillation via guided collaborative learning

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding