Cooperative Knowledge Distillation: A Learner Agnostic Approach

Michael Livanos,Stephen Wong,Ian Davidson

doi:10.1609/aaai.v38i13.29322

Abstract

Knowledge distillation is a simple but powerful way to transfer knowledge between a teacher model to a student model. Existing work suffers from at least one of the following key limitations in terms of direction and scope of transfer which restrict its use: all knowledge is transferred from teacher to student regardless of whether or not that knowledge is useful, the student is the only one learning in this exchange, and typically distillation transfers knowledge only from a single teacher to a single student. We formulate a novel form of knowledge distillation in which many models can act as both students and teachers which we call cooperative distillation. The models cooperate as follows: a model (the student) identifies specific deficiencies in it's performance and searches for another model (the teacher) who encodes learned knowledge into instructional virtual instances via counterfactual instance generation. Because different models may have different strengths and weaknesses, all models can act as either students or teachers (cooperation) when appropriate and only distill knowledge in areas specific to their strengths (focus). Since counterfactuals as a paradigm are not tied to any specific algorithm, we can use this method to distill knowledge between learners of different architectures, algorithms, and even feature spaces. We demonstrate our approach not only outperforms baselines such as transfer learning, self-supervised learning, and multiple knowledge distillation algorithms on several datasets, but it can also be used in settings where the aforementioned techniques cannot.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cooperative Knowledge Distillation: A Learner Agnostic Approach

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Deep Learning-Enabled Detection of Pneumoperitoneum in Supine and Erect Abdominal Radiography: Modeling Using Transfer Learning and Semi-Supervised Learning.
Sangjoon Park ... Jin Woo Yoon
Korean journal of radiology | VOL. 24
Sangjoon Park, et. al.Sangjoon Park ... Jin Woo Yoon
01 Jan 2023
Korean journal of radiology | VOL. 24

Boosting Self-supervised Video-based Human Action Recognition Through Knowledge Distillation
Fernando Camarena ... Neil Hernandez-Gress
-
Fernando Camarena, et. al.Fernando Camarena ... Neil Hernandez-Gress
28 Nov 2022
28 Nov 2022

Rare and Endangered Plant Leaf Identification Method Based on Transfer Learning and Knowledge Distillation
Lin Wu ... Jingjing Yang
International Journal of Agricultural and Environmental Information Systems | VOL. 12
Lin Wu, et. al.Lin Wu ... Jingjing Yang
29 Oct 2021
International Journal of Agricultural and Environmental Information Systems | VOL. 12

Decomposition and replacement: Spatial knowledge distillation for monocular depth estimation
Minsoo Song ... Wonjun Kim
Journal of Visual Communication and Image Representation | VOL. 85
Minsoo Song, et. al.Minsoo Song ... Wonjun Kim
21 Apr 2022
Journal of Visual Communication and Image Representation | VOL. 85

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cooperative Knowledge Distillation: A Learner Agnostic Approach

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence