Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model

Liangchen Song, Jialian Wu, Qian Zhang, Yuan Li, Junsong Yuan, Ming Yang

doi:10.1609/aaai.v35i3.16358

Abstract

When adopting deep neural networks for a new vision task, a common practice is to start by fine-tuning an off-the-shelf, well-trained network model from the community. Since a new task may require training a different network architecture on new domain data, taking advantage of off-the-shelf models is not trivial and generally requires considerable trial and error and parameter tuning. In this paper, we denote a well-trained model as a teacher network and a model for the new task as a student network. We aim to ease the effort of transferring knowledge from the teacher to the student network in a way that is robust to the gaps between their network architectures, domain data, and task definitions. Specifically, we propose a hybrid forward scheme for training the teacher-student models, which alternately updates the layer weights of the student model. The key merit of our hybrid forward scheme is the dynamic balance between the knowledge transfer loss and the task-specific loss during training. We demonstrate the effectiveness of our method on a variety of tasks, e.g., model compression, segmentation, and detection, under a variety of knowledge transfer settings.
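
For context, the two loss terms named in the abstract can be illustrated with the conventional fixed-weight combination used in standard knowledge distillation. This is a minimal sketch of that baseline, not the paper's hybrid forward scheme, whose details are not given in the abstract. The function names, the temperature, and the weight `alpha` are assumptions introduced here for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def task_loss(student_logits, label):
    """Task-specific loss: cross-entropy against the ground-truth class index."""
    return -math.log(softmax(student_logits)[label])

def transfer_loss(student_logits, teacher_logits, temperature=2.0):
    """Knowledge transfer loss: KL(teacher || student) on softened outputs."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def combined_loss(student_logits, teacher_logits, label, alpha=0.5):
    """Fixed-weight blend of the two terms.

    alpha is a hand-tuned hyperparameter (an assumption of this sketch);
    the abstract describes balancing the terms dynamically instead.
    """
    return (alpha * transfer_loss(student_logits, teacher_logits)
            + (1.0 - alpha) * task_loss(student_logits, label))
```

In this baseline, `alpha` must be chosen per task and per teacher-student pair, which is the kind of manual tuning the abstract says the proposed method aims to reduce.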


Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: May 18, 2021
Citations: 4

Similar Papers

Multi-level knowledge distillation for low-resolution object detection and facial expression recognition
Tingsong Ma ... Yuanlun Xie
Knowledge-Based Systems | VOL. 240
10 Jan 2022

Knowledge Transfer via Dense Cross-Layer Mutual-Distillation
Anbang Yao ... Dawei Sun
01 Jan 2020

A General Dynamic Knowledge Distillation Method for Visual Analytics
Zhigang Tu ... Xuan Xiao
IEEE Transactions on Image Processing | VOL. PP
01 Jan 2021

Proactive Labeling of Examples for Domain Adaptation
M.A Ryndin ... D.Y Turdakov
Proceedings of the Institute for System Programming of the RAS | VOL. 31
01 Jan 2019
