PURSUhInT: In Search of Informative Hint Points Based on Layer Clustering for Knowledge Distillation

Reyhan Kevser Keser, Aydin Ayanzadeh, Omid Abdollahi Aghdam, Caglar Kilcioglu, Behcet Ugur Toreyin, Nazim Kemal Ure

doi:10.1016/j.eswa.2022.119040

Abstract

One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore, we propose a clustering-based hint selection methodology, where the layers of the teacher model are clustered with respect to several metrics and the cluster centers are used as the hint points. Once applied to a chosen teacher network, our method can be used with any student network. The proposed approach is validated on the CIFAR-100 and ImageNet datasets, using various teacher–student pairs and numerous hint distillation methods. Our results show that hint points selected by our algorithm result in superior compression performance compared to state-of-the-art knowledge distillation algorithms on the same student models and datasets.
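The selection idea in the abstract (cluster the teacher's layers, then use the cluster centers as hint points) can be illustrated with a minimal sketch. The abstract does not name the metrics or the clustering algorithm, so everything specific here is an assumption made for illustration: linear CKA stands in for the layer similarity metric, a simple k-medoids loop stands in for the clustering step, and the function names (`linear_cka`, `layer_distance_matrix`, `select_hint_layers`) are hypothetical rather than taken from the paper.

```python
import numpy as np


def linear_cka(x, y):
    """Linear CKA between two activation matrices of shape (n_samples, n_features).

    Assumed similarity metric; the paper's actual metrics are not given in the abstract.
    """
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)
    cross = np.linalg.norm(x.T @ y, "fro") ** 2
    return cross / (np.linalg.norm(x.T @ x, "fro") * np.linalg.norm(y.T @ y, "fro"))


def layer_distance_matrix(activations):
    """Pairwise distance (1 - CKA) between layers, given one activation matrix per layer."""
    n = len(activations)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = 1.0 - linear_cka(activations[i], activations[j])
    return dist


def select_hint_layers(activations, k, n_iter=100):
    """Cluster layers with k-medoids (an assumed choice) and return the medoid
    layer indices as candidate hint points, plus each layer's cluster label."""
    dist = layer_distance_matrix(activations)
    n = dist.shape[0]
    # Deterministic start: k layers spread evenly over the network depth.
    medoids = np.linspace(0, n - 1, k).round().astype(int)
    for _ in range(n_iter):
        labels = np.argmin(dist[:, medoids], axis=1)
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size == 0:
                continue  # keep the previous medoid for an empty cluster
            within = dist[np.ix_(members, members)].sum(axis=1)
            new_medoids[c] = members[np.argmin(within)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    medoids = np.sort(medoids)
    labels = np.argmin(dist[:, medoids], axis=1)
    return medoids, labels


# Toy usage: random matrices stand in for teacher activations collected on a
# shared batch of inputs (six "layers" forming two groups of similar layers).
rng = np.random.default_rng(0)
group_a = rng.normal(size=(200, 10))
group_b = rng.normal(size=(200, 10))
toy_activations = [group_a + 0.01 * rng.normal(size=group_a.shape) for _ in range(3)]
toy_activations += [group_b + 0.01 * rng.normal(size=group_b.shape) for _ in range(3)]
hint_layers, cluster_labels = select_hint_layers(toy_activations, k=2)
print("candidate hint layers:", hint_layers)
```

Because the clustering only looks at the teacher's activations, the selected layers can be computed once per teacher and reused for different students, which matches the abstract's remark that the method applies to any student network once it has been run on a chosen teacher.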

Journal: Expert Systems with Applications
Publication Date: Oct 20, 2022
Citations: 4
