Face Model Compression by Distilling Knowledge from Neurons

Ping Luo, Xiaoou Tang, Xiaogang Wang, Ziwei Liu, Zhenyao Zhu

doi:10.1609/aaai.v30i1.10449

Abstract

Recent advanced face recognition systems were built on large Deep Neural Networks (DNNs) or their ensembles, which have millions of parameters. However, the expensive computation of DNNs makes their deployment difficult on mobile and embedded devices. This work addresses model compression for face recognition, where the learned knowledge of a large teacher network or its ensemble is used as supervision to train a compact student network. Unlike previous works that represent the knowledge by the softened label probabilities, which are difficult to fit, we represent the knowledge by the neurons at the higher hidden layer, which preserve as much information as the label probabilities but are more compact. By leveraging the essential characteristics (domain knowledge) of the learned face representation, a neuron selection method is proposed to choose the neurons that are most relevant to face recognition. Using the selected neurons as supervision to mimic the single networks of DeepID2+ and DeepID3, which are state-of-the-art face recognition systems, a compact student with a simple network structure achieves better verification accuracy on LFW than each of its respective teachers. When an ensemble of DeepID2+ is used as the teacher, the mimicked student is able to outperform it while achieving a 51.6 times compression ratio and a 90 times speed-up in inference, making this cumbersome model applicable on portable devices.
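To make the idea of supervising a student with selected hidden neurons concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes a toy linear student, randomly generated stand-in "teacher" features, and a squared-error regression objective. The function names `select_neurons` and `mimic_loss` are hypothetical, and the variance-based selection is only a placeholder for the attribute-driven neuron selection mentioned in the abstract, whose details are not given here.

```python
import numpy as np

def select_neurons(teacher_feats, keep):
    """Placeholder selection (an assumption, not the paper's method):
    keep the `keep` teacher neurons with the highest variance."""
    variances = teacher_feats.var(axis=0)
    idx = np.argsort(variances)[::-1][:keep]
    return np.sort(idx)

def mimic_loss(student_out, teacher_feats, idx):
    """Squared-error regression of student outputs onto the selected
    teacher neurons, averaged over the batch."""
    diff = student_out - teacher_feats[:, idx]
    return 0.5 * np.mean(np.sum(diff ** 2, axis=1))

rng = np.random.default_rng(0)
x = rng.normal(size=(64, 10))                           # toy inputs
teacher_feats = np.tanh(x @ rng.normal(size=(10, 8)))   # toy top-hidden-layer features

idx = select_neurons(teacher_feats, keep=4)             # indices of supervising neurons
W = np.zeros((10, 4))                                   # toy linear student
initial = mimic_loss(x @ W, teacher_feats, idx)

# Plain gradient descent on the mimic loss.
for _ in range(200):
    diff = x @ W - teacher_feats[:, idx]
    W -= 0.05 * (x.T @ diff) / len(x)

final = mimic_loss(x @ W, teacher_feats, idx)
print(f"mimic loss: {initial:.4f} -> {final:.4f}")
```

In this sketch the student never sees class labels; its only training signal is the regression target formed by the selected teacher neurons, which is the contrast with label-probability supervision that the abstract draws.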

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 5, 2016
Citations: 144

Similar Papers

Energy efficient stochastic-based deep spiking neural networks for sparse datasets
Mohammed Alawad ... Hong-Jun Yoon
01 Dec 2017

Merging Similar Neurons for Deep Networks Compression
Guoqiang Zhong ... Jinxuan Sun
Cognitive Computation | VOL. 12
16 Jan 2020

Efficiently Coevolving Deep Neural Networks and Data Augmentations
Shane Acton ... Sasha Abramowitz
01 Dec 2020

Collaborative Consistent Knowledge Distillation Framework for Remote Sensing Image Scene Classification Network
Shiyi Xing ... Jinsheng Xing
Remote Sensing | VOL. 14
17 Oct 2022
