Layer Compression of Deep Networks with Straight Flows

Chengyue Gong,Xiaocong Du,Qiang Liu,Arun Kejariwal,Dhruv Choudhary,Lemeng Wu,Bhargav Bhushanam,Xingchao Liu

doi:10.1609/aaai.v38i11.29107

Abstract

Very deep neural networks lead to significantly better performance on various real tasks. However, it usually causes slow inference and is hard to be deployed on real-world devices. How to reduce the number of layers to save memory and to accelerate the inference is an eye-catching topic. In this work, we introduce an intermediate objective, a continuous-time network, before distilling deep networks into shallow networks. First, we distill a given deep network into a continuous-time neural flow model, which can be discretized with an ODE solver and the inference requires passing through the network multiple times. By forcing the flow transport trajectory to be straight lines, we find that it is easier to compress the infinite step model into a one-step neural flow model, which only requires passing through the flow model once. Secondly, we refine the one-step flow model together with the final head layer with knowledge distillation and finally, we can replace the given deep network with this one-step flow network. Empirically, we demonstrate that our method outperforms direct distillation and other baselines on different model architectures (e.g. ResNet, ViT) on image classification and semantic segmentation tasks. We also manifest that our distilled model naturally serves as an early-exit dynamic inference model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Layer Compression of Deep Networks with Straight Flows

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

End-to-End Image Super-Resolution via Deep and Shallow Convolutional Networks
Yifan Wang ... Peihua Li
IEEE Access | VOL. 7
Yifan Wang, et. al.Yifan Wang ... Peihua Li
01 Jan 2019
IEEE Access | VOL. 7

Energy Disaggregation for NILM applications using Shallow and Deep Networks
Lakshmi Nambiar ... Vinod Kumargopal
-
Lakshmi Nambiar, et. al.Lakshmi Nambiar ... Vinod Kumargopal
01 Mar 2019
01 Mar 2019

Hyperspectral image classification based on deep stacking network
Mingyi He ... Jing Zhang
-
Mingyi He, et. al.Mingyi He ... Jing Zhang
01 Jul 2016
01 Jul 2016

Secure shell (ssh) traffic analysis with flow based features using shallow and deep networks
R Vinayakumar ... K P Soman
-
R Vinayakumar, et. al.R Vinayakumar ... K P Soman
01 Sep 2017
01 Sep 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Layer Compression of Deep Networks with Straight Flows

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence