Boosting Residual Networks with Group Knowledge

Shengji Tang,Baopu Li,Tong He,Weihao Lin,Tao Chen,Peng Ye,Wanli Ouyang,Chong Yu

doi:10.1609/aaai.v38i6.28322

Abstract

Recent research understands the residual networks from a new perspective of the implicit ensemble model. From this view, previous methods such as stochastic depth and stimulative training have further improved the performance of the residual network by sampling and training of its subnets. However, they both use the same supervision for all subnets of different capacities and neglect the valuable knowledge generated by subnets during training. In this manuscript, we mitigate the significant knowledge distillation gap caused by using the same kind of supervision and advocate leveraging the subnets to provide diverse knowledge. Based on this motivation, we propose a group knowledge based training framework for boosting the performance of residual networks. Specifically, we implicitly divide all subnets into hierarchical groups by subnet-in-subnet sampling, aggregate the knowledge of different subnets in each group during training, and exploit upper-level group knowledge to supervise lower-level subnet group. Meanwhile, we also develop a subnet sampling strategy that naturally samples larger subnets, which are found to be more helpful than smaller subnets in boosting performance for hierarchical groups. Compared with typical subnet training and other methods, our method achieves the best efficiency and performance trade-offs on multiple datasets and network structures. The code is at https://github.com/tsj-001/AAAI24-GKT.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Boosting Residual Networks with Group Knowledge

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Multiple Residual Learning Network for Single Image Super-Resolution
Renhe Liu ... Guoqing Lei
-
Renhe Liu, et. al.Renhe Liu ... Guoqing Lei
01 Dec 2018
01 Dec 2018

A Deep Learning Method for Bearing Fault Diagnosis through Stacked Residual Dilated Convolutions
Zilong Zhuang ... Huichun Lv
Applied Sciences | VOL. 9
Zilong Zhuang, et. al.Zilong Zhuang ... Huichun Lv
01 May 2019
Applied Sciences | VOL. 9

Research on Improved Residual Network Classification Method for Defect Recognition of Thermal Battery
Wenchao Xu ... Tao Zhao
IEEE Access | VOL. 10
Wenchao Xu, et. al.Wenchao Xu ... Tao Zhao
01 Jan 2021
IEEE Access | VOL. 10

Study on Stellar Spectra Classification Based on Multitask Residual Neural Network
Yuxiang Lu ... Jingchang Pan
-
Yuxiang Lu, et. al.Yuxiang Lu ... Jingchang Pan
01 May 2020
01 May 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Boosting Residual Networks with Group Knowledge

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence