SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference

Boren Hu, Siliang Tang, Yun Zhu, Jiacheng Li

doi:10.24963/ijcai.2023/563

Abstract

Dynamic early exiting has been shown to improve the inference speed of pre-trained language models such as BERT. However, every sample must still pass through all consecutive layers before its exit point, and more complex samples usually pass through more layers, so redundant computation remains. In this paper, we propose SmartBERT, a novel dynamic early exiting method combined with layer skipping for BERT inference, which adds a skipping gate and an exiting operator to each layer of BERT. SmartBERT can adaptively skip some layers and adaptively choose whether to exit. In addition, we propose cross-layer contrastive learning and incorporate it into our training phases to strengthen the intermediate layers and classifiers, which benefits early exiting. To keep the usage of skipping gates consistent between the training and inference phases, we propose a hard weight mechanism during the training phase. We conduct experiments on eight classification datasets of the GLUE benchmark. Experimental results show that SmartBERT achieves a 2-3× computation reduction with minimal accuracy drops compared with BERT, and that our method outperforms previous methods in both efficiency and accuracy. Moreover, on some complex datasets, we show that early exiting based on entropy hardly works and that the skipping mechanism is essential for reducing computation.
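
To make the inference-time control flow described in the abstract concrete, here is a minimal sketch of layer skipping combined with entropy-based early exiting. It is an illustration under stated assumptions, not the authors' implementation: the function names, the toy layers, and both thresholds (`skip_threshold`, `exit_entropy`) are hypothetical, and the abstract does not specify how the skipping gate or exiting operator is computed.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def infer(
    hidden: Vector,
    layers: Sequence[Callable[[Vector], Vector]],
    gates: Sequence[Callable[[Vector], float]],
    classifiers: Sequence[Callable[[Vector], Vector]],
    skip_threshold: float = 0.5,  # hypothetical value, not from the paper
    exit_entropy: float = 0.3,    # hypothetical value, not from the paper
) -> Tuple[Vector, int]:
    """Run the layers in order; return (class probabilities, layers executed)."""
    executed = 0
    probs = None
    for layer, gate, classifier in zip(layers, gates, classifiers):
        # Assumed gate semantics: a score in [0, 1]; a high score means "skip".
        if gate(hidden) >= skip_threshold:
            continue
        hidden = layer(hidden)
        executed += 1
        # Exit check: stop once the intermediate classifier is confident,
        # i.e. the entropy of its prediction falls below the threshold.
        probs = softmax(classifier(hidden))
        if entropy(probs) < exit_entropy:
            break
    if probs is None:
        # Every layer was skipped: fall back to the last classifier.
        probs = softmax(classifiers[-1](hidden))
    return probs, executed


# Toy stand-ins for transformer layers, gates, and per-layer classifiers.
layers = [lambda h: [h[0] + 1.0] for _ in range(4)]
gates = [lambda h: 0.9, lambda h: 0.1, lambda h: 0.1, lambda h: 0.1]
classifiers = [lambda h: [2.0 * h[0], 0.0] for _ in range(4)]

probs, executed = infer([0.0], layers, gates, classifiers)
print(f"executed {executed} of {len(layers)} layers, probs={probs}")
```

In this toy run the first layer is skipped by its gate, and the loop exits as soon as an intermediate classifier's prediction entropy drops below the threshold, so only part of the stack is executed.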

Similar Papers

Skeleton-Based Human Motion Prediction With Privileged Supervision.
Minjing Dong ... Chang Xu
IEEE transactions on neural networks and learning systems | VOL. 34
01 Dec 2023

Learning to Match Anchors for Visual Object Detection.
Xiaosong Zhang ... Qixiang Ye
IEEE transactions on pattern analysis and machine intelligence | VOL. 44
12 Jan 2021

Enhancing Visual Question Answering Using Dropout
Zhiwei Fang ... Yanyuan Qiao
15 Oct 2018

A Random Focusing Method with Jensen–Shannon Divergence for Improving Deep Neural Network Performance Ensuring Architecture Consistency
Wonjik Kim
Neural Processing Letters | VOL. 56
17 Jun 2024
