Performance-Efficiency Comparisons of Channel Attention Modules for ResNets

Sander R Klomp, Rob G J Wijnhoven, Peter H N De With

doi:10.1007/s11063-023-11161-z

Open Access

https://doi.org/10.1007/s11063-023-11161-z

Abstract

Attention modules can be added to neural network architectures to improve performance. This work presents an extensive comparison between several efficient attention modules for image classification and object detection, in addition to proposing a novel Attention Bias module with lower computational overhead. All measured attention modules have been efficiently re-implemented, which allows an objective comparison and evaluation of the relationship between accuracy and inference time. Our measurements show that single-image inference time increases far more (5–50%) than the increase in FLOPs suggests (0.2–3%) for a limited gain in accuracy, making computation cost an important selection criterion. Despite this increase in inference time, adding an attention module can outperform a deeper baseline ResNet in both speed and accuracy. Finally, we investigate the potential of adding attention modules to pretrained networks and show that fine-tuning is possible and superior to training from scratch. The choice of the best attention module strongly depends on the specific ResNet architecture, input resolution, batch size and inference framework.
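For readers unfamiliar with the term, the following is a minimal sketch of what a channel attention module computes, assuming the generic squeeze-and-excitation pattern (global average pooling, a small bottleneck, a sigmoid gate per channel). It is an illustration only: it is not the authors' implementation, it is not the Attention Bias module proposed in the paper, and the function name, weight shapes, and sizes are hypothetical choices made for this example.

```python
import math
import random


def channel_attention(x, w1, w2):
    """SE-style channel attention on a feature map x shaped [C][H][W].

    w1: [C_reduced][C] weights of the reduction layer (hypothetical shape).
    w2: [C][C_reduced] weights of the expansion layer (hypothetical shape).
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average pooling yields one scalar per channel.
    pooled = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in x]
    # Excitation: bottleneck layer with ReLU, then a sigmoid gate per channel.
    hidden = [max(0.0, sum(w * p for w, p in zip(row, pooled))) for row in w1]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    out = [[[v * g for v in row] for row in ch] for ch, g in zip(x, gates)]
    return out, gates


random.seed(0)
C, H, W, R = 8, 4, 4, 2  # channels, height, width, reduction ratio (illustrative)
x = [[[random.gauss(0, 1) for _ in range(W)] for _ in range(H)] for _ in range(C)]
w1 = [[random.gauss(0, 0.5) for _ in range(C)] for _ in range(C // R)]
w2 = [[random.gauss(0, 0.5) for _ in range(C // R)] for _ in range(C)]
y, gates = channel_attention(x, w1, w2)
```

The arithmetic added per block is small compared with the convolutions around it, which is consistent with the abstract's point that FLOPs alone understate the measured inference-time cost of such modules.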

Journal: Neural Processing Letters
Publication Date: Feb 11, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

Inner-imaging 3D attention module for residual network
Wenjie Liu ... Fuji Ren
Applied Intelligence | VOL. 53
13 Apr 2022

YOLO V4 with hybrid dilated convolution attention module for object detection in the aerial dataset
Kun Wang ... Zeyi Wei
International Journal of Remote Sensing | VOL. 43
16 Feb 2022

One Spatio-Temporal Sharpening Attention Mechanism for Light-Weight YOLO Models Based on Sharpening Spatial Attention
Mengfan Xue ... Yunfei Guo
Sensors | VOL. 21
28 Nov 2021

Balance Multi-Head Attention based on Software and Hardware Co-design
Dian Xu ... Qingsong Shi
01 Jun 2022
