MobileNetV3 With CBAM for Bamboo Stick Counting

Liangquan Jia,Linhua Jiang,Huanan Leng,Zhanchun Xiao,Wei Long,Yawen Wang,Ying Zang,Quanfeng Li

doi:10.1109/access.2022.3175818

Abstract

This study aims to solve the problems of inaccurate weighing of bamboo sticks and inefficient manual counting. To overcome this problem, an improved MobileNetV3 model and a counting algorithm suitable for bamboo sticks—combined with a spatial-temporal attention mechanism—are proposed in this paper. Inspired by the idea of EfficientNet, scaling coefficients are used to scale the MobileNetV3 network structure as a whole in terms of width and height. The optimal model for bamboo-stick recognition is screened initially, and then the algorithm uses the convolutional block attention module (CBAM) attention mechanism to replace the squeeze-and-excitation (SE) attention mechanism in the MobileNetV3 network structure to allow the network to extract features in the two dimensions of channel and space. Since the number of bamboo sticks in a single image is extremely dense—generally around 1000–3000—it is difficult to effectively count them with existing algorithms. The proposed algorithm divides the image into multiple equally sized blocks and then uses the boundary processing algorithm to merge the cut bamboo stick images and count the number of sticks. Experimental results show that the proposed algorithm can effectively perform near real-time detection on a mobile terminal and its accuracy can reach approximately 97%, which is in line with actual production applications.

Full Text