Improved YOLOv4-tiny based on attention mechanism for skin detection.

Ping Li,Yifei Ren,Taiyu Han,Peng Xu,Hongliu Yu

doi:10.7717/peerj-cs.1288

Abstract

An automatic bathing robot needs to identify the area to be bathed in order to perform visually-guided bathing tasks. Skin detection is the first step. The deep convolutional neural network (CNN)-based object detection algorithm shows excellent robustness to light and environmental changes when performing skin detection. The one-stage object detection algorithm has good real-time performance, and is widely used in practical projects. In our previous work, we performed skin detection using Faster R-CNN (ResNet50 as backbone), Faster R-CNN (MobileNetV2 as backbone), YOLOv3 (DarkNet53 as backbone), YOLOv4 (CSPDarknet53 as backbone), and CenterNet (Hourglass as backbone), and found that YOLOv4 had the best performance. In this study, we considered the convenience of practical deployment and used the lightweight version of YOLOv4, i.e., YOLOv4-tiny, for skin detection. Additionally, we added three kinds of attention mechanisms to strengthen feature extraction: SE, ECA, and CBAM. We added the attention module to the two feature layers of the backbone output. In the enhanced feature extraction network part, we applied the attention module to the up-sampled features. For full comparison, we used other lightweight methods that use MobileNetV1, MobileNetV2, and MobileNetV3 as the backbone of YOLOv4. We established a comprehensive evaluation index to evaluate the performance of the models that mainly reflected the balance between model size and mAP. The experimental results revealed that the weight file of YOLOv4-tiny without attention mechanisms was reduced to 9.2% of YOLOv4, but the mAP maintained 67.3% of YOLOv4. YOLOv4-tiny's performance improved after combining the CBAM and ECA modules, but the addition of SE deteriorated the performance of YOLOv4-tiny. MobileNetVX_YOLOv4 (X = 1, 2, 3), which used MobileNetV1, MobileNetV2, and MobileNetV3 as the backbone of YOLOv4, showed higher mAP than YOLOv4-tiny series (including YOLOv4-tiny and three improved YOLOv4-tiny based on the attention mechanism) but had a larger weight file. The network performance was evaluated using the comprehensive evaluation index. The model, which integrates the CBAM attention mechanism and YOLOv4-tiny, achieved a good balance between model size and detection accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PeerJ. Computer science	Publication Date: Mar 10, 2023
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improved YOLOv4-tiny based on attention mechanism for skin detection.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science

Lead the way for us

Similar Papers

Automatic Identification of Individual Primates with Deep Learning Techniques.
Songtao Guo ... Zhihui Shi
iScience | VOL. 23
Songtao Guo, et. al.Songtao Guo ... Zhihui Shi
25 Jul 2020
iScience | VOL. 23

Few-shot Object Detection Model based on Transfer Learning and Convolutional Neural Network
Hou Kaifa ... Wang Hongmei
Journal of Imaging Science and Technology | VOL. 67
Hou Kaifa, et. al.Hou Kaifa ... Wang Hongmei
01 Jul 2023
Journal of Imaging Science and Technology | VOL. 67

Optimization of YOLOv7 Based on PConv, SE Attention and Wise-IoU
Liu Zhigang ... Bi Kaiyu
International Journal of Computational Intelligence and Applications | VOL. 23
Liu Zhigang, et. al.Liu Zhigang ... Bi Kaiyu
01 Mar 2024
International Journal of Computational Intelligence and Applications | VOL. 23

A New Pulmonary Nodule Detection Based on Multiscale Convolutional Neural Network with Channel and Attention Mechanism
Yingying Zhao ... Jiaxin Wang
-
Yingying Zhao, et. al.Yingying Zhao ... Jiaxin Wang
02 Jul 2022
02 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved YOLOv4-tiny based on attention mechanism for skin detection.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science