Abstract

Object detection algorithms for Internet of Things (IoT) devices must typically combine high real-time performance with low computational complexity. The real-time object detection network You Only Look Once Version 3 (YOLOv3) exploits multi-scale features through a feature pyramid network structure and achieves good accuracy while maintaining fast detection speed. The feature pyramid network in YOLOv3 consists of bottom-up feature extraction, top-down up-sampling, and lateral connections between low-level detail features and high-level semantic features. However, not all features are useful for object detection. In this article, a novel object detection network, Spatial Attention based YOLOv3 (SA-YOLOv3), is proposed. The proposed method adds a spatial attention network to the top-down up-sampling path. The spatial attention network computes a feature weight matrix from the up-sampled feature map, which SA-YOLOv3 uses to filter the low-level features and retain the more valuable ones. Finally, the selected low-level feature map and the high-level feature map are concatenated, yielding feature maps that carry both spatial detail and rich semantic information. Experimental results on the PASCAL VOC2012 and RSOD datasets show that SA-YOLOv3 outperforms YOLOv3.
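As a concrete illustration of the mechanism described above, the following is a minimal PyTorch sketch, not the authors' implementation: the class name SpatialAttentionBlock and the choice of a 1 × 1 convolution followed by a sigmoid to produce the weight matrix are assumptions; the abstract only states that a weight matrix is computed from the up-sampled feature map and used to filter the low-level features before concatenation.

```python
# Minimal sketch of the spatial-attention gating described in the abstract.
# NOT the authors' code: the 1x1-conv + sigmoid gate is an assumed design.
import torch
import torch.nn as nn

class SpatialAttentionBlock(nn.Module):
    def __init__(self, high_channels: int):
        super().__init__()
        # Collapse the up-sampled high-level map to a single-channel
        # spatial weight matrix in [0, 1] (assumed form of the gate).
        self.weight = nn.Sequential(
            nn.Conv2d(high_channels, 1, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, low: torch.Tensor, high_up: torch.Tensor) -> torch.Tensor:
        # low:     low-level detail features from the backbone (N, C_low, H, W)
        # high_up: up-sampled high-level semantic features     (N, C_high, H, W)
        w = self.weight(high_up)          # (N, 1, H, W) spatial weight matrix
        filtered = low * w                # suppress less useful detail features
        return torch.cat([filtered, high_up], dim=1)  # fuse detail + semantics
```

Broadcasting the single-channel weight matrix across all channels of the low-level map keeps the gate purely spatial, matching the paper's description of filtering in the spatial domain.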

Highlights

  • Object detection is a key step for Internet of Things (IoT) devices to achieve intelligent perception and recognition of objects and processes, in applications such as autonomous driving [1], [2] and robot vision [3]

  • We propose the Spatial Attention based You Only Look Once Version 3 (YOLOv3) network, SA-YOLOv3, which adds a spatial attention network to YOLOv3's top-down up-sampling path


Summary

INTRODUCTION

Object detection is a key step for IoT devices to realize intelligent perception and recognition of objects and processes, in applications such as autonomous driving [1], [2] and robot vision [3]. In the proposed spatial attention network, one branch passes the up-sampled feature map through a location network to extract a weight matrix Z, and the other branch multiplies the low-level detail feature map L by Z to generate the attention map Y in the spatial domain. In the multi-scale detection network, the A2, B2, and C2 layers correspond to the output feature maps of the three DBL × 5 modules, respectively. The weight matrix W1 is multiplied by the low-level detail feature map B1 from Darknet-53, and the resulting map SA1_out (size 26 × 26 × 512) is the output of the SA1 network. SA2_out is then concatenated with the high-level semantic feature map to generate the output feature map C2 of the SA-block module.
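To make the tensor shapes concrete, the short script below instantiates the same assumed gating at YOLOv3's 26 × 26 scale, where B1 from Darknet-53 has 512 channels and the up-sampled semantic map has 256 channels (standard YOLOv3 sizes); as before, the gate design is an assumption, not the authors' exact code.

```python
# Shape check for the SA1 stage described above, using standard YOLOv3
# sizes at the 26x26 scale. The 1x1-conv + sigmoid gate is the same
# assumed design as in the earlier sketch.
import torch
import torch.nn as nn

gate = nn.Sequential(nn.Conv2d(256, 1, kernel_size=1), nn.Sigmoid())

b1 = torch.randn(1, 512, 26, 26)       # low-level detail map B1 (Darknet-53)
high_up = torch.randn(1, 256, 26, 26)  # up-sampled high-level semantic map

w1 = gate(high_up)                     # weight matrix W1, shape (1, 1, 26, 26)
sa1_out = b1 * w1                      # SA1_out: filtered detail map, 26x26x512
fused = torch.cat([sa1_out, high_up], dim=1)
print(sa1_out.shape, fused.shape)      # (1, 512, 26, 26) (1, 768, 26, 26)
```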

EXPERIMENTS
Findings
CONCLUSION
