Swin-Transformer-Enabled YOLOv5 with Attention Mechanism for Small Object Detection on Satellite Images

Hang Gong, Bin Wang, Haishan Dai, Abudusalamu Tuniyazi, Chunlai Li, Wenjing Wang, Zhiping He, Feng Han, Xuechan Lang, Zhiyuan Li, Haoyang Li, Qiuxia Li, Tingkui Mu

doi:10.3390/rs14122861

Abstract

Object detection in natural images has made tremendous progress over the last decade. However, detectors designed for natural images perform poorly when applied directly to satellite images, because the bird’s-eye perspective of satellite photographs produces intrinsic differences in the scale and orientation of objects. Moreover, the background of satellite images is complex and the objects occupy small areas; as a result, small objects tend to be missed because their features are hard to extract. Dense object overlap and occlusion also degrade detection performance. Although the self-attention mechanism has been introduced to detect small objects, its computational complexity grows with the image’s resolution. To resolve these problems, we modified the general one-stage detector YOLOv5 to adapt it to satellite images. First, new feature fusion layers and a prediction head are added from the shallow layer for small object detection for the first time, because the shallow layer maximally preserves feature information. Second, the original convolutional prediction heads are replaced with Swin Transformer Prediction Heads (SPHs) for the first time. SPH is an advanced self-attention mechanism whose shifted window design reduces the computational complexity to linear. Finally, Normalization-based Attention Modules (NAMs) are integrated into YOLOv5 to improve attention performance in a normalized way. The improved YOLOv5 is termed SPH-YOLOv5. It is evaluated on the NWPU-VHR10 and DOTA datasets, which are widely used to evaluate object detection on satellite images. Compared with the baseline YOLOv5, SPH-YOLOv5 improves the mean Average Precision (mAP) by 0.071 on the DOTA dataset.
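To make the complexity claim concrete, the sketch below compares rough per-layer operation counts for global self-attention and window-based self-attention. It is not taken from this article: the two cost estimates are the ones given in the original Swin Transformer paper, and the function names, feature map sizes, channel count (96), and window size (7) are illustrative assumptions.

```python
def global_attention_cost(height, width, channels):
    """Approximate cost of global multi-head self-attention.

    Estimate from the Swin Transformer paper: 4*h*w*C^2 + 2*(h*w)^2*C.
    The second term is quadratic in the number of tokens h*w.
    """
    tokens = height * width
    return 4 * tokens * channels**2 + 2 * tokens**2 * channels


def window_attention_cost(height, width, channels, window):
    """Approximate cost of window-based self-attention.

    Estimate from the Swin Transformer paper: 4*h*w*C^2 + 2*M^2*h*w*C,
    where M is the (fixed) window size, so the cost is linear in h*w.
    """
    tokens = height * width
    return 4 * tokens * channels**2 + 2 * window**2 * tokens * channels


if __name__ == "__main__":
    channels, window = 96, 7  # illustrative values, not from the article
    for side in (40, 80, 160):  # hypothetical feature map sizes
        g = global_attention_cost(side, side, channels)
        w = window_attention_cost(side, side, channels, window)
        print(f"{side}x{side}: global={g:.3e}  window={w:.3e}  ratio={g / w:.1f}")
```

Doubling the side of the feature map quadruples the number of tokens. Under these estimates the window-based cost also quadruples, while the global cost grows faster, which is the scaling behaviour the abstract refers to.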

Journal: Remote Sensing
Publication Date: Jun 15, 2022
Citations: 104
License type: CC BY 4.0

Similar Papers

MDCT: Multi-Kernel Dilated Convolution and Transformer for One-Stage Object Detection of Remote Sensing Images
Juanjuan Chen ... Junjie Xu
Remote Sensing | VOL. 15
07 Jan 2023

Detection of Multiclass Objects in Satellite Images Using an Improved Algorithmic Approach
Abhimanyu Singh ... Manisha J Nene
08 Dec 2022

Towards Efficient Detection for Small Objects via Attention-Guided Detection Network and Data Augmentation
Xiaobin Wang ... Dekang Zhu
Sensors | VOL. 22
09 Oct 2022

Small Object Difficulty (SOD) Modeling for Objects Detection in Satellite Images
Debojyoti Biswas ... Jelena Tesic
04 Dec 2022
