A Novel Transformer-Based Adaptive Object Detection Method

Shuzhi Su,Xianjin Fang,Runbin Chen,Tian Zhang

doi:10.3390/electronics12030478

Shuzhi Su, Xianjin Fang + Show 2 more

Open Access

https://doi.org/10.3390/electronics12030478

Copy DOI

Journal: Electronics	Publication Date: Jan 17, 2023
Citations: 2	License type: CC BY 4.0

Affiliation: Anhui University of Science and Technology

Abstract

To accurately detect multi-scale remote sensing objects in complex backgrounds, we propose a novel transformer-based adaptive object detection method. The backbone network of the method is a dual attention vision transformer network that utilizes spatial window attention and channel group attention to capture feature interactions between different objects in complex scenes. We further design an adaptive path aggregation network. In the designed network, CBAM (Convolutional Block Attention Module) is utilized to suppress background information in the fusion paths of different-level feature maps, and new paths are introduced to fuse same-scale feature maps to increase the feature information of the feature maps. The designed network can provide more effective feature information and improve the feature representation capability. Experiments conducted on the three datasets of RSOD, NWPU VHR-10, and DIOR show that the mAP of our method is 96.9%, 96.6%, and 81.7%, respectively, which outperforms compared object detection methods. The experimental results show that our method can detect remote-sensing objects better.

Full Text