A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection

Wanjie Lu, Qunshan Shi, Chaoyang Niu, Liang Lyu, Wei Liu, Chaozhen Lan, Shiju Wang

doi:10.1109/jstars.2023.3234161

Abstract

Object detection in unmanned aerial vehicle (UAV) images has widespread applications; however, the complex backgrounds, diverse scales, and uneven distribution of objects in UAV images make it a challenging task. This study proposes a convolutional neural network (CNN)-transformer hybrid model for efficient object detection in UAV images. The model has three advantages that improve detection performance. First, the cross-shaped window (CSWin) transformer is used as the backbone to obtain image features at different levels, and these features are fed into a feature pyramid network to achieve a multiscale representation, which benefits multiscale object detection. Second, a hybrid patch embedding module is constructed to extract and use low-level information such as image edges and corners. Finally, a slicing-based inference method is constructed to fuse the inference results of the original image and sliced images, which improves small-object detection accuracy without modifying the original network. Experimental results on public datasets show that the proposed method achieves better detection performance than several popular and state-of-the-art object detection methods.
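The slicing-based inference idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the slices are laid out or how the results are fused, so everything below is an assumption made for illustration: fixed-size overlapping windows, a plain greedy non-maximum suppression (NMS) as the fusion step, a single object class, and hypothetical names such as `slice_windows` and `fuse_detections`. It is not the authors' implementation.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float, float]  # (x1, y1, x2, y2, score)
Window = Tuple[int, int, int, int]              # (x0, y0, x1, y1) in image pixels


def slice_windows(width: int, height: int, size: int, overlap: float) -> List[Window]:
    """Cover the image with size x size windows that overlap by the given fraction."""
    step = max(1, int(size * (1.0 - overlap)))

    def starts(length: int) -> List[int]:
        if length <= size:
            return [0]
        offsets = list(range(0, length - size, step))
        offsets.append(length - size)  # last window is flush with the image border
        return offsets

    return [(x, y, min(x + size, width), min(y + size, height))
            for y in starts(height) for x in starts(width)]


def to_image_coords(dets: List[Box], window: Window) -> List[Box]:
    """Shift slice-local boxes back into original-image coordinates."""
    x0, y0 = window[0], window[1]
    return [(x1 + x0, y1 + y0, x2 + x0, y2 + y0, s) for x1, y1, x2, y2, s in dets]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def fuse_detections(full_dets: List[Box],
                    sliced: List[Tuple[Window, List[Box]]],
                    iou_thr: float = 0.5) -> List[Box]:
    """Pool full-image and slice detections, then apply greedy NMS (assumed fusion rule)."""
    candidates = list(full_dets)
    for window, dets in sliced:
        candidates.extend(to_image_coords(dets, window))
    candidates.sort(key=lambda d: d[4], reverse=True)  # highest score first
    kept: List[Box] = []
    for det in candidates:
        if all(iou(det, k) < iou_thr for k in kept):
            kept.append(det)
    return kept
```

In use, the same unmodified detector would be run once on the full image and once on each crop returned by `slice_windows`, and the per-crop outputs passed to `fuse_detections` together with their windows. Because small objects occupy more pixels relative to a crop than to the full frame, this kind of scheme can help small-object recall without retraining, which is consistent with the motivation stated in the abstract.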


Journal: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Publication Date: Jan 1, 2023
Citations: 23
License type: CC BY 4.0

Similar Papers

HRCTNet: a hybrid network with high-resolution representation for object detection in UAV image
Wenjie Xing ... Jing Qi
Complex & Intelligent Systems | VOL. 9
15 May 2023

Small object detection in UAV image based on improved YOLOv5
Jian Zhang ... Zhiyuan Huang
Systems Science & Control Engineering | VOL. 11
15 Aug 2023

Learning-based Object Detection in High Resolution UAV Images: An Empirical Study
Haijun Zhang ... Shichao Xu
01 Jul 2019

Object Detection in UAV Images via Global Density Fused Convolutional Network
Ruiqian Zhang ... Xiao Huang
Remote Sensing | VOL. 12
24 Sep 2020
