PETNet: A YOLO-based prior enhanced transformer network for aerial image detection

Tianyu Wang,Zhongjing Ma,Tao Yang,Suli Zou

doi:10.1016/j.neucom.2023.126384

Abstract

Unmanned aerial vehicles (UAVs) have been applied to inspect in various scenarios due to their high efficiency, low cost, and excellent mobility. However, the objects in aerial images are much smaller and denser than general objects, causing it difficult for current object detection methods to achieve the expected results. To solve this issue, a prior enhanced Transformer network (PETNet) based on YOLO is proposed in this paper. Specifically, a novel prior enhanced Transformer (PET) module and a one-to-many feature fusion (OMFF) mechanism are proposed to embed into the network. Two additional detection heads are added to the shallow feature maps. In this work, PET is used to capture enhanced global information to improve the expressive ability of the network. The OMFF aims to fuse multi-type features to minimize the information loss of small objects. In addition, the added detection heads provide more possibility of detecting smaller-scale objects, and the extended multi-head parallel detection is more suitable for the multi-scale transformation of objects in aerial images. On the VisDrone-2021 and UAVDT databases, the proposed PETNet achieves state-of-the-art results with average precision (AP) of 35.3 and 21.5, respectively, which indicates that the proposed network is more suitable for aerial image detection and is of a great reference value.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PETNet: A YOLO-based prior enhanced transformer network for aerial image detection

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: May 26, 2023
Citations: 9

Similar Papers

Improved YOLOX-X based UAV aerial photography object detection algorithm
Xin Wang ... Ming Chen
Image and Vision Computing | VOL. 135
Xin Wang, et. al.Xin Wang ... Ming Chen
19 May 2023
Image and Vision Computing | VOL. 135

Novel up-scale feature aggregation for object detection in aerial images
Hu Lin ... Qiong Liu
Neurocomputing | VOL. 411
Hu Lin, et. al.Hu Lin ... Qiong Liu
12 Jun 2020
Neurocomputing | VOL. 411

Adaptive Period Embedding for Representing Oriented Objects in Aerial Images
Yixing Zhu ... Xueqing Wu
IEEE Transactions on Geoscience and Remote Sensing | VOL. 58
Yixing Zhu, et. al.Yixing Zhu ... Xueqing Wu
01 Oct 2020
IEEE Transactions on Geoscience and Remote Sensing | VOL. 58

Attentional single-shot network with multi-scale feature fusion for object detection in aerial images
Yusheng Wang ... Hongzhang Wang
-
Yusheng Wang, et. al.Yusheng Wang ... Hongzhang Wang
06 Nov 2020
06 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PETNet: A YOLO-based prior enhanced transformer network for aerial image detection

Abstract

Talk to us

Similar Papers

More From: Neurocomputing