Detection In Aerial Images Research Articles

Progress in object detection is crucial for extracting rich scene information from aerial imagery with computer vision. However, aerial image detection presents many challenges, such as large backgrounds, small object sizes, and dense object distributions. This research addresses the specific challenges of small object detection in aerial images and proposes an improved YOLOv8s-based detector named Aerial Images Detector-YOLO (AID-YOLO). Specifically, this study takes the General Efficient Layer Aggregation Network (GELAN) from YOLOv9 as a reference and designs a four-branch skip-layer connection and split operation module, Re-parameterization-Net with Cross-Stage Partial CSP and Efficient Layer Aggregation Networks (RepNCSPELAN4), to achieve a lightweight network while capturing richer feature information. To fuse multi-scale features and focus more on the target detection regions, a new multi-channel feature extraction module named Convolutional Block Attention Module with Two Convolutions Efficient Layer Aggregation Networks (C2FCBAM) is designed in the neck of the network. In addition, to reduce the sensitivity to position bias of small objects, a new weight-adaptive loss function, Normalized Weighted Distance Complete Intersection over Union (NWD-CIoU_Loss), is designed in this study. We evaluate the proposed AID-YOLO method through ablation experiments and comparisons with other advanced models on the VEDAI (512, 1024) and DOTAv1.0 datasets. The results show that, compared to the YOLOv8s baseline model, AID-YOLO improves the mAP@0.5 metric by 7.36% on the VEDAI dataset. At the same time, the number of parameters is reduced by 31.7%, achieving a good balance between accuracy and parameter count. The Average Precision (AP) for small objects improves by 8.9% compared to the baseline model (YOLOv8s), making AID-YOLO one of the top performers among all compared models. Furthermore, its FPS is suitable for real-time detection in aerial image scenarios. The AID-YOLO method also performs well on infrared images in the VEDAI1024 (IR) dataset, with a 2.9% improvement in the mAP@0.5 metric. We further validate the detection and generalization performance of AID-YOLO in multi-modal and multi-task scenarios through comparisons with other methods on images of different resolutions and on the SODA-A and DOTAv1.0 datasets. In summary, the results of this study confirm that the AID-YOLO method significantly improves detection performance while using fewer parameters, making it applicable to practical engineering tasks in aerial image object detection.
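
The motivation for a loss that is less sensitive to position bias can be illustrated with plain IoU. The sketch below is not the NWD-CIoU loss from the abstract; it is a minimal, assumed example showing that the same one-pixel offset costs a small box far more IoU than a large box. The box sizes (6 px and 36 px) and the helper names `iou` and `shifted` are hypothetical choices made for illustration.

```python
def iou(box_a, box_b):
    """Intersection over Union for two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def shifted(box, dx, dy):
    """Return a copy of the box translated by (dx, dy) pixels."""
    x1, y1, x2, y2 = box
    return (x1 + dx, y1 + dy, x2 + dx, y2 + dy)


# Hypothetical ground-truth boxes: a 6x6 "small" object and a 36x36 "large" one.
small = (0, 0, 6, 6)
large = (0, 0, 36, 36)

# Apply the same one-pixel diagonal prediction error to both.
small_iou = iou(small, shifted(small, 1, 1))  # 25 / 47, about 0.53
large_iou = iou(large, shifted(large, 1, 1))  # 1225 / 1367, about 0.90

print(f"small box IoU: {small_iou:.3f}")
print(f"large box IoU: {large_iou:.3f}")
```

With an identical localization error, the small box loses almost half of its IoU while the large box loses about a tenth, which is the kind of imbalance that distance-based terms in a loss are meant to soften.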


General-purpose target detection algorithms perform well for large- and medium-sized targets but struggle with small ones. With the growing importance of aerial images in urban transportation and environmental monitoring, detecting small targets in such imagery has become an active research topic. The challenge in small object detection lies in the small proportion of pixels each object occupies and the resulting difficulty of feature extraction. Moreover, current mainstream detection algorithms tend to be overly complex, leading to structural redundancy for small objects. To address these challenges, this paper proposes the PCSG model, based on YOLOv5, which optimizes both the detection head and the backbone network. (1) An enhanced detection head is introduced, featuring a new structure that improves the feature pyramid network and the path aggregation network. This enhancement strengthens the model’s shallow feature reuse capability and introduces a dedicated detection layer for smaller objects. Additionally, redundant structures in the network are pruned, and the lightweight and versatile upsampling operator CARAFE is used to optimize upsampling. (2) The paper uses a module named SPD-Conv to replace the strided convolution and pooling structures in YOLOv5, thereby enhancing the backbone’s feature extraction capability. Furthermore, Ghost convolution is used to reduce the parameter count, ensuring that the backbone meets the real-time needs of aerial image detection. Experimental results on the RSOD dataset show that the PCSG model achieves superior detection performance. The mAP increases from 97.1% to 97.8%, while the number of model parameters decreases by 22.3%, from 1,761,871 to 1,368,823. These findings demonstrate the effectiveness of the approach.
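
The space-to-depth step that gives SPD-Conv its name can be sketched in a few lines. The example below is an assumed, framework-free illustration rather than the PCSG implementation: it rearranges each `scale x scale` spatial block into separate channels, so resolution is halved without discarding any pixel, in contrast to strided convolution or pooling. The function name `space_to_depth`, the nested-list layout (channels, rows, columns), and the ordering of the output channels are all choices made here for illustration; in SPD-Conv this step is followed by a non-strided convolution, which is omitted.

```python
def space_to_depth(feature_map, scale=2):
    """Rearrange spatial blocks into channels.

    feature_map: list of channels, each a list of rows (H x W).
    Returns a list of C * scale**2 channels, each (H / scale) x (W / scale).
    """
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    if height % scale or width % scale:
        raise ValueError("height and width must be divisible by scale")

    out = []
    for channel in feature_map:
        # Each (dy, dx) offset selects one pixel from every scale x scale block.
        for dy in range(scale):
            for dx in range(scale):
                sub_map = [row[dx::scale] for row in channel[dy::scale]]
                out.append(sub_map)
    return out


# One 4x4 channel with distinct values so the rearrangement is easy to follow.
x = [[[0, 1, 2, 3],
      [4, 5, 6, 7],
      [8, 9, 10, 11],
      [12, 13, 14, 15]]]

y = space_to_depth(x)

for index, sub_map in enumerate(y):
    print(index, sub_map)
```

Every input value appears exactly once in the output, which is the property that makes this kind of downsampling attractive for small objects that occupy only a few pixels.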



Articles published on Detection In Aerial Images

Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images.

Variational Autoencoder with Gaussian Random Field prior: Application to unsupervised animal detection in aerial images

Stage-by-Stage Adaptive Alignment Mechanism for Object Detection in Aerial Images

AID-YOLO: An Efficient and Lightweight Network Method for Small Target Detector in Aerial Images

Quad Gaussian Networks for Vehicle Detection in Aerial Images.

DFS-DETR: Detailed-Feature-Sensitive Detector for Small Object Detection in Aerial Images Using Transformer

Black-box adversarial patch attacks using differential evolution against aerial imagery object detectors

FR-YOLOv7: feature enhanced YOLOv7 for rotated small object detection in aerial images

Image augmentation approaches for small and tiny object detection in aerial images: a review

MPE-YOLO: enhanced small target detection in aerial imaging

Recent Advances for Aerial Object Detection: A Survey

Position information encoding FPN for small object detection in aerial images

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

An Aerial Image Detection Algorithm Based on Improved YOLOv5.

AFDet: alignment and focusing for aerial object detection

A deep neural network for vehicle detection in aerial images

Multi-scale object detection in UAV images based on adaptive feature fusion.

DCEF2-YOLO: Aerial Detection YOLO with Deformable Convolution–Efficient Feature Fusion for Small Target Detection

MS-YOLO: integration-based multi-subnets neural network for object detection in aerial images

Deep Learning Models for Waterfowl Detection and Classification in Aerial Images

