DCEF2-YOLO: Aerial Detection YOLO with Deformable Convolution–Efficient Feature Fusion for Small Target Detection

Yeonha Shin, Jaehyuk Youn, Minyoung Back, Heesub Shin, Jaewoo Ok, Sungho Kim

doi:10.3390/rs16061071

Open Access

Journal: Remote Sensing
Publication Date: Mar 18, 2024
Citations: 1
License type: CC BY 4.0

Affiliation: Yeungnam University

Abstract

Deep learning technology for real-time small object detection in aerial images can be used in various industrial environments such as real-time traffic surveillance and military reconnaissance. However, detecting small objects with few pixels and low resolution remains a challenging problem that requires performance improvement. To improve the performance of small object detection, we propose DCEF2-YOLO. Our proposed method enables efficient real-time small object detection by using a deformable convolution (DFConv) module and an efficient feature fusion structure to maximize the use of the internal feature information of objects. DFConv preserves small object information by preventing the mixing of object information with the background. The optimized feature fusion structure produces high-quality feature maps for efficient real-time small object detection while maximizing the use of limited information. Additionally, modifying the input data processing stage and reducing the detection layer to suit small object detection also contribute to performance improvement. Compared with the latest YOLO-based models (such as DCN-YOLO and YOLOv7), DCEF2-YOLO achieves a higher mAP: +6.1% on the DOTA-v1.0 test set, +0.3% on the NWPU VHR-10 test set, and +1.5% on the VEDAI512 test set. Furthermore, it runs at 120.48 FPS on an RTX 3090 GPU for 512 × 512 images, making it suitable for real-time small object detection tasks.
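The abstract does not describe the internals of the DFConv module, so the following is only a minimal sketch of the general deformable-convolution idea: each kernel tap samples the input at its regular grid position plus a 2D offset, using bilinear interpolation for fractional positions. The function names (bilinear, deform_conv_at), the single-channel input, and the hand-set offsets are assumptions made for illustration; in a trained network the offsets would be predicted by an additional convolution, and this is not the authors' implementation.

```python
import math

def bilinear(img, y, x):
    """Sample img (a list of rows) at fractional (y, x); pixels outside the image count as 0."""
    h, w = len(img), len(img[0])
    y0, x0 = math.floor(y), math.floor(x)
    total = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                total += weight * img[yy][xx]
    return total

def deform_conv_at(img, kernel, offsets, cy, cx):
    """Response of a 3x3 deformable convolution at one output location (cy, cx).

    offsets[i][j] is a hypothetical (dy, dx) shift for kernel tap (i, j).
    With all offsets at zero this reduces to an ordinary 3x3 convolution.
    """
    out = 0.0
    for i in range(3):
        for j in range(3):
            dy, dx = offsets[i][j]
            out += kernel[i][j] * bilinear(img, cy + i - 1 + dy, cx + j - 1 + dx)
    return out

# Toy 5x5 image whose value grows by 1 per column and by 5 per row.
img = [[r * 5 + c for c in range(5)] for r in range(5)]
ones = [[1.0] * 3 for _ in range(3)]
zero = [[(0.0, 0.0)] * 3 for _ in range(3)]   # regular sampling grid
shift = [[(0.0, 0.5)] * 3 for _ in range(3)]  # every tap moved half a pixel right

print(deform_conv_at(img, ones, zero, 2, 2))   # 108.0
print(deform_conv_at(img, ones, shift, 2, 2))  # 112.5
```

Because the offsets move each sampling point independently, the effective receptive field can follow an object's shape instead of a fixed square, which is the property the abstract credits with keeping small-object information from mixing with the background.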

Full Text