Abstract

Due to the broad usage and widespread popularity of drones, the demand for a more accurate object detection algorithm for images captured by drone platforms has become increasingly urgent. This article addresses this issue by first analyzing the unique characteristics of datasets related to drones. We then select the widely used YOLOv7 algorithm as the foundation and conduct a comprehensive analysis of its limitations, proposing a targeted solution. In order to enhance the network’s ability to extract features from small objects, we introduce non-strided convolution modules and integrate modules that utilize attention mechanism principles into the baseline network. Additionally, we improve the semantic information expression for small targets by optimizing the feature fusion process in the network. During training, we adopt the latest Lion optimizer and MPDIoU loss to further boost the overall performance of the network. The improved network achieves impressive results, with mAP50 scores of 56.8% and 94.6% on the VisDrone2019 and NWPU VHR-10 datasets, respectively, particularly in detecting small objects.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call