With the development of artificial intelligence technology and Unmanned Aerial Vehicle (UAV) technology, the traditional UAV target detection methods are difficult to achieve high detection accuracy in complex and changeable scenes. The paper selects Faster Region Convolutional Neural Network (Faster R-CNN) as the basic algorithm due to its high scalability and excellent classification performance. The convolution mode of the convolution layer in the network structure is adjusted to deformable convolution. Additionally, a hybrid attention mechanism module is added to combine channel attention and spatial attention modules behind the output layer of the Faster R-CNN network. Finally, the Soft Non-Maximum Suppression (Soft-NMS) method is selected to combine the improved Faster R-CNN algorithm of multiple individuals into the final target detection model. The performance of the improved Faster R-CNN algorithm and the original Faster R-CNN algorithm was verified through the VisDrone 2018-DET dataset and the full class average accuracy Mean Average Precision (mAP), accuracy and precision. The accuracy and logarithm of loss values of the improved Faster R-CNN algorithm and the original Faster R-CNN algorithm's Region Proposal Network (RPN) were 0.985, 0.981, 0.018, and 0.052, respectively. The target detection model based on the hybrid attention mechanism of ResNet-50 network fusion Spartial Attention Module (SAM) and Channel Attention Module (CAM) had the best classification performance, with the accuracy, precision and overall average accuracy of 25.8 %, 24.6 % and 22.7 %, respectively. The results are helpful to improve the target detection ability of UAV in complex environment, and contribute to the development of target detection technology in the future.
Read full abstract