Abstract

Unmanned Aerial Vehicle (UAV) has been widely applied for aerial object detection. Recently, this issue has become a research hot spot in the field of computer vision. Currently, the performance of conventional methods to detect small objects has reached a breakthrough point. Moreover, the dense distribution and large-scale variation of objects in aerial images significantly affect the detection accuracy. In order to resolve this problem, a novel structure based on the YOLOv3 is proposed in the present study. To this end, the backbone network is replaced by ResNet_vd50 to prevent information loss in the downsampling process. Then the backbone network is modified by deformable convolution to improve the detection ability of deformed objects. In this regard, SE (Squeeze-and-Excitation) attention is embedded in the ResNet_vd block to improve the expression ability of features. Furthermore, the Soft-NMS (Soft Non Maximum Suppression) algorithm is introduced for bounding box fusion to resolve the occlusion problem. Finally, the MixUp method is used in the data augmentation stage to enrich the background information by fusing different images. Based on the obtained results, it is concluded that the proposed method has higher accuracy in aerial images than state of art object detection methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.