Abstract

Finding tiny persons under the drone vision was, is and remains to be an integral and challenging task. Unmanned Aerial Vehicles (UAVs) with high-speed, low-altitude, and multi-perspective flight bring about violently various scales of objects, which burdens the optimization of models. Moreover, the detection performance of densely and faintly discernible person characteristics is far less than that of large objects in high-resolution aerial images. In this paper, we introduce the image cropping strategy and attention mechanism based on YOLOv5 to address small person detection in the optimized VisDrone2019 dataset. Specifically, we propose a Densely Cropped and Local Attention of object detector Network (DCLANet), which is inspired by the observation that less area occupied by small objects should be fully focused and relatively magnified in the original image. DCLANet assembled Density Map Guided Object Detection (DMNet) in Aerial Images and You Only Look Twice (YOLT): Rapid Multi-Scale Object Detection In Satellite Imagery to crop images upon training and testing stage, meanwhile, added Bottleneck Attention Mechanism (BAM) to YOLOv5 baseline framework, which more focus on person objects other than irrelevant categories. To achieve further improvement of DCLANet, we also provide bags of useful strategies: data augmentation, label fusion, category filtering and hyperparameter evolution. Extensive experiments on the VisDrone2019 show that DCLANet achieves state-of-the-art performance, the detection result of person category <formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex>$A P^{\rm{val}}$</tex></formula> @0.5 is 50.04% with test-dev subset, which is substantially better than the previous SOTA method(DPNetV3) by 12.01%. In addition, on our optimized VisDrone2019 dataset, <formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex>$A P^{\rm{val}}$</tex></formula> @0.5 and A P test@0.5 obtained 74.95%, 62.18%, respectively. Compared to YOLOv5, DCLANet improves 3.8% or so, which is encouraging and competitve.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.