Abstract
With the rapid advancement of deep learning technology, significant achievements have been made in the field of general object detection. However, challenges still remain in the detection of tiny objects. There are two main drawbacks: (1) static prior information makes the localization of tiny objects relatively fixed; (2) Intersection over Union (IoU) is highly sensitive to deviations in tiny objects. To this end, we propose a Dynamic Gaussian Distribution Fitting and Imitation Learning-Based Label Assignment (DILA) strategy for Tiny Object Detection. Specifically, to address the positional deviation of effective receptive fields at different network layers during Gaussian modeling, DILA first designs an Adaptive Dynamic Calculation Strategy (ADCS) to compute estimation factors for the effective receptive field in different feature spaces, dynamically modeling prior information using Gaussian distribution. Then, DILA introduces a new Balance of Gaussian Scaling-aware Metric (BGSM) to measure the similarity between tiny bounding boxes and predefined anchors, instead of using IoU, which is highly sensitive to tiny pixel shifts, for sample assignment, thereby providing a more accurate basis for label assignment. Finally, a Detail Information Imitation Compensation Module (DIM) is presented to improve and compensate for the detailed information of tiny objects that troubles label assignment, achieving balanced learning for tiny objects. The proposed DILA strategy can be seamlessly integrated into various anchor-based detectors. Extensive experiments were conducted on three publicly available datasets for tiny object detection. The results indicate that when DILA is embedded into Faster RCNN, it outperforms other state-of-the-art methods in terms of detection performance for tiny objects, achieving an improvement in average precision of 10.3%, 1.5%, and 4.2% on AI-TOD, SODA-D, and VisDrone2019, respectively. The source codes and results are available at: https://github.com/chnu-cpl/DILA.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have