Abstract

Recently, gigapixel photography has been developed considerably and gradually put into remote sensing, video surveillance, etc. Gigapixel images have a visible field of view area at the square-kilometer level (containing thousands of targets) and up to 100 times the scale variation. Among them, the differences in target pose, scale, and occlusion are huge, and most existing target detection algorithms cannot directly process them. To solve these problems, we propose a new multi-target pedestrian and vehicle detector PVDet (Towards Pedestrian and Vehicle Detection on Gigapixel-level images) for gigapixel-level images. First, the DPRNet (Deformable deeP Residual Network) is designed as the backbone network to enhance the effective perceptual field and improve the feature representation of pose varying and occluded targets. Then, the PAFPN (Path Aggregation Feature Pyramid Network) is adopted to process the multi-scale features extracted by the backbone, boosting the multi-scale target modeling capability and the localization of small targets. Finally, the DyHead module is introduced to enhance the detection head’s scale, spatial and task awareness, further optimizing pedestrian and vehicle classification and localization. Compared with other State-of-the-Art methods on the PANDA dataset, the experimental results show that the proposed method dramatically improves AP of pedestrian and vehicle detection in gigapixel-level images by 10.4 AP over baseline, which is better than the existing target detection algorithms. We also conducted experiments on the PASCAL VOC 2012 dataset to further demonstrate the generalization capability and effectiveness of the proposed method.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.