Abstract

As a research hotspot of computer vision, crowd counting methods have achieved success in natural images. But crowd counting in aerial images are rarely explored, and existing methods do not perform well because of the higher resolution, smaller object scale and more complex scene. Therefore, this paper proposes a lightweight dual-task network (LDNet) for crowd counting, which only uses bifurcated structure to overcome these new challenges in aerial images without complicated pipelines. To realize this, a complete but efficient Guidance Branch is proposed to assist Counting Branch in fitting crowd distribution. Furthermore, a scene attention mechanism is used to consider the complex scene information, which are never considered by existing methods. Our LD-Net outperforms existing methods on aerial crowd counting dataset (Visdrone), and gets better or comparable results on natural crowd counting datasets (UCF_CC_50, UCF_QNRF, ShanghaiTech Part A).

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call