Detecting drones in infrared videos is highly desired in many realistic scenarios, e.g., unauthorized drone monitoring around airports. Nevertheless, automated drone detection is rather challenging when the targets appear as tiny objects (≤10×10 pixels) against complex backgrounds. Conventional object detection algorithms, which mainly use static visual features, can hardly distinguish tiny objects from undesired artefacts in complex backgrounds. To alleviate this problem, we learn from the early biological visual pathway (including the parvocellular and magnocellular pathways), which process static and motion information simultaneously. Therefore, we propose a magnocellular inspired method for video tiny-object detection (Magno-VTOD) that integrates both static and motion visual information. The Magno-VTOD firstly employs a retinal magnocellular computation model to extract the motion strength of moving objects. The motion responses are then used to enhance the areas of the flying tiny drones effectively and efficiently, thereby facilitating the subsequent target detection procedure. We implement the video tiny-object detection method based on the widely adopted deep neural networks guided by the magnocellular computation model. Experimental results obtained on the large-scale Anti-UAV dataset (304451 video frames) validate that the proposed Magno-VTOD method significantly outperforms the competing state-of-the-art object detection methods on the tiny drone detection task. Particularly, the AP value is increased by 15.4% for tiny object detection, and by 17.1%/13.7% against wood/mountain backgrounds.
Read full abstract