OSCD: A one-shot conditional object detection framework

Kun Fu, Tengfei Zhang, Yue Zhang, Xian Sun

doi:10.1016/j.neucom.2020.04.092

Abstract

Recent advances in object detection depend on large-scale datasets to achieve good performance. In many scenarios, however, sufficient samples are not available, and the performance of current deep learning based object detection models degrades as a result. To overcome this problem, we propose a novel one-shot conditional detection framework (OSCD). Given a support image of the target object and a query image as input, OSCD detects all objects in the query image that belong to the target object category. Specifically, OSCD is composed of a Siamese network and a two-stage detection model. Each stage of the two-stage detection pipeline contains a feature fusion module and a learnable metric module designed for effective conditional detection. Once trained, OSCD can detect objects of both seen and unseen classes without further training. The framework is therefore class-agnostic, requires no training for unseen classes, and does not suffer from catastrophic forgetting. Experiments show that the proposed approach achieves state-of-the-art performance on the proposed datasets based on Fashion-MNIST and Pascal VOC.
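The data flow described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not specify how the fusion or metric modules work, so the pooling-and-concatenation fusion, the two-layer scoring network, and every name below (`embed`, `fuse`, `metric_score`, and the weight arrays) are assumptions made for illustration only. A single linear map with ReLU stands in for the shared Siamese backbone.

```python
import numpy as np

def embed(image, weights):
    """Shared (Siamese) embedding: the same weights process any input image.
    image: (H, W, C_in); weights: (C_in, C_out). Stand-in for a CNN backbone."""
    return np.maximum(image @ weights, 0.0)  # per-pixel linear map + ReLU

def fuse(query_feat, support_feat):
    """Condition the query feature map on the support image.
    Assumption: average-pool the support features, tile them, and concatenate."""
    support_vec = support_feat.mean(axis=(0, 1))             # (C,)
    h, w, _ = query_feat.shape
    tiled = np.broadcast_to(support_vec, (h, w, support_vec.size))
    return np.concatenate([query_feat, tiled], axis=-1)      # (H, W, 2C)

def metric_score(fused, w1, w2):
    """Hypothetical learnable metric: a small MLP that maps each fused
    location to a similarity score via a sigmoid."""
    hidden = np.maximum(fused @ w1, 0.0)
    logits = hidden @ w2
    return 1.0 / (1.0 + np.exp(-logits[..., 0]))

rng = np.random.default_rng(0)
backbone = rng.normal(size=(3, 8))   # shared by both branches
w1 = rng.normal(size=(16, 4))        # metric module parameters (random here)
w2 = rng.normal(size=(4, 1))

support = rng.random((5, 5, 3))      # crop of the target object
query = rng.random((12, 10, 3))      # image to search

query_feat = embed(query, backbone)
support_feat = embed(support, backbone)
fused = fuse(query_feat, support_feat)
scores = metric_score(fused, w1, w2)  # one similarity score per query location
```

Because the score depends only on the comparison between support and query features, and not on a fixed set of class outputs, a sketch of this kind accepts a support image from any category at inference time, which is the property the abstract refers to as class-agnostic.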

Full Text