Abstract

The current advances in object detection depend on large-scale datasets to get good performance. However, there may not always be sufficient samples in many scenarios, resulting in the performance degradation of the current deep learning based object detection models. To overcome this problem, we propose a novel one-shot conditional detection framework (OSCD). Given a support image of the target object and a query image as input, OSCD can detect all objects belonging to the target object category in the query image. Specifically, OSCD is composed of a Siamese network and a two-stage detection model. In each stage of the two-stage detection pipeline, a feature fusion module and a learnable metric module are designed for effective conditional detection respectively. Once trained, OSCD can detect objects of both seen and unseen classes without further training, which also has advantages including class-agnostic, training-free for unseen classes, and without catastrophic forgetting. Experiments show that the proposed approach achieves state-of-the-art performance on the proposed datasets based on Fashion-MNIST and Pascal VOC.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.