Abstract
Few-shot object detection (FSOD), which aims to teach machines to detect objects belonging to novel classes via extremely few annotated data, has attracted extensive research interest. However, the performance of FSOD is still limited by the lack of data. Visual information of novel objects has significant intraclass variance under the few-shot setting, so single visual information cannot accurately represent the objects themselves. In contrast, humans are good at combining visual and semantic systems to recognize new concepts simultaneously. In this paper, we fully explore utilizing additional semantic knowledge to assist the FSOD task. Concretely, we first obtain the semantic representation of classes by the word embedding model learned from a large corpus of text. We then design a semantic enhancement (SE) module to enhance the incomprehensively visual representation of novel classes. To further improve the classification performance, we define a semantic prototype contrastive (SPC) loss to learn a more discriminative embedding space, where features to be detected belonging to the same class are compactly clustered around the corresponding semantic representation. Furthermore, we also introduce the semantic margin between different semantic representations for SPC loss to adaptively separate the margin between features belonging to different classes. Extensive experiments on the PASCAL VOC and MS-COCO benchmarks demonstrate that the proposed method achieves state-of-the-art performance.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.