Abstract

Zero-shot detection (ZSD) is a hot topic, which can detect unseen object without training on it. This new approach rises several challenges, e.g., the imbalance between positive and negative data, ambiguity between background and unseen classes, and how to align visual feature and semantic concept. In this paper, we apply an end-to-end zero-shot network to fulfil ZSD, where polarity loss is utilized to train detection model. polarity loss puts more importance on difficult data to reduce class imbalance and can maximize the difference between positive and negative predictions. Then, we use a semantic embedding method that use word vectors in both classification and regression subnets to better align visual features and semantic features. Finally, we use MS-COCO dataset as seen classes to train detection model, then we transfer the model for our own to detect warship object. Our model performs good on unseen classes and test accuracy of using polarity loss is apparently higher than that of using focal loss.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call