Abstract

Multi-label image classification is a fundamental task in aerial image processing, which automatically generates image annotations for better image content interpretation. Many existing methods realize multi-label classification through an image level, while they ignore the dependencies among labels and the cross-modal relations between labels and image features. In this paper, we propose a simple and intuitive multi-label classification method via adjacency-based label and feature co-embedding for aerial images. To be specific, we introduce an adjacency-based label embedding module to maintain the original label relationships in the semantic space. A label and feature co-embedding module is designed to enhance the text-image cross-modal interactions and to obtain the attention-based label-specific vectors, which effectively excavate the response relations between labels and images. Experiments on two benchmark aerial image multi-label datasets show that our approach achieves considerable performance compared with seven previous approaches. Besides, visualization analyses indicate the label embeddings learned by our model maintain a meaningful semantic topology, which explicitly exploit label-feature dependencies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.