Learning to Embed Semantic Similarity for Joint Image-Text Retrieval.

Noam Malali, Yosi Keller

doi:10.1109/tpami.2021.3132163

Open Access

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication Date: Dec 1, 2022
Citations: 7

Affiliation: Bar-Ilan University

Abstract

We present a deep learning approach for learning joint semantic embeddings of images and captions in a Euclidean space, such that semantic similarity is approximated by the L2 distances in the embedding space. To that end, we introduce a metric learning scheme that uses multitask learning to learn the embedding of identical semantic concepts with a center loss. By introducing a differentiable quantization scheme into the end-to-end trainable network, we derive a semantic embedding of semantically similar concepts in Euclidean space. We also propose a novel metric learning formulation using an adaptive margin hinge loss that is refined during the training phase. The proposed scheme was applied to the MS-COCO, Flickr30K and Flickr8K datasets, and was shown to compare favorably with contemporary state-of-the-art approaches.
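The two loss terms named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' formulation: the abstract does not give the adaptive margin rule, the quantization scheme, or the network, so the sketch assumes a fixed `margin` parameter as a stand-in for the adaptive one, and the function names (`pairwise_l2`, `hinge_ranking_loss`, `center_loss`) are hypothetical. It only shows how a hinge loss over L2 distances and a center loss are typically computed for matched image and caption embeddings.

```python
import numpy as np


def pairwise_l2(a, b):
    """Euclidean distance between every row of a (n, d) and every row of b (m, d)."""
    diff = a[:, None, :] - b[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def hinge_ranking_loss(img, txt, margin=0.2):
    """Bidirectional hinge loss over L2 distances; row i of img matches row i of txt.

    Assumption: `margin` is a fixed constant here. The paper refines the margin
    during training, but the abstract does not say how.
    """
    d = pairwise_l2(img, txt)      # d[i, j] = ||img_i - txt_j||
    pos = np.diag(d)               # distances of the matching pairs
    # Penalize any non-matching pair that is not at least `margin` farther than the match.
    cost_i2t = np.maximum(0.0, margin + pos[:, None] - d)  # image i against caption j
    cost_t2i = np.maximum(0.0, margin + pos[None, :] - d)  # caption j against image i
    off_diag = ~np.eye(len(d), dtype=bool)                 # skip the matching pairs themselves
    return (cost_i2t[off_diag].sum() + cost_t2i[off_diag].sum()) / len(d)


def center_loss(emb, labels, centers):
    """Half the mean squared distance of each embedding to the center of its concept."""
    return 0.5 * ((emb - centers[labels]) ** 2).sum(axis=1).mean()


if __name__ == "__main__":
    img = np.array([[0.0, 0.0], [3.0, 4.0]])
    txt = np.array([[0.0, 0.0], [3.0, 4.0]])
    print(hinge_ranking_loss(img, txt, margin=0.2))
    print(center_loss(img, np.array([0, 1]), np.zeros((2, 2))))
```

In a trainable setting the same expressions would be written in an autodiff framework and the margin updated during training; the fixed value above is only a placeholder for that refinement.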
