InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images

Konstantin Kobs, Michael Steininger, Andreas Hotho

doi:10.1109/wacv56688.2023.00112

Abstract

Common Deep Metric Learning (DML) datasets specify only one notion of similarity, e.g., two images in the Cars196 dataset are deemed similar if they show the same car model. We argue that, depending on the application, users of image retrieval systems have different and changing similarity notions that should be incorporated as easily as possible. Therefore, we present Language-Guided Zero-Shot Deep Metric Learning (LanZ-DML) as a new DML setting in which users control which properties matter for image representations using only natural language, without any training data. To this end, we propose InDiReCT (Image representations using Dimensionality Reduction on CLIP embedded Texts), a model for LanZ-DML on images that is trained exclusively on a few text prompts. InDiReCT uses CLIP as a fixed feature extractor for images and texts and transfers the variation in text prompt embeddings to the image embedding space. Extensive experiments on five datasets and thirteen similarity notions in total show that, despite not seeing any images during training, InDiReCT performs better than strong baselines and approaches the performance of fully supervised models. An analysis reveals that InDiReCT learns to focus on regions of the image that correlate with the desired similarity notion, which makes it a fast-to-train and easy-to-use method for creating custom embedding spaces using only natural language.
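The idea of transferring the variation in text prompt embeddings to the image embedding space can be illustrated with a minimal sketch. The abstract does not state which dimensionality reduction InDiReCT uses or how it is trained, so the sketch below substitutes plain PCA (computed via SVD) as a stand-in. Random vectors replace real CLIP embeddings, and the names `fit_text_projection` and `embed_images`, the embedding size of 512, and the choice to center images with the text mean are all assumptions made for illustration.

```python
import numpy as np


def fit_text_projection(text_emb, n_components):
    """Fit a linear projection on text embeddings only (PCA stand-in).

    text_emb: array of shape (n_prompts, dim), e.g. CLIP text embeddings
    of prompts that vary in the desired similarity notion.
    Returns the text mean and the top principal directions.
    """
    # L2-normalize so every prompt contributes equally.
    text_emb = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    mean = text_emb.mean(axis=0)
    # Rows of vt are orthonormal directions sorted by explained variance.
    _, _, vt = np.linalg.svd(text_emb - mean, full_matrices=False)
    return mean, vt[:n_components]


def embed_images(image_emb, mean, components):
    """Apply the text-derived projection to image embeddings."""
    image_emb = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    # Assumption: images are centered with the text mean before projecting.
    reduced = (image_emb - mean) @ components.T
    # Normalize again so cosine similarity reduces to a dot product.
    return reduced / np.linalg.norm(reduced, axis=1, keepdims=True)


# Placeholder data standing in for CLIP outputs (hypothetical sizes).
rng = np.random.default_rng(0)
text_embeddings = rng.normal(size=(20, 512))   # 20 text prompts
image_embeddings = rng.normal(size=(5, 512))   # 5 images

mean, components = fit_text_projection(text_embeddings, n_components=8)
custom_embeddings = embed_images(image_embeddings, mean, components)

# Pairwise cosine similarities in the custom embedding space.
similarities = custom_embeddings @ custom_embeddings.T
```

In this sketch no image is involved in fitting the projection, which mirrors the zero-shot setting described above; only the final step touches image embeddings.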

