Let All Be Whitened: Multi-Teacher Distillation for Efficient Visual Retrieval

Zhe Ma, Shouling Ji, Feng Qian, Sifeng He, Zonghui Wang, Zhenguang Liu, Xuhong Zhang, Lei Yang, Jianfeng Dong, Xiaobo Zhang

doi:10.1609/aaai.v38i5.28207

Abstract

Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model for efficient visual retrieval. Furthermore, we discover that the similarities obtained by different retrieval models are diversified and incommensurable, which makes it challenging to jointly distill knowledge from multiple models. Therefore, we propose to whiten the output of teacher models before fusion, which enables effective multi-teacher distillation for retrieval models. Whiten-MTD is conceptually simple and practically effective. Extensive experiments on two landmark image retrieval datasets and one video retrieval dataset demonstrate the effectiveness of our proposed method, and its good balance of retrieval performance and efficiency. Our source code is released at https://github.com/Maryeon/whiten_mtd.
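The whitening-before-fusion idea can be illustrated with a small sketch. This is not the authors' implementation (see the released repository for that). It assumes, purely for illustration, that "whitening" means PCA whitening of each teacher's embeddings and that "fusion" means averaging the resulting cosine-similarity matrices; the abstract confirms neither choice. All function names and the synthetic data are hypothetical.

```python
import numpy as np


def whiten(embeddings, eps=1e-6):
    """PCA-whiten row-vector embeddings (assumed reading of 'whitening').

    The output has zero mean and approximately identity covariance.
    """
    centered = embeddings - embeddings.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / (len(embeddings) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Rotate onto the principal axes and rescale each one to unit variance.
    return centered @ eigvecs / np.sqrt(eigvals + eps)


def cosine_similarity_matrix(x):
    """Pairwise cosine similarities between the rows of x."""
    x = x / np.linalg.norm(x, axis=1, keepdims=True)
    return x @ x.T


def fused_teacher_similarity(teacher_embeddings):
    """Average the similarity matrices of whitened teachers (assumed fusion rule)."""
    sims = [cosine_similarity_matrix(whiten(e)) for e in teacher_embeddings]
    return np.mean(sims, axis=0)


# Synthetic stand-ins for two teachers whose raw outputs differ in
# dimensionality, scale, and offset.
rng = np.random.default_rng(0)
teacher_a = rng.normal(size=(200, 8)) * 5.0 + 3.0
teacher_b = rng.normal(size=(200, 16)) * 0.1

# One fused 200 x 200 similarity matrix a student could be trained to match.
target = fused_teacher_similarity([teacher_a, teacher_b])
```

In this sketch, whitening removes each teacher's offset and per-direction scale before the similarities are computed, so the two matrices are on a comparable footing when averaged.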


Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 24, 2024
Citations: 1

Similar Papers

Neural models for information retrieval without labeled data
Hamed Zamani
ACM SIGIR Forum | VOL. 53
01 Dec 2019

Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation
Zhiqi Huang ... James Allan
27 Feb 2023

How to Visually Retrieve Images from the St. Andrews Collection Using GIFT
Henning Müller ... Antoine Geissbühler
01 Jan 2004

Cross-modal adapter for vision–language retrieval
Haojun Jiang ... Gao Huang
Pattern Recognition | VOL. 159
03 Nov 2024
