Abstract

Zero-shot sketch-based image retrieval (ZS-SBIR) is a cross-modal retrieval task that searches natural images with free-hand sketches under the zero-shot scenario. Most previous methods project sketch and image features into a low-dimensional common space for efficient retrieval and, at the same time, align the projected features to their semantic features (e.g., category-level word vectors) in order to transfer knowledge from seen to unseen classes. However, because the projection and the alignment are coupled, the alignment is often insufficient, which in turn leads to unsatisfactory zero-shot retrieval performance. To address this issue, we propose a novel progressive cross-modal semantic network. Specifically, it first explicitly aligns the sketch and image features to semantic features, and then projects the aligned features into a common space for subsequent retrieval. We further employ a cross-reconstruction loss to encourage the aligned features to capture complete knowledge of both modalities, along with a multi-modal Euclidean loss that keeps the retrieval features of a sketch-image pair close. Extensive experiments on two popular large-scale datasets demonstrate that our approach outperforms state-of-the-art competitors by a remarkable margin: more than 3% on the Sketchy dataset and about 6% on the TU-Berlin dataset in terms of retrieval accuracy.
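
To make the align-then-project idea concrete, the following is a minimal PyTorch-style sketch of the pipeline and loss combination described above. All layer types, feature dimensions, and loss weights are illustrative assumptions, not the authors' exact configuration.

import torch
import torch.nn as nn
import torch.nn.functional as F

class AlignThenProject(nn.Module):
    """Hypothetical module: align each modality to the semantic space first,
    then project the aligned features into a common retrieval space."""
    def __init__(self, feat_dim=2048, sem_dim=300, common_dim=64):
        super().__init__()
        # Modality-specific alignment into the semantic (word-vector) space.
        self.align_sketch = nn.Linear(feat_dim, sem_dim)
        self.align_image = nn.Linear(feat_dim, sem_dim)
        # Shared projection from the aligned space to the common retrieval space.
        self.project = nn.Linear(sem_dim, common_dim)
        # Cross-reconstruction decoders: aligned sketch features reconstruct
        # image features and vice versa.
        self.dec_sketch2image = nn.Linear(sem_dim, feat_dim)
        self.dec_image2sketch = nn.Linear(sem_dim, feat_dim)

    def forward(self, sketch_feat, image_feat):
        a_s = self.align_sketch(sketch_feat)   # aligned sketch features
        a_i = self.align_image(image_feat)     # aligned image features
        r_s = self.project(a_s)                # sketch retrieval features
        r_i = self.project(a_i)                # image retrieval features
        return a_s, a_i, r_s, r_i

def training_losses(model, sketch_feat, image_feat, word_vec):
    """Assumed loss combination for a batch of sketch-image pairs sharing
    category word vectors `word_vec` (equal weights are placeholders)."""
    a_s, a_i, r_s, r_i = model(sketch_feat, image_feat)
    # 1) Explicit semantic alignment of both modalities.
    loss_align = F.mse_loss(a_s, word_vec) + F.mse_loss(a_i, word_vec)
    # 2) Cross-reconstruction: each aligned feature reconstructs the other
    #    modality's backbone feature, so it retains knowledge of both modalities.
    loss_cross = (F.mse_loss(model.dec_sketch2image(a_s), image_feat) +
                  F.mse_loss(model.dec_image2sketch(a_i), sketch_feat))
    # 3) Multi-modal Euclidean loss: retrieval features of a paired
    #    sketch and image should be close.
    loss_euclid = F.mse_loss(r_s, r_i)
    return loss_align + loss_cross + loss_euclid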
