Abstract

Zero-shot sketch-based image retrieval (ZSSBIR) aims to retrieve natural images given free-hand sketches whose categories may not appear during training. Previous approaches relied on semantically aligned sketch-image pairs or memory-expensive fusion layers to project visual information into a low-dimensional subspace, ignoring the significant heterogeneous cross-domain discrepancy between highly abstract sketches and the corresponding images, which may yield poor performance during training. To tackle this issue, we propose a Wasserstein distance-based cross-modal semantic network (WAD-CMSN) for ZSSBIR. Specifically, it first projects the visual information of each branch (sketch, image) into a common low-dimensional semantic subspace via the Wasserstein distance in an adversarial training manner. Furthermore, a novel identity matching loss is employed to select useful features, which not only captures complete semantic knowledge but also alleviates over-fitting of the WAD-CMSN model. Experimental results on the challenging Sketchy (Extended) and TU-Berlin (Extended) datasets indicate the effectiveness of the proposed WAD-CMSN model over several competitors.
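To make the adversarial alignment step concrete, below is a minimal PyTorch sketch (not the authors' code) of Wasserstein distance-based alignment of sketch and image features in a shared low-dimensional semantic subspace, in the spirit of WGAN-GP. All module names, feature dimensions, batch sizes, and loss weights are assumptions for illustration; the paper's actual architecture and identity matching loss are not reproduced here.

```python
# Hypothetical sketch of Wasserstein adversarial alignment for ZSSBIR.
# Dimensions and hyperparameters below are illustrative assumptions.
import torch
import torch.nn as nn

EMB_DIM = 300    # assumed semantic subspace dimension (e.g., word-vector size)
FEAT_DIM = 2048  # assumed backbone feature dimension (e.g., CNN pool features)

class Projector(nn.Module):
    """Maps backbone features of one branch (sketch or image)
    into the shared semantic subspace."""
    def __init__(self, in_dim=FEAT_DIM, out_dim=EMB_DIM):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 512), nn.ReLU(),
            nn.Linear(512, out_dim),
        )
    def forward(self, x):
        return self.net(x)

class Critic(nn.Module):
    """Scores embeddings; its objective estimates the Wasserstein
    distance between the sketch and image embedding distributions."""
    def __init__(self, dim=EMB_DIM):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, 256), nn.LeakyReLU(0.2),
            nn.Linear(256, 1),
        )
    def forward(self, z):
        return self.net(z)

def gradient_penalty(critic, a, b):
    """WGAN-GP penalty enforcing the critic's 1-Lipschitz constraint."""
    alpha = torch.rand(a.size(0), 1, device=a.device)
    inter = (alpha * a + (1 - alpha) * b).requires_grad_(True)
    grad = torch.autograd.grad(critic(inter).sum(), inter, create_graph=True)[0]
    return ((grad.norm(2, dim=1) - 1) ** 2).mean()

sketch_proj, image_proj, critic = Projector(), Projector(), Critic()
opt_c = torch.optim.Adam(critic.parameters(), lr=1e-4)
opt_p = torch.optim.Adam(
    list(sketch_proj.parameters()) + list(image_proj.parameters()), lr=1e-4)

sketch_feat = torch.randn(32, FEAT_DIM)  # stand-in backbone features
image_feat = torch.randn(32, FEAT_DIM)

# Critic update: maximize the Wasserstein estimate between branches.
z_s = sketch_proj(sketch_feat).detach()
z_i = image_proj(image_feat).detach()
w_dist = critic(z_i).mean() - critic(z_s).mean()
loss_c = -w_dist + 10.0 * gradient_penalty(critic, z_i, z_s)
opt_c.zero_grad(); loss_c.backward(); opt_c.step()

# Projector update: minimize the estimated distance so sketch and image
# embeddings become indistinguishable in the shared subspace.
z_s, z_i = sketch_proj(sketch_feat), image_proj(image_feat)
loss_p = critic(z_i).mean() - critic(z_s).mean()
opt_p.zero_grad(); loss_p.backward(); opt_p.step()
```

The two-step loop alternates between training the critic to estimate the Wasserstein distance between the branch distributions and training the projectors to shrink it; the gradient penalty keeps the critic approximately 1-Lipschitz, which the Kantorovich-Rubinstein formulation of the Wasserstein distance requires.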
