SketchTrans: Disentangled Prototype Learning With Transformer for Sketch-Photo Recognition.

Cuiqun Chen, Meibin Qi, Bo Du, Mang Ye

doi:10.1109/tpami.2023.3337005

Abstract

Matching hand-drawn sketches with photos (a.k.a. sketch-photo recognition or re-identification) faces an information asymmetry challenge due to the abstract nature of the sketch modality. Existing works tend to learn shared embedding spaces with CNN models, either by discarding the appearance cues of photo images or by introducing GANs for sketch-photo synthesis. The former unavoidably loses discriminability, while the latter introduces ineffaceable generation noise. In this paper, we make the first attempt to design an information-aligned sketch transformer (SketchTrans+) via cross-modal disentangled prototype learning, building on the great promise the transformer has shown for discriminative visual modelling. Specifically, we design an asymmetric disentanglement scheme with a dynamically updatable auxiliary sketch (A-sketch) to align the modality representations without sacrificing information. The asymmetric disentanglement decomposes the photo representations into sketch-relevant and sketch-irrelevant cues, transferring sketch-irrelevant knowledge into the sketch modality to compensate for the missing information. Moreover, considering the feature discrepancy between the two modalities, we present a modality-aware prototype contrastive learning method that mines representative modality-sharing information using modality-aware prototypes rather than the original feature representations. Extensive experiments on category- and instance-level sketch-based datasets validate the superiority of our proposed method under various metrics.
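The abstract does not state how the prototype contrastive objective is computed, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' formulation. It assumes a prototype is the mean feature vector of a class within one modality, and it swaps in a standard InfoNCE-style loss over cosine similarities. The function names (`class_prototypes`, `prototype_contrastive_loss`), the `temperature` value, and the toy features are all hypothetical.

```python
import math


def class_prototypes(features, labels):
    """Average the feature vectors of each class into one prototype per class.

    Assumption: a prototype is a plain per-class mean within one modality.
    """
    sums, counts = {}, {}
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def prototype_contrastive_loss(sketch_protos, photo_protos, temperature=0.1):
    """InfoNCE-style loss over prototypes (an assumed stand-in objective).

    Each sketch prototype is pulled toward the photo prototype of the same
    class and pushed away from the photo prototypes of all other classes.
    """
    classes = sorted(set(sketch_protos) & set(photo_protos))
    total = 0.0
    for c in classes:
        logits = [
            cosine(sketch_protos[c], photo_protos[k]) / temperature
            for k in classes
        ]
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(v - m) for v in logits))
        total += log_denominator - logits[classes.index(c)]
    return total / len(classes)


# Toy features (hypothetical): two classes, 2-D embeddings per modality.
sketch_protos = class_prototypes(
    [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]], ["cat", "cat", "dog"]
)
photo_protos = class_prototypes([[0.8, 0.2], [0.1, 0.9]], ["cat", "dog"])
loss = prototype_contrastive_loss(sketch_protos, photo_protos)
print(loss)
```

Comparing prototypes instead of individual features means the loss depends on one averaged vector per class and modality, which is one plausible reading of "modality-aware prototypes rather than the original feature representations".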


Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication Date: May 1, 2024
Citations: 4

Similar Papers

Learning Inference Models for Computer Vision
31 Aug 2017

Evaluation of Five Deep Learning Models for Crop Type Mapping Using Sentinel-2 Time Series Images with Missing Information
Hongwei Zhao ... Louis Reymondin
Remote Sensing | VOL. 13
15 Jul 2021

Human Activity Recognition based on WaveNet
Tingting Hao
14 Jun 2021

A Simplified Framework for Zero-shot Cross-Modal Sketch Data Retrieval
Ushasi Chaudhuri ... Biplab Banerjee
01 Jun 2020
