Generic compact representation through visual-semantic ambiguity removal

Yang Long,Yu Guan,Ling Shao

doi:10.1016/j.patrec.2018.04.024

Abstract

Zero-Shot Hashing (ZSH) aims to learn compact binary codes that can preserve semantic contents of the images from unseen categories. Conventional approaches project visual features to a semantic space that is shared by both seen and unseen categories. However, we observe that such a one-way paradigm suffers from the visual-semantic ambiguity problem. Namely, the semantic concepts (e.g. attributes) cannot explicitly correspond to visual patterns, and vice versa. Such a problem can lead to a huge variance in the visual features for each attribute. In this paper, we investigate how to remove such semantic ambiguity based on the observed visual appearances. In particular, we propose (1) a novel latent attribute space to mitigate the gap between visual appearances and semantic expressions; (2) a dual-graph regularised embedding algorithm called Visual-Semantic Ambiguity Removal (VSAR) that can simultaneously extract the shared components between visual and semantic information and mutually align the data distribution based on the intrinsic local structures of both spaces; (3) a new zero-shot hashing framework that can deal with both instance-level and category-level tasks. We validate our method on four popular benchmarks. Extensive experiments demonstrate that our proposed approach significantly performs the state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generic compact representation through visual-semantic ambiguity removal

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Similar Papers

Attribute Embedding with Visual-Semantic Ambiguity Removal for Zero-shot Learning
Yang Long ... Ling Shao
-
Yang Long, et. al.Yang Long ... Ling Shao
01 Jan 2015
01 Jan 2015

The scope and limits of fine-grained image and category information in the ventral visual pathway.
Markus W Badwal ... Martin N Hebart
The Journal of neuroscience : the official journal of the Society for Neuroscience | VOL. -
Markus W Badwal, et. al.Markus W Badwal ... Martin N Hebart
06 Nov 2024
The Journal of neuroscience : the official journal of the Society for Neuroscience | VOL. -

Assessing multiscale visual appearance characteristics of neighbourhoods using geographically weighted principal component analysis in Shenzhen, China
Chao Wu ... Jinmeng Rao
Computers, Environment and Urban Systems | VOL. 84
Chao Wu, et. al.Chao Wu ... Jinmeng Rao
30 Sep 2020
Computers, Environment and Urban Systems | VOL. 84

Enhancing zero-shot object detection with external knowledge-guided robust contrast learning
Lijuan Duan ... Bian Ma
Pattern Recognition Letters | VOL. 185
Lijuan Duan, et. al.Lijuan Duan ... Bian Ma
05 Aug 2024
Pattern Recognition Letters | VOL. 185

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generic compact representation through visual-semantic ambiguity removal

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters