Compact Binary Codes Research Articles

Fast person re-identification (ReID) aims to search person images quickly and accurately. The main idea of recent fast ReID methods is the hashing algorithm, which learns compact binary codes and performs fast Hamming distance and counting sort. However, a very long code is needed for high accuracy (e.g., 2048), which compromises search speed. In this work, we introduce a new solution for fast ReID by formulating a novel Coarse-to-Fine (CtF) hashing code search strategy, which complementarily uses short and long codes, achieving both faster speed and better accuracy. It uses shorter codes to coarsely rank broad matching similarities and longer codes to refine only a few top candidates for more accurate instance ReID. Specifically, we design an All-in-One (AiO) module together with a Distance Threshold Optimization (DTO) algorithm. In AiO, we simultaneously learn and enhance multiple codes of different lengths in a single model. It learns multiple codes in a pyramid structure, and encourage shorter codes to mimic longer codes by self-distillation. DTO solves a complex threshold search problem by a simple optimization process, and the balance between accuracy and speed is easily controlled by a single parameter. It formulates the optimization target as a Fβ score that can be optimised by Gaussian cumulative distribution functions. Besides, we find even short code (e.g., 32) still takes a long time under large-scale gallery due to the O(n) time complexity. To solve the problem, we propose a gallery-size-free latent-attributes-based One-Shot-Filter (OSF) strategy, that is always O(1) time complexity, to quickly filter major easy negative gallery images, Specifically, we design a Latent-Attribute-Learning (LAL) module supervised a Single-Direction-Metric (SDM) Loss. LAL is derived from principal component analysis (PCA) that keeps largest variance using shortest feature vector, meanwhile enabling batch and end-to-end learning. Every logit of a feature vector represents a meaningful attribute. SDM is carefully designed for fine-grained attribute supervision, outperforming common metrics such as Euclidean and Cosine metrics. Experimental results on 2 datasets show that CtF+OSF is not only 2% more accurate but also 5× faster than contemporary hashing ReID methods. Compared with non-hashing ReID methods, CtF is 50× faster with comparable accuracy. OSF further speeds CtF by 2× again and upto 10× in total with almost no accuracy drop.

Read full abstract

Deep hashing has great potential in large-scale visual similarity search due to its preferable efficiency in storage and computation. Technically, deep hashing for visual similarity search inherits the powerful representation capability of deep neural networks, and it encodes visual features into compact binary codes by preserving representative semantic visual features. Works in this field mainly focus on building the relationship between the visual and objective hash spaces, while they seldom study the triadic cross-domain semantic knowledge transfer among visual, semantic, and hashing spaces, leading to a serious semantic ignorance problem during space transformation. In this article, we propose a novel deep tripartite semantically interactive hashing framework, dubbed Semantically Cycle-consistent Hashing Networks (SCHNs), for discriminative hash code learning. Particularly, we construct a flexible semantic space and a transitive latent space, in conjunction with the visual space, to jointly deduce the privileged discriminative hash space. Specifically, a new semantic space is conceived to strengthen the flexibility and completeness of categories in the semantic feature inference phase. At the same time, a transitive latent space is formulated to explore and uncover the shared semantic interactivity embedded in visual and semantic features. Moreover, to further ensure semantic consistency across multiple spaces, we propose to build a cyclic adversarial learning module to preserve and keep their semantic concurrence during space transformation. Notably, our SCHN, for the first time, establishes the cyclic principle of deep semantic-preserving hashing by adaptive semantic parsing across different spaces in a single-modal visual similarity search. In addition, the entire learning framework is jointly optimized in an end-to-end manner. Extensive experiments performed on diverse large-scale datasets evidence the superiority of our method against other state-of-the-art deep hashing algorithms. The source codes of this article are available at https://github.com/JalinWang/SCHN.

Read full abstract

Compact Binary Codes Research Articles

Related Topics

Articles published on Compact Binary Codes

Wasm-R3: Record-Reduce-Replay for Realistic and Standalone WebAssembly Benchmarks

Graph-Collaborated Auto-Encoder Hashing for Multiview Binary Clustering.

Faster Person Re-Identification: One-Shot-Filter and Coarse-to-Fine Search.

An improved deep hashing model for image retrieval with binary code similarities

Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey

Deep hashing image retrieval based on hybrid neural network and optimized metric learning

Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval

Similarity preserving hashing for appliance identification based on V-I trajectory

Cover Image

EDMH: Efficient discrete matrix factorization hashing for multi-modal similarity retrieval

Deep quantization network with visual-semantic alignment for zero-shot image retrieval

RelaHash: Deep Hashing With Relative Position

Hybrid-attention based Feature-reconstructive Adversarial Hashing Networks for Cross-modal Retrieval

Hadamard matrix-guided multi-modal hashing for multi-modal retrieval

Contrastive hashing with vision transformer for image retrieval

Binary multi-modal matrix factorization for fast item cold-start recommendation

Satellite image search in AgoraEO

Leveraging Deep Features Enhance and Semantic-Preserving Hashing for Image Retrieval

Deep Contrastive Self-Supervised Hashing for Remote Sensing Image Retrieval

Discriminative Visual Similarity Search with Semantically Cycle-consistent Hashing Networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Compact Binary Codes Research Articles

Related Topics

Articles published on Compact Binary Codes

Wasm-R3: Record-Reduce-Replay for Realistic and Standalone WebAssembly Benchmarks

Graph-Collaborated Auto-Encoder Hashing for Multiview Binary Clustering.

Faster Person Re-Identification: One-Shot-Filter and Coarse-to-Fine Search.

An improved deep hashing model for image retrieval with binary code similarities

Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey

Deep hashing image retrieval based on hybrid neural network and optimized metric learning

Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval

Similarity preserving hashing for appliance identification based on V-I trajectory

Cover Image

EDMH: Efficient discrete matrix factorization hashing for multi-modal similarity retrieval

Deep quantization network with visual-semantic alignment for zero-shot image retrieval

RelaHash: Deep Hashing With Relative Position

Hybrid-attention based Feature-reconstructive Adversarial Hashing Networks for Cross-modal Retrieval

Hadamard matrix-guided multi-modal hashing for multi-modal retrieval

Contrastive hashing with vision transformer for image retrieval

Binary multi-modal matrix factorization for fast item cold-start recommendation

Satellite image search in AgoraEO

Leveraging Deep Features Enhance and Semantic-Preserving Hashing for Image Retrieval

Deep Contrastive Self-Supervised Hashing for Remote Sensing Image Retrieval

Discriminative Visual Similarity Search with Semantically Cycle-consistent Hashing Networks