Abstract

Recently, hashing methods which try to solve similarity-preserving approximate nearest search problem have obtained widely applications in various fields such as content-based image retrieval, object recognition and pose estimation. However, how to learn effective hash codes to describe the similarities in the large-scale database still remains as a NP-hard problem. Besides, a fatal problem lying in the existing hashing methods is that they usually threshold the real values to binary codes using single-bit quantization (SBQ) at the highest point density, which may destroy the data structure seriously. Due to this problem, double-bit quantization (DBQ) is proposed to solve the problem of SBQ by adaptively learning thresholds to quantize the real values to two bits, and achieves impressive results. However, one problem in DBQ is that it neglects the amount of the information contained in different data dimensions. In this paper, we propose a multi-bit quantization method based on bit allocation to quantize each projected dimension with variable bit numbers. Besides, different from existing methods of choosing threshold, we propose an incomplete coding manner by clustering to generate binary codes. Experiments on two large datasets demonstrate the feasibility of our method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call