Abstract

Vector quantization is one of the most popular techniques for approximate nearest neighbor (ANN) search. Over the past decade, many vector quantization methods have been proposed for ANN search. However, these methods do not strike a satisfactory balance between accuracy and efficiency because of their defects in quantization structures. To overcome this problem, a quantization method, named as Residual Vector Product Quantization (RVPQ), is proposed in this study. Under this method, the data space is decomposed into several subspaces, and a residual structure consisting of several ordered residual codebooks is constructed for each subspace. Learned by using an effective joint training algorithm, the quantization structure of RVPQ is much better than other methods, and it greatly enhances the performance of ANN search. Except that, an efficient residual quantization encoding method H-Variable Beam Search is also proposed to achieve higher encoding efficiency with negligible loss of accuracy. Furthermore, Inverted Multi-Index based on RVPQ is also designed to effectively solve the ANN search for a very large-scale database. Experimental results and theoretical evaluations show that RVPQ outperforms the-state-of-the-art methods on retrieval accuracy while retaining a comparable computational complexity.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call