Abstract

High-dimensional indexing methods have been proved quite useful for response time improvement. Based on Euclidian distance, many of them have been proposed for applications where data vectors are high-dimensional. However, these methods do not generally support efficiently similarity search when dealing with heterogeneous data vectors. In this paper, we propose a high-dimensional indexing method (KRA +-Blocks) as an extension of the region approximation approach to the kernel space. KRA +-Blocks combines nonlinear dimensionality reduction technique (KPCA) with region approximation approach to map data vectors into a reduced feature space. The created feature space is then used, on one hand to approximate regions, and on the other hand to provide an effective kernel distances for both filtering process and similarity measurement. In this way, the proposed approach achieves high performances in response time and in precision when dealing with high-dimensional and heterogeneous vectors.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call