Abstract

For a set of $n$ points in $\mathbb{R}^d$, and parameters $k$ and $\varepsilon$, we present a data structure that answers $(1+\varepsilon,k)$ approximate nearest neighbor queries in logarithmic time. Surprisingly, the space used by the data structure is $\widetilde{O}(n /k)$, where the $\widetilde{O}(\cdot)$ notation here hides terms that are exponential in $d$, roughly varying as $1/\varepsilon^d$; as such, the space used is sublinear in the input size if $k$ is sufficiently large. Our approach provides a novel way to summarize geometric data, such that meaningful proximity queries on the data can be carried out using this sketch. Using this, we provide a sublinear space data structure that can estimate the density of a point set under various measures, including (i) sum of distances of $k$ closest points to the query point and (ii) sum of squared distances of $k$ closest points to the query point. Our approach generalizes to other distance-based estimations of densities of similar flavor. We also study ...

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.