Abstract

Randomized dimensionality reduction is a cornerstone of handling high-dimensional data, originating in foundational results such as the celebrated Johnson-Lindenstrauss Lemma. In particular, nearest neighbor-preserving embeddings are known for the ℓ2 (Euclidean) and ℓ1 (Manhattan) metrics, as well as for doubling subsets of ℓ2; the doubling dimension is today the most effective way of capturing both intrinsic dimensionality and input structure in various applications. These randomized embeddings bound the distortion only for distances between the query point and the points of the data set. Motivated by the foundational character of fast Approximate Nearest Neighbor search in ℓ1, this paper settles an important missing case, namely that of doubling subsets of ℓ1. In particular, we introduce a randomized dimensionality reduction by means of a near neighbor-preserving embedding, which is related to the decision-with-witness problem. The input set is represented by a carefully chosen covering point set, which the algorithm then randomly projects. To obtain the covering point sets, we leverage either approximate r-nets or randomly shifted grids, with different tradeoffs between preprocessing time and target dimension. We exploit Cauchy random variables and derive a concentration bound of independent interest. Our algorithms are rather simple and should therefore be useful in practice.
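To make the two-step template concrete, here is a minimal Python sketch of one possible instantiation: points are snapped to a randomly shifted grid (one of the two covering strategies mentioned above), and the covering set is then projected with a matrix of i.i.d. standard Cauchy entries, the 1-stable analogue of a Gaussian projection for ℓ1. All names, the cell width, and the median-based distance estimate are illustrative assumptions, not the paper's actual construction or guarantees.

```python
import numpy as np

rng = np.random.default_rng(0)

def snap_to_shifted_grid(points, cell_width):
    """Replace each point by the center of its cell in a randomly
    shifted axis-aligned grid (an illustrative covering step; the
    hypothetical cell_width would be tied to the query radius r)."""
    shift = rng.uniform(0.0, cell_width, size=points.shape[1])
    cells = np.floor((points + shift) / cell_width)
    return (cells + 0.5) * cell_width - shift

def cauchy_project(points, target_dim):
    """Project with i.i.d. standard Cauchy entries: by 1-stability,
    each projected coordinate of x - y is distributed as ||x - y||_1
    times a standard Cauchy variable."""
    C = rng.standard_cauchy(size=(points.shape[1], target_dim))
    return points @ C

def l1_estimate(px, py):
    """Median-based estimate of ||x - y||_1 from the projections;
    the median is used because Cauchy variables have no finite mean
    (the median of |standard Cauchy| equals 1)."""
    return np.median(np.abs(px - py))

# Toy usage: cover the data, then project the covering set.
X = rng.uniform(size=(100, 50))            # 100 points in 50-dim l1
covered = snap_to_shifted_grid(X, 0.1)     # step 1: covering point set
Y = cauchy_project(covered, 20)            # step 2: randomized projection
print(l1_estimate(Y[0], Y[1]),
      np.sum(np.abs(covered[0] - covered[1])))  # estimate vs. true l1
```

Note the design choice in l1_estimate: unlike Gaussian projections for ℓ2, where averaging works, the Cauchy projection requires a median (or similar robust) estimator, since the projected coordinates are heavy-tailed.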
