Exploiting the structure of furthest neighbor search for fast approximate results

Ryan R Curtin,Javier Echauz,Andrew B Gardner

doi:10.1016/j.is.2017.12.010

Abstract

We present a novel strategy for approximate furthest neighbor search that selects a set of candidate points using the data distribution. This strategy leads to an algorithm, which we call DrusillaSelect, that is able to outperform existing approximate furthest neighbor strategies. Our strategy is motivated by a study of the behavior of the furthest neighbor search problem, which has significantly different structure than the nearest neighbor search problem, and can be understood with the help of an information-theoretic hardness measure that we introduce. We also present a variant of the algorithm that gives an absolute approximation guarantee; under some assumptions, the guaranteed approximation can be achieved in provably less time than brute-force search. Performance studies indicate that DrusillaSelect can achieve comparable levels of approximation to other algorithms, even on the hardest datasets, while giving up to an order of magnitude speedup. An implementation is available in the mlpack machine learning library (found at http://www.mlpack.org).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploiting the structure of furthest neighbor search for fast approximate results

Abstract

Talk to us

Similar Papers

More From: Information Systems

Lead the way for us

Journal: Information Systems	Publication Date: Jan 5, 2018
Citations: 6

Similar Papers

Fast Approximate Furthest Neighbors with Data-Dependent Candidate Selection
Ryan R Curtin ... Andrew B Gardner
-
Ryan R Curtin, et. al.Ryan R Curtin ... Andrew B Gardner
01 Jan 2015
01 Jan 2015

Large scale nearest neighbor search -- theories, algorithms, and applications
...
-
, et. al. ...
01 Jan 2014
01 Jan 2014

Lower bounds for high dimensional nearest neighbor search and related problems
Allan Borodin ... Rafail Ostrovsky
-
Allan Borodin, et. al.Allan Borodin ... Rafail Ostrovsky
01 May 1999
01 May 1999

Algorithm Engineering for High-Dimensional Similarity Search Problems (Invited Talk)

-

02 Jul 2020
02 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploiting the structure of furthest neighbor search for fast approximate results

Abstract

Talk to us

Similar Papers

More From: Information Systems