Abstract

In this paper we study the (Bichromatic) Maximum Inner Product Problem (Max-IP), in which we are given sets A and B of vectors, and the goal is to find a ∈ A and b ∈ B maximizing inner product a · b. Max-IP is very basic and serves as the base problem in the recent breakthrough of [Abboud et al., FOCS 2017] on hardness of approximation for polynomial-time problems. It is also used (implicitly) in the argument for hardness of exact e2-Furthest Pair (and other important problems in computational geometry) in poly-log-log dimensions in [Williams, SODA 2018]. We have three main results regarding this problem.• Characterization of Multiplicative Approximation. First, we study the best multiplicative approximation ratio for Boolean Max-IP in sub-quadratic time. We show that, for Max-IP with two sets of n vectors from {0, 1}d, there is an n2−ω(1) time (d/logn)ω(1) - multiplicative-approximating algorithm, and we show this is conditionally optimal, as such a (d/logn)o(1)-approximating algorithm would refute SETH. Similar characterization is also achieved for additive approximation for Max-IP.• 2O(log* n)-dimensional Hardness for Exact Max-IP Over The Integers. Second, we revisit the hardness of solving Max-IP exactly for vectors with integer entries. We show that, under SETH, for Max-IP with sets of n vectors from Zd for some d = 2O(log* n), every exact algorithm requires n2−o(1) time. With the reduction from [Williams, SODA 2018], it follows that e2-Furthest Pair and Bichromatic e2-Closest Pair in 2O(log* n) dimensions require n2−o(1) time.• Connection with NP · UPP Communication Protocols. Last, We establish a connection between conditional lower bounds for exact Max-IP with integer entries and NP · UPP communication protocols for Set-Disjointness, parallel to the connection between conditional lower bounds for approximating Max-IP and MA communication protocols for Set-Disjointness.The lower bound in our first result is a direct corollary of the new MA protocol for Set-Disjointness introduced in [Rubinstein, STOC 2018], and our algorithms utilize the polynomial method and simple random sampling. Our second result follows from a new dimensionality self reduction from the Orthogonal Vectors problem for n vectors from {0, 1}d to n vectors from Ze where e = 2O(log* d), dramatically improving the previous reduction in [Williams, SODA 2018]. The key technical ingredient is a recursive application of Chinese Remainder Theorem.As a side product, we obtain an MA communication protocol for Set-Disjointness with complexity [EQUATION], slightly improving the [EQUATION] log n) bound [Aaronson and Wigderson, TOCT 2009], and approaching the [EQUATION] lower bound [Klauck, CCC 2003].Moreover, we show that (under SETH) one can apply the [EQUATION] BQP communication protocol for Set-Disjointness to prove near-optimal hardness for approximation to Max-IP with vectors in {− 1, 1}d. This answers a question from [Abboud et al., FOCS 2017] in the affirmative.

Highlights

  • Under SETH, for Maximum Inner Product Problem (Max-IP) with sets of n vectors from Zd for some d = 2O(log∗ n), every exact algorithm requires n2−o(1) time

  • Maximum Inner Product Search is a fundamental similarity search problem in which you want to maintain a collection of vectors S, and answer queries of the form that given a new vector q, find the vector in S which is the most correlated to q

  • In this paper we consider a natural offline version of Maximum Inner Product Search, in which you are only required to compute the maximum correlation between S and all queries, and queries are given in advance

Read more

Summary

Introduction

Maximum Inner Product Search is a fundamental similarity search problem in which you want to maintain a collection of vectors S, and answer queries of the form that given a new vector q, find the vector in S which is the most correlated to q (or an approximation to it). This problem is closely related to another fundamental problem called nearest neighbor search, in which one needs to maintain a collection of points, and find the nearest neighbor for the query points (or an approximation to it). ON THE HARDNESS OF APPROXIMATE AND EXACT (BICHROMATIC) MAXIMUM INNER PRODUCT defined as: given two sets A, B each consisting of n vectors from {0, 1}d compute. We use Z-Max-IPn,d (R-Max-IPn,d) to denote the same problem, but with A, B being sets of vectors from Zd (Rd)

Hardness of approximate Max-IP
Hardness of exact Z-Max-IP
Hypothesis assumed in this paper
Characterizations of hardness of multiplicative approximations to Max-IP
Characterizations of hardness of additive approximations to Max-IP
Our results on Z-Max-IP
Improved dimensionality reduction for OV and Hopcroft’s problem
Improved MA protocols for Set-Disjointness
Intuition for dimensionality self-reduction for OV
Related work
Preliminaries
Fast rectangular matrix multiplication
Number theory
Communication complexity
Derandomization
The multiplicative case
The additive case
Adaptation to All-Pair-Max-IP
Improved hardness for LCS-Closest Pair problem
Improved dimensionality reduction for OV
Improved hardness for Hopcroft’s problem
Hardness for Z-Max-IP
A dimensionality reduction for Max-IP
Hardness for 2-Furthest Pair and Bichromatic 2-Closest Pair
Nonuniform to uniform transformation for dimensionality reduction for OV
Improved MA protocols
Future work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call