On The Hardness of Approximate and Exact (Bichromatic) Maximum Inner Product

Lijie Chen

doi:10.4086/toc.2020.v016a004

Abstract

In this paper we study the (Bichromatic) Maximum Inner Product Problem (Max-IP), in which we are given sets A and B of vectors, and the goal is to find a ∈ A and b ∈ B maximizing inner product a · b. Max-IP is very basic and serves as the base problem in the recent breakthrough of [Abboud et al., FOCS 2017] on hardness of approximation for polynomial-time problems. It is also used (implicitly) in the argument for hardness of exact e2-Furthest Pair (and other important problems in computational geometry) in poly-log-log dimensions in [Williams, SODA 2018]. We have three main results regarding this problem.• Characterization of Multiplicative Approximation. First, we study the best multiplicative approximation ratio for Boolean Max-IP in sub-quadratic time. We show that, for Max-IP with two sets of n vectors from {0, 1}d, there is an n2−ω(1) time (d/logn)ω(1) - multiplicative-approximating algorithm, and we show this is conditionally optimal, as such a (d/logn)o(1)-approximating algorithm would refute SETH. Similar characterization is also achieved for additive approximation for Max-IP.• 2O(log* n)-dimensional Hardness for Exact Max-IP Over The Integers. Second, we revisit the hardness of solving Max-IP exactly for vectors with integer entries. We show that, under SETH, for Max-IP with sets of n vectors from Zd for some d = 2O(log* n), every exact algorithm requires n2−o(1) time. With the reduction from [Williams, SODA 2018], it follows that e2-Furthest Pair and Bichromatic e2-Closest Pair in 2O(log* n) dimensions require n2−o(1) time.• Connection with NP · UPP Communication Protocols. Last, We establish a connection between conditional lower bounds for exact Max-IP with integer entries and NP · UPP communication protocols for Set-Disjointness, parallel to the connection between conditional lower bounds for approximating Max-IP and MA communication protocols for Set-Disjointness.The lower bound in our first result is a direct corollary of the new MA protocol for Set-Disjointness introduced in [Rubinstein, STOC 2018], and our algorithms utilize the polynomial method and simple random sampling. Our second result follows from a new dimensionality self reduction from the Orthogonal Vectors problem for n vectors from {0, 1}d to n vectors from Ze where e = 2O(log* d), dramatically improving the previous reduction in [Williams, SODA 2018]. The key technical ingredient is a recursive application of Chinese Remainder Theorem.As a side product, we obtain an MA communication protocol for Set-Disjointness with complexity [EQUATION], slightly improving the [EQUATION] log n) bound [Aaronson and Wigderson, TOCT 2009], and approaching the [EQUATION] lower bound [Klauck, CCC 2003].Moreover, we show that (under SETH) one can apply the [EQUATION] BQP communication protocol for Set-Disjointness to prove near-optimal hardness for approximation to Max-IP with vectors in {− 1, 1}d. This answers a question from [Abboud et al., FOCS 2017] in the affirmative.

Highlights

Under SETH, for Maximum Inner Product Problem (Max-IP) with sets of n vectors from Zd for some d = 2O(log∗ n), every exact algorithm requires n2−o(1) time
Maximum Inner Product Search is a fundamental similarity search problem in which you want to maintain a collection of vectors S, and answer queries of the form that given a new vector q, find the vector in S which is the most correlated to q
In this paper we consider a natural offline version of Maximum Inner Product Search, in which you are only required to compute the maximum correlation between S and all queries, and queries are given in advance

Summary

Introduction

Maximum Inner Product Search is a fundamental similarity search problem in which you want to maintain a collection of vectors S, and answer queries of the form that given a new vector q, find the vector in S which is the most correlated to q (or an approximation to it). This problem is closely related to another fundamental problem called nearest neighbor search, in which one needs to maintain a collection of points, and find the nearest neighbor for the query points (or an approximation to it). ON THE HARDNESS OF APPROXIMATE AND EXACT (BICHROMATIC) MAXIMUM INNER PRODUCT defined as: given two sets A, B each consisting of n vectors from {0, 1}d compute. We use Z-Max-IPn,d (R-Max-IPn,d) to denote the same problem, but with A, B being sets of vectors from Zd (Rd)

Hardness of approximate Max-IP

Hardness of exact Z-Max-IP

Hypothesis assumed in this paper

Characterizations of hardness of multiplicative approximations to Max-IP

Characterizations of hardness of additive approximations to Max-IP

Our results on Z-Max-IP

Improved dimensionality reduction for OV and Hopcroft’s problem

Improved MA protocols for Set-Disjointness

Intuition for dimensionality self-reduction for OV

Related work

Preliminaries

Fast rectangular matrix multiplication

Number theory

Communication complexity

Derandomization

The multiplicative case

The additive case

Adaptation to All-Pair-Max-IP

Improved hardness for LCS-Closest Pair problem

Improved dimensionality reduction for OV

Improved hardness for Hopcroft’s problem

Hardness for Z-Max-IP

A dimensionality reduction for Max-IP

Hardness for 2-Furthest Pair and Bichromatic 2-Closest Pair

Nonuniform to uniform transformation for dimensionality reduction for OV

Improved MA protocols

Future work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Theory of Computing	Publication Date: Jan 1, 2020
Citations: 4	License type: cc-by

R Discovery Prime

R Discovery Prime

On The Hardness of Approximate and Exact (Bichromatic) Maximum Inner Product

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Theory of Computing

Lead the way for us

Similar Papers

On the hardness of approximate and exact (bichromatic) maximum inner product

-

22 Jun 2018
22 Jun 2018

THE DATA STRUCTURE ACCELERATOR ARCHITECTURE
Richard Zippel
International Journal of High Speed Electronics and Systems | VOL. 07
Richard ZippelRichard Zippel
01 Dec 1996
International Journal of High Speed Electronics and Systems | VOL. 07

Subquadratic algorithms for algebraic generalizations of 3SUM
...
-
, et. al. ...
01 Jun 2017
01 Jun 2017

Subquadratic Algorithms for Algebraic 3SUM
Noam Solomon ... Aurélien Ooms
Discrete & Computational Geometry | VOL. 61
Noam Solomon, et. al.Noam Solomon ... Aurélien Ooms
21 Nov 2018
Discrete & Computational Geometry | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On The Hardness of Approximate and Exact (Bichromatic) Maximum Inner Product

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Theory of Computing