Abstract

We study classic streaming and sparse recovery problems using deterministic linear sketches, including ℓ1/ℓ1 and ℓ∞/ℓ1 sparse recovery problems (the latter also being known as ℓ1-heavy hitters), norm estimation, and approximate inner product. We focus on devising a fixed matrix A∈Rm×n and a deterministic recovery/estimation procedure which work for all possible input vectors simultaneously. Our results improve upon existing work, the following being our main contributions:• A proof that ℓ∞/ℓ1 sparse recovery and inner product estimation are equivalent, and that incoherent matrices can be used to solve both problems. Our upper bound for the number of measurements is m=O(ε-2min{logn,(logn/log(1/ε))2}), which holds for any 0<ε<1/2. We can also obtain fast sketching and recovery algorithms by making use of the Fast Johnson–Lindenstrauss transform. Both our running times and number of measurements improve upon previous work. We can also obtain better error guarantees than previous work in terms of a smaller tail of the input vector.• A new lower bound for the number of linear measurements required to solve ℓ1/ℓ1 sparse recovery. We show Ω(k/ε2+klog(n/k)/ε) measurements are required to recover an x′ with ‖x-x′‖1⩽(1+ε)‖xtail(k)‖1, where xtail(k) is x projected onto all but its largest k coordinates in magnitude.• A tight bound of m=Θ(ε-2log(ε2n)) on the number of measurements required to solve deterministic norm estimation, i.e., to recover ‖x‖2±ε‖x‖1.For all the problems we study, tight bounds are already known for the randomized complexity from previous work, except in the case of ℓ1/ℓ1 sparse recovery, where a nearly tight bound is known. Our work thus aims to study the deterministic complexities of these problems. We remark that some of the matrices used in our algorithms, although known to exist, currently are not yet explicit in the sense that deterministic polynomial time constructions are not yet known, although in all cases polynomial time Monte Carlo algorithms are known.

Highlights

  • For all the problems we study, tight bounds are already known for the randomized complexity from previous work, except in the case of 1/ 1 sparse recovery, where a nearly tight bound is known

  • In this work we provide new results for the point query problem as well as several other related problems: approximate inner product, 1/ 1 sparse recovery, Supported by NSF grant CCF-0832797

  • It may be too expensive for the servers to communicate their individual databases to a centralized server, or to compute the frequency exactly

Read more

Summary

Introduction

In this work we provide new results for the point query problem as well as several other related problems: approximate inner product, 1/ 1 sparse recovery, Supported by NSF grant CCF-0832797. Supported in part by NSF grant CCF-0832797 and a Gordon Wu fellowship. For many of these problems efficient randomized sketching and streaming algorithms exist, and we are interested in understanding the deterministic complexities of these problems

Applications
Notation and Problem Definitions
Our Contributions and Related Work
Point Query and Inner Product Estimation
Deterministic Norm Estimation and the Gelfand Width
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call