Stochastic Streams: Sample Complexity vs. Space Complexity.

Michael S Crouch ,Andrew Mcgregor ,Gregory Valiant ,David P Woodruff

doi:10.4230/lipics.esa.2016.32

Abstract

We address the trade-off between the computational resources needed to process a large data set and the number of samples available from the data set. Specifically, we consider the following abstraction: we receive a potentially infinite stream of IID samples from some unknown distribution D, and are tasked with computing some function f(D). If the stream is observed for time t, how much memory, s, is required to estimate f(D)? We refer to t as the sample complexity and s as the space complexity. The main focus of this paper is investigating the trade-offs between the space and sample complexity. We study these trade-offs for several canonical problems studied in the data stream model: estimating the collision probability, i.e., the second moment of a distribution, deciding if a graph is connected, and approximating the dimension of an unknown subspace. Our results are based on techniques for simulating different classical sampling procedures in this model, emulating random walks given a sequence of IID samples, as well as leveraging a characterization between communication bounded protocols and statistical query algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic Streams: Sample Complexity vs. Space Complexity.

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Algorithms for Sequence Alignment

-

26 Sep 2017
26 Sep 2017

Improved (Provable) Algorithms for the Shortest Vector Problem via Bounded Distance Decoding
...
-
, et. al. ...
02 Mar 2021
02 Mar 2021

Anytime Prediction and Learning for the Balance between Computation and Accuracy

-

27 Jun 2019
27 Jun 2019

Tight Tradeoffs for Real-Time Approximation of Longest Palindromes in Streams.
...
-
, et. al. ...
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic Streams: Sample Complexity vs. Space Complexity.

Abstract

Talk to us

Similar Papers