Time Bounds for Streaming Problems

Raphael Clifford,Markus Jalsenius,Benjamin Sach

doi:10.4086/toc.2019.v015a002

Abstract

We give tight cell-probe bounds for the time to compute convolution, multiplication and Hamming distance in a stream. The cell probe model is a particularly strong computational model and subsumes, for example, the popular word RAM model. • We first consider online convolution where the task is to output the inner product between a fixed n-dimensional vector and a vector of the n most recent values from a stream. One symbol of the stream arrives at a time and the each output must be computed before the next symbols arrives. • Next we show bounds for online multiplication where the stream consists of pairs of digits, one from each of two n digit numbers that are to be multiplied. One pair arrives at a time and the task is to output a single new digit from the product before the next pair of digits arrives. • Finally we look at the online Hamming distance problem where the Hamming distance is outputted instead of the inner product. For each of these three problems, we give a lower bound of Ω ( δ w log n ) time on average per output, where δ is the number of bits needed to represent an input symbol and w is the cell or word size. We argue that these bound are in fact tight within the cell probe model.

Highlights

We consider the complexity of three related and fundamental problems: computing the convolution of two vectors, multiplying two integers, and computing the Hamming distance between two strings
We look at the online Hamming distance problem where the Hamming distance is computed instead of the inner product
For each of these three problems, we give a lower bound of Ω ((δ /w) log n) time on average per output symbol, where δ is the number of bits needed to represent an input symbol and w is the cell or word size. We argue that these bounds are tight within the cell probe model

Summary

Introduction

We consider the complexity of three related and fundamental problems: computing the convolution of two vectors, multiplying two integers, and computing the Hamming distance between two strings We study these problems in an online or streaming context and provide matching upper and lower bounds in the cell-probe model. In the cell-probe model with w bits per cell, for any positive integers q and n, and any randomised algorithm solving the online multiplication problem in base q, there exist instances such that computing the n least significant digits of the product takes Ω((δ /w)n log n) expected time, where δ = log q. In the cell-probe model with w bits per cell, for any positive integers q and n, and any randomised algorithm solving the online Hamming distance problem, there exist instances such that the expected amortised time is Ω((δ /w) log n) per arriving value, where δ = min{log q, log n}. These methods have been extended to allow searching for patterns in rapidly processed data streams [6, 9]

Previous results and upper bounds in the RAM model

The cell-probe model

Technical contributions

Organisation

Basic setup for the lower bounds

The framework

Hard distributions

Information transfer

Overall proofs of the lower bounds

An upper bound on the entropy

Lower bounds on the entropy

Lower bounds on the information transfer

Obtaining the cell-probe lower bounds

Hard distributions for the convolution problem

Entropy lower bound over all arrays F

Entropy lower bound with a fixed array F

Hard distributions for the multiplication problem

Hard distribution for the Hamming distance problem

The overall structure of the fixed string F

Properties of the string R and Hamming arrays

The hard distribution and obtaining the lower bound

A string with many different Hamming arrays

The structure of R

Vector sums and Hamming arrays

Vector sets with many distinct sums

Vectors and codes

Choosing the vectors in V

Many distinct sums for subsets of V

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Theory of Computing	Publication Date: Jan 1, 2019
Citations: 36	License type: cc-by

R Discovery Prime

R Discovery Prime

Time Bounds for Streaming Problems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Theory of Computing

Lead the way for us

Similar Papers

Lower Bounds for Online Integer Multiplication and Convolution in the Cell-Probe Model
Raphaël Clifford ... Markus Jalsenius
-
Raphaël Clifford, et. al.Raphaël Clifford ... Markus Jalsenius
01 Jan 2010
01 Jan 2010

Cell-probe bounds for online edit distance and other pattern matching problems
...
-
, et. al. ...
04 Jan 2015
04 Jan 2015

Tight cell-probe bounds for online Hamming distance computation
...
-
, et. al. ...
06 Jan 2013
06 Jan 2013

Tight Cell-Probe Bounds for Online Hamming Distance Computation
Raphaël Clifford ... Benjamin Sach
-
Raphaël Clifford, et. al.Raphaël Clifford ... Benjamin Sach
06 Jan 2013
06 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Time Bounds for Streaming Problems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Theory of Computing