Communication Complexity of Statistical Distance

Thomas Watson

doi:10.1145/3170708

Abstract

We prove nearly matching upper and lower bounds on the randomized communication complexity of the following problem: Alice and Bob are each given a probability distribution over n elements, and they wish to estimate within ±ε the statistical (total variation) distance between their distributions. For some range of parameters, there is up to a log n factor gap between the upper and lower bounds, and we identify a barrier to using information complexity techniques to improve the lower bound in this case. We also prove a side result that we discovered along the way: the randomized communication complexity of n -bit Majority composed with n -bit Greater Than is Θ ( n log n ).

Highlights

Statistical (a.k.a. total variation) distance is a standard measure of the distance between two probability distributions, and is ubiquitous in theoretical computer science
It is natural to inquire about the computational complexity of estimating the statistical distance between two distributions x and y that are given as input
[25] showed that when each of x and y is succinctly represented by an algorithm that takes uniform random bits and produces a sample from that distribution, the problem of estimating ∆(x, y) is complete for the complexity class SZK. (For results about the complexity of other problems where the inputs are succinctly represented distributions, see [12, 13, 3, 14, 30, 29].)

Summary

Introduction

Statistical (a.k.a. total variation) distance is a standard measure of the distance between two probability distributions, and is ubiquitous in theoretical computer science. It is natural to inquire about the computational complexity of estimating the statistical distance between two distributions x and y that are given as input. This topic has been studied before in several contexts:. (For results about the complexity of other problems where the inputs are black-box samples from distributions, see the surveys [14, 24, 7].) [10, 11] studied the space complexity of (a generalization of) statistical distance estimation when the vectors x and y are provided as data streams 49:2 Communication Complexity of Statistical Distance [2, 27, 9] studied the complexity of statistical distance estimation when an algorithm is only given black-box access to oracles that produce samples from the distributions specified by x and y. (For results about the complexity of other problems where the inputs are black-box samples from distributions, see the surveys [14, 24, 7].) [10, 11] studied the space complexity of (a generalization of) statistical distance estimation when the vectors x and y are provided as data streams

Communication Upper and Lower Bounds

Composing with Majority

Preliminaries

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ACM Transactions on Computation Theory	Publication Date: Jan 24, 2018
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

Communication Complexity of Statistical Distance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ACM Transactions on Computation Theory

Lead the way for us

Similar Papers

Communication Complexity of Statistical Distance

-

01 Jan 2017
01 Jan 2017

Separations in Communication Complexity Using Cheat Sheets and Information Complexity
Anurag Anshu ... Aleksandrs Belovs
-
Anurag Anshu, et. al.Anurag Anshu ... Aleksandrs Belovs
01 Oct 2016
01 Oct 2016

Disjointness through the Lens of Vapnik-Chervonenkis Dimension: Sparsity and Beyond
...
-
, et. al. ...
01 Jun 2020
01 Jun 2020

On the total variation and Hellinger distance between signed measures; an application to product measures
Ton Steerneman
Proceedings of the American Mathematical Society | VOL. 88
Ton SteernemanTon Steerneman
01 Jan 1982
Proceedings of the American Mathematical Society | VOL. 88

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Communication Complexity of Statistical Distance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ACM Transactions on Computation Theory