Improving and Extending the Testing of Distributions for Shape-Restricted Properties

E Fischer,O Lachish,Y Vasudev

doi:10.1007/s00453-019-00598-1

Abstract

Distribution testing deals with what information can be deduced about an unknown distribution over $$\{1,\ldots ,n\}$$ , where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In the extended conditional sampling model, the algorithm is also allowed to obtain samples from the restriction of the original distribution on subsets of $$\{1,\ldots ,n\}$$ . In 2015, Canonne, Diakonikolas, Gouleakis and Rubinfeld unified several previous results, and showed that for any property of distributions satisfying a “decomposability” criterion, there exists an algorithm (in the basic model) that can distinguish with high probability distributions satisfying the property from distributions that are far from it in the variation distance. We present here a more efficient yet simpler algorithm for the basic model, as well as very efficient algorithms for the conditional model, which until now was not investigated under the umbrella of decomposable properties. Additionally, we provide an algorithm for the conditional model that handles a much larger class of properties. Our core mechanism is an algorithm for efficiently producing an interval-partition of $$\{1,\ldots ,n\}$$ that satisfies a “fine-grain” quality. We show that with such a partition at hand we can avoid the search for the “correct” partition of $$\{1,\ldots ,n\}$$ .

Highlights

1.1 Historical backgroundIn most computational problems that arise from modeling real-world situations, we are required to analyze large amounts of data to decide if it satisfies a fixed property
There has been a long line of research, especially in statistics, where the underlying object from which we obtain the data is modeled as a probability distribution
We study distribution testing in the standard sampling model, as well as in the conditional model

Summary

Historical background

In most computational problems that arise from modeling real-world situations, we are required to analyze large amounts of data to decide if it satisfies a fixed property. L-decomposable, there is an efficient algorithm for testing whether a given distribution belongs to the property C To achieve their results, Canonne et al ([8]) show that if a distribution μ supported over [n] is L-decomposable, it is O(L log n)-decomposable where the intervals are of the form [j2i + 1, (j + 1)2i]. Canonne et al ([8]) show that if a distribution μ supported over [n] is L-decomposable, it is O(L log n)-decomposable where the intervals are of the form [j2i + 1, (j + 1)2i] This presents a natural approach of computing the interval partition in a recursive manner, by bisecting an interval if it has a large probability weight and is not close to uniform. For further elaboration of this connection see [11]

Results and techniques

Preliminaries

Fine partitions and how to pull them

Handling decomposable distributions

Weakly tolerant interval uniformity tests

Assessing an interval partition

Learning and testing decomposable distributions and properties

Introducing properties characterized by atlases

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithmica	Publication Date: Jun 21, 2019
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving and Extending the Testing of Distributions for Shape-Restricted Properties

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithmica

Lead the way for us

Similar Papers

Improving and extending the testing of distributions for shape-restricted properties
...
-
, et. al. ...
01 Jan 2017
01 Jan 2017

Construction of image processing procedures from a small number of learning samples using the IMPRESS vision expert system
Toshihiro Hamada ... Jun‐Ichi Hasegawa
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
Toshihiro Hamada, et. al.Toshihiro Hamada ... Jun‐Ichi Hasegawa
07 Oct 2004
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87

A speaker‐adaptation technique for context‐dependent models represented by hidden markov networks
Jun-Ichi Takami ... Shigeki Sagayama
Systems and Computers in Japan | VOL. 27
Jun-Ichi Takami, et. al.Jun-Ichi Takami ... Shigeki Sagayama
01 Jan 1996
Systems and Computers in Japan | VOL. 27

Doppler angle estimation using AR modeling.
Chih-Kuang Yeh ... Pai-Chi Li
IEEE transactions on ultrasonics, ferroelectrics, and frequency control | VOL. 49
Chih-Kuang Yeh, et. al. Chih-Kuang Yeh ... Pai-Chi Li
01 Jun 2002
IEEE transactions on ultrasonics, ferroelectrics, and frequency control | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving and Extending the Testing of Distributions for Shape-Restricted Properties

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithmica