Uniform Random Sampling Not Recommended

Jianguo Lu,Hao Wang,Dingding Li

doi:10.1145/3184558.3186240

Uniform Random Sampling Not Recommended

Jianguo Lu, Hao Wang + Show 1 more

Open Access

https://doi.org/10.1145/3184558.3186240

Copy DOI

Publication Date: Jan 1, 2018

Affiliation: University of Windsor

#Uniform Random Sampling #Probability Proportional To Size + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We show that uniform random sampling is not as effective as PPS (probability proportional to size) sampling in many estimation tasks. In the setting of (graph) size estimation, this paper demonstrates that random edge sampling outperforms random node sampling, with a performance ratio proportional to the normalized graph degree variance. This result is particularly important in the era of big data, when data are typically large and scale-free, resulting in large degree variance. We derive the result by first giving the variances of random node and random edge estimators. A simpler and more intuitive result is obtained by assuming that the data is large and degree distribution follows a power law.

Full Text