On Set Size Distribution Estimation and the Characterization of Large Networks via Sampling

Fabricio Murai,Don Towsley,Bruno Ribeiro,Pinghui Wang

doi:10.1109/jsac.2013.130604

Abstract

In this work we study the set size distribution estimation problem, where elements are randomly sampled from a collection of non-overlapping sets and we seek to recover the original set size distribution from the samples. This problem has applications to capacity planning and network theory. Examples of real-world applications include characterizing in-degree distributions in large graphs and uncovering TCP/IP flow size distributions on the Internet. We demonstrate that it is difficult to estimate the original set size distribution. The recoverability of original set size distributions presents a sharp threshold with respect to the fraction of elements that remain in the sets. If this fraction lies below the threshold, typically half of the elements in power-law and heavier-than-exponential-tailed distributions, then the original set size distribution is unrecoverable. We also discuss practical implications of our findings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On Set Size Distribution Estimation and the Characterization of Large Networks via Sampling

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Communications

Lead the way for us

Journal: IEEE Journal on Selected Areas in Communications	Publication Date: Jun 1, 2013
Citations: 28

Similar Papers

A High Volume Apparatus for the Condensational Growth of Ultrafine Particles for Inhalation Toxicological Studies
Philip Demokritou ... Petros Koutrakis
Aerosol Science and Technology | VOL. 36
Philip Demokritou, et. al.Philip Demokritou ... Petros Koutrakis
01 Nov 2002
Aerosol Science and Technology | VOL. 36

Freeze-Drying of Nanosuspensions, Part 3: Investigation of Factors Compromising Storage Stability of Highly Concentrated Drug Nanosuspensions
Jakob Beirowski ... Henning Gieseler
Journal of Pharmaceutical Sciences | VOL. 101
Jakob Beirowski, et. al.Jakob Beirowski ... Henning Gieseler
01 Jan 2012
Journal of Pharmaceutical Sciences | VOL. 101

Hybrid sampling for estimating flow size distribution and its implementation
Takayuki Goto ... Chikara Sasaki
-
Takayuki Goto, et. al.Takayuki Goto ... Chikara Sasaki
01 Jun 2010
01 Jun 2010

Modeling fragmentation by percolation in combustion of carbons
P Salatino ... L Massimilla
Powder Technology | VOL. 66
P Salatino, et. al.P Salatino ... L Massimilla
01 Apr 1991
Powder Technology | VOL. 66

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On Set Size Distribution Estimation and the Characterization of Large Networks via Sampling

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Communications