Power-law distributions in binned empirical data

Yogesh Virkar,Aaron Clauset

doi:10.1214/13-aoas710

Abstract

Many man-made and natural phenomena, including the intensity of earthquakes, population of cities and size of international wars, are believed to follow power-law distributions. The accurate identification of power-law patterns has significant consequences for correctly understanding and modeling complex systems. However, statistical evidence for or against the power-law hypothesis is complicated by large fluctuations in the empirical distribution’s tail, and these are worsened when information is lost from binning the data. We adapt the statistically principled framework for testing the power-law hypothesis, developed by Clauset, Shalizi and Newman, to the case of binned data. This approach includes maximum-likelihood fitting, a hypothesis test based on the Kolmogorov–Smirnov goodness-of-fit statistic and likelihood ratio tests for comparing against alternative explanations. We evaluate the effectiveness of these methods on synthetic binned data with known structure, quantify the loss of statistical power due to binning, and apply the methods to twelve real-world binned data sets with heavy-tailed patterns.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Annals of Applied Statistics	Publication Date: Mar 1, 2014
Citations: 220	License type: unspecified-oa

R Discovery Prime

R Discovery Prime

Power-law distributions in binned empirical data

Abstract

Talk to us

Similar Papers

More From: The Annals of Applied Statistics

Lead the way for us

Similar Papers

A tale of two tails: Do Power Law and Lognormal models fit firm-size distributions in the mid-Victorian era?
Piero Montebruno ... Harry Smith
Physica A: Statistical Mechanics and its Applications | VOL. 523
Piero Montebruno, et. al.Piero Montebruno ... Harry Smith
08 Mar 2019
Physica A: Statistical Mechanics and its Applications | VOL. 523

Identification of the Best-Fit Probability Distribution and Modeling Short-Duration Intensity-Duration-Frequency Curves—Mangalore
C Varghese Femin ... K Varija
-
C Varghese Femin, et. al.C Varghese Femin ... K Varija
29 Sep 2020
29 Sep 2020

K olmogorov– S mirnov Tests
Vance W Berger ... Yanyan Zhou
-
Vance W Berger, et. al.Vance W Berger ... Yanyan Zhou
15 Apr 2005
15 Apr 2005

Using simulation to study statistical tests for arrival process and service time models for service systems
...
-
, et. al. ...
08 Dec 2013
08 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Power-law distributions in binned empirical data

Abstract

Talk to us

Similar Papers

More From: The Annals of Applied Statistics