Abstract

In a language corpus, the probability that a word occurs n times is often proportional to 1/ n2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size n is typically proportional to 1/ n, declining exponentially for larger n, the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns of abundance, the generic form of probability distributions plus the conserved average abundance is sufficient. The general theory includes cases that are between the Zipf and log series endpoints, providing a broad framework for analyzing widely observed abundance patterns.

Highlights

  • A few simple patterns recur in nature

  • Death and other failures typically follow the extreme value distributions. Those simple patterns recur under widely varying conditions

  • Something fundamental must set the relations between pattern and underlying process

Read more

Summary

25 Mar 2019 report report report report

A. Bettencourt, University of Chicago, Chicago, USA Santa Fe Institute, Santa Fe, USA. Any reports and responses or comments on the article can be found at the end of the article. This article is included in the Mathematical, Physical, and Computational Sciences collection

Introduction
Conclusion
Gabaix X
Axtell RL
11. Frank SA
15. Frank SA
17. Frank SA
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call