Always Good Turing: asymptotically optimal probability estimation.

Alon Orlitsky,Narayana P Santhanam,Junan Zhang

doi:10.1126/science.1088284

Always Good Turing: asymptotically optimal probability estimation.

Alon Orlitsky, Narayana P Santhanam + Show 1 more

https://doi.org/10.1126/science.1088284

Copy DOI

Journal: Science (New York, N.Y.)	Publication Date: Oct 17, 2003
Citations: 129

Affiliation: University of California, San Diego

#Infinite Attenuation #Sample Of Data + Show 4 more

Abstract
Full-Text PDF
Similar Papers

Abstract

While deciphering the Enigma code, Good and Turing derived an unintuitive, yet effective, formula for estimating a probability distribution from a sample of data. We define the attenuation of a probability estimator as the largest possible ratio between the per-symbol probability assigned to an arbitrarily long sequence by any distribution, and the corresponding probability assigned by the estimator. We show that some common estimators have infinite attenuation and that the attenuation of the Good-Turing estimator is low, yet greater than 1. We then derive an estimator whose attenuation is 1; that is, asymptotically it does not underestimate the probability of any sequence.

Full Text