We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U . We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O ( 1 ϵ 2 log N ( log log N + log U ) ) bits, where ϵ ∈ ( 0 , 1 ) is the approximation parameter. The second one, probabilistic, uses O ( 1 ϵ 2 log N δ log 1 ϵ ) bits or O ( 1 ϵ 2 log N δ log N ) bits, depending on the decay function parameter, where δ ∈ ( 0 , 1 ) is the probability of failure. The third one, deterministic in the stochastic model, uses O ( 1 ϵ log U ) bits or O ( 1 ϵ 2 log N ) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.
Read full abstract