More Accurate Streaming Cardinality Estimation With Vectorized Counters

Valerio Bruschi,Daniel Ting,Salvatore Pontarelli,Pedro Reviriego,Giuseppe Bianchi

doi:10.1109/lnet.2021.3076048

Abstract

Cardinality estimation, also known as count-distinct, is the problem of finding the number of different elements in a set with repeated elements. Among the many approximate algorithms proposed for this task, HyperLogLog (HLL) has established itself as the state of the art due to its ability to accurately estimate cardinality over a large range of values using a small memory footprint. When elements arrive in a stream, as in the case of most networking applications, improved techniques are possible. We specifically propose a new algorithm that improves the accuracy of cardinality estimation by grouping counters, and by using their new organization to further track all updates within a given counter size range (compared with just the last update as in the standard HLL). Results show that when using the same number of counters, one configuration of the new scheme reduces the relative error by approximately 0.86x using the same amount of memory as the streaming HLL and another configuration achieves a similar accuracy reducing the memory needed by approximately 0.85x.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

More Accurate Streaming Cardinality Estimation With Vectorized Counters

Abstract

Talk to us

Similar Papers

More From: IEEE Networking Letters

Lead the way for us

Journal: IEEE Networking Letters	Publication Date: Jun 1, 2021
Citations: 1

Similar Papers

Security of HyperLogLog (HLL) Cardinality Estimation: Vulnerabilities and Protection
Pedro Reviriego ... Daniel Ting
IEEE Communications Letters | VOL. 24
Pedro Reviriego, et. al.Pedro Reviriego ... Daniel Ting
01 May 2020
IEEE Communications Letters | VOL. 24

Fast Updates for Line-Rate HyperLogLog-Based Cardinality Estimation
Pedro Reviriego ... Giuseppe Bianchi
IEEE Communications Letters | VOL. 24
Pedro Reviriego, et. al.Pedro Reviriego ... Giuseppe Bianchi
20 Aug 2020
IEEE Communications Letters | VOL. 24

Protection of DDoS Attacks at the Application Layer: HyperLogLog (HLL) Cardinality Estimation
Balarengadurai Chinnaiah
-
Balarengadurai ChinnaiahBalarengadurai Chinnaiah
01 Jan 2020
01 Jan 2020

Staggered HLL: Near-continuous-time cardinality estimation with no overhead
Alessandro Cornacchia ... Paolo Giaccone
Computer Communications | VOL. 193
Alessandro Cornacchia, et. al.Alessandro Cornacchia ... Paolo Giaccone
27 Jun 2022
Computer Communications | VOL. 193

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

More Accurate Streaming Cardinality Estimation With Vectorized Counters

Abstract

Talk to us

Similar Papers

More From: IEEE Networking Letters