Analysis of False Negative Rates for Recycling Bloom Filters (Yes, They Happen!)

Kahlil Dozier,Dan Rubenstein,Loqman Salamatian

doi:10.1145/3656005

Abstract

Bloom Filters are a desirable data structure for distinguishing new values in sequences of data (i.e., messages), due to their space efficiency, their low false positive rates (incorrectly classifying a new value as a repeat), and never producing false negatives (classifying a repeat value as new). However, as the Bloom Filter's bits are filled, false positive rates creep upward. To keep false positive rates below a reasonable threshold, applications periodically "recycle" the Bloom Filter, clearing the memory and then resuming the tracking of data. After a recycle point, subsequent arrivals of recycled messages are likely to be misclassified as new; recycling induces false negatives. Despite numerous applications of recycling, the corresponding false negative rates have never been analyzed. In this paper, we derive approximations, upper bounds, and lower bounds of false negative rates for several variants of recycling Bloom Filters. These approximations and bounds are functions of the size of memory used to store the Bloom Filter and the distributions on new arrivals and repeat messages, and can be efficiently computed on conventional hardware. We show, via comparison to simulation, that our upper bounds and approximations are extremely tight, and can be efficiently computed for megabyte-sized Bloom Filters on conventional hardware.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of False Negative Rates for Recycling Bloom Filters (Yes, They Happen!)

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Measurement and Analysis of Computing Systems

Lead the way for us

Journal: Proceedings of the ACM on Measurement and Analysis of Computing Systems	Publication Date: May 21, 2024
License type: mit

Similar Papers

Analysis of False Negative Rates for Recycling Bloom Filters (Yes, They Happen!)
Kahlil Dozier ... Loqman Salamatian
ACM SIGMETRICS Performance Evaluation Review | VOL. 52
Kahlil Dozier, et. al.Kahlil Dozier ... Loqman Salamatian
11 Jun 2024
ACM SIGMETRICS Performance Evaluation Review | VOL. 52

On Reducing False Positives of a Bloom Filter in Trie-Based Algorithms
Ju Hyoung Mun ... Hyesook Lim
IEIE Transactions on Smart Processing and Computing | VOL. 4
Ju Hyoung Mun, et. al.Ju Hyoung Mun ... Hyesook Lim
30 Jun 2015
IEIE Transactions on Smart Processing and Computing | VOL. 4

False Negative Problem of Counting Bloom Filter
Deke Guo ... Yunhao Liu
IEEE Transactions on Knowledge and Data Engineering | VOL. 22
Deke Guo, et. al. Deke Guo ... Yunhao Liu
01 May 2010
IEEE Transactions on Knowledge and Data Engineering | VOL. 22

FBF: Bloom Filter for Fuzzy Membership Queries on Strings
Rishabh Kumar ... Hemant Tiwari
-
Rishabh Kumar, et. al.Rishabh Kumar ... Hemant Tiwari
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of False Negative Rates for Recycling Bloom Filters (Yes, They Happen!)

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Measurement and Analysis of Computing Systems