Performance-optimal filtering

Harald Lang, Peter Boncz, Thomas Neumann, Alfons Kemper

doi:10.14778/3303753.3303757

Open Access

https://doi.org/10.14778/3303753.3303757

Abstract

We define the concept of performance-optimal filtering to indicate the Bloom or Cuckoo filter configuration that best accelerates a particular task. While the space-precision tradeoff of these filters has been well studied, we show how to pick a filter that maximizes the performance for a given workload. This choice might be "suboptimal" relative to traditional space-precision metrics, but it will lead to better performance in practice. In this paper, we focus on high-throughput filter use cases, aimed at avoiding CPU work, e.g., a cache miss, a network message, or a local disk I/O; such events can happen at rates of millions down to hundreds per second. Besides the false-positive rate and memory footprint of the filter, performance optimality has to take into account the absolute cost of the filter lookup as well as the work per lookup that filtering saves. The actual rate of negative lookups in the workload, in turn, determines whether using a filter improves overall performance at all. In the course of the paper, we introduce new filter variants, namely the register-blocked and cache-sectorized Bloom filters. We present new implementation techniques and perform an extensive evaluation on modern hardware platforms, including the wide-SIMD Skylake-X and Knights Landing. This experimentation shows that in high-throughput situations, the lower lookup cost of blocked Bloom filters allows them to overtake Cuckoo filters.
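
The tradeoff described in the abstract can be illustrated with a small cost model. The sketch below is an assumption made for illustration, not necessarily the paper's exact formulation: it charges every lookup the filter's lookup cost, charges positive lookups the full work cost, and charges negative lookups the work cost only when the filter returns a false positive. The names `FilterConfig`, `cost_with_filter`, and `pick_filter`, as well as all numeric values, are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class FilterConfig:
    """A candidate filter; all values are hypothetical, in arbitrary cost units."""
    name: str
    lookup_cost: float          # cost of one filter lookup
    false_positive_rate: float  # probability that a negative key passes the filter


def cost_with_filter(cfg: FilterConfig, work_cost: float, negative_rate: float) -> float:
    """Expected cost per lookup when the filter guards the expensive work.

    Assumed model: every lookup pays the filter cost; positive lookups
    always pay the work; negative lookups pay it only on a false positive.
    """
    needed = (1.0 - negative_rate) * work_cost
    wasted = negative_rate * cfg.false_positive_rate * work_cost
    return cfg.lookup_cost + needed + wasted


def pick_filter(configs: Iterable[FilterConfig], work_cost: float,
                negative_rate: float) -> Optional[FilterConfig]:
    """Return the cheapest configuration, or None if no filter beats doing
    the work unconditionally (which costs work_cost per lookup)."""
    best = min(configs, key=lambda c: cost_with_filter(c, work_cost, negative_rate))
    if cost_with_filter(best, work_cost, negative_rate) < work_cost:
        return best
    return None


# Hypothetical candidates: a cheap but less precise blocked Bloom filter
# and a costlier but more precise Cuckoo filter.
CANDIDATES = [
    FilterConfig("blocked-bloom", lookup_cost=5.0, false_positive_rate=0.02),
    FilterConfig("cuckoo", lookup_cost=30.0, false_positive_rate=0.001),
]

if __name__ == "__main__":
    for work_cost, negative_rate in [(200.0, 0.9), (1_000_000.0, 0.9), (200.0, 0.01)]:
        choice = pick_filter(CANDIDATES, work_cost, negative_rate)
        print(work_cost, negative_rate, choice.name if choice else "no filter")
```

Under these made-up numbers, the cheaper filter wins when the avoided work is small, the more precise filter wins when the avoided work is very expensive, and no filter pays off when almost all lookups are positive. This mirrors the qualitative argument of the abstract, but the actual crossover points depend on measured costs.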

Journal: Proceedings of the VLDB Endowment
Publication Date: Jan 1, 2019
Citations: 35
License type: cc-by-nc-nd

Similar Papers

Design and Evaluation of Cascading Cuckoo Filters for Zero-False-Positive Membership Services
Sai Medury ... Amani Altarawneh
27 Jan 2021

Xor Filters
Thomas Mueller Graf ... Daniel Lemire
ACM Journal of Experimental Algorithmics | VOL. 25
13 Mar 2020

CFBF: Reducing the Insertion Time of Cuckoo Filters With an Integrated Bloom Filter
Pedro Reviriego ... Jorge Martinez
IEEE Communications Letters | VOL. 23
01 Oct 2019

Cuckoo Filter
Bin Fan ... Dave G Andersen
02 Dec 2014
