Efficiency Analysis of the access method with the cascading Bloom filter to the data warehouse on the parallel computing platform

Yu A Grigoriev, O Yu Ermakov, V A Proletarskaya, E Yu Ermakov

doi:10.1088/1742-6596/913/1/012011

Open Access

Journal: Journal of Physics: Conference Series
Publication Date: Oct 1, 2017
Citations: 1
License type: cc-by

Abstract

A new method that uses a cascading Bloom filter (CBF) was developed for executing SQL queries in the Apache Spark parallel computing environment. The method consists of four steps: representing the original query as several subqueries, building a connection graph and transforming the subqueries, identifying the connections where Bloom filters must be used, and representing the graph in terms of Spark. Full-scale experiments on query Q3 of the TPC-H test confirmed the effectiveness of the developed method.
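
To make the role of a Bloom filter in such a query concrete, the following minimal Python sketch shows a single filtering step: a filter is built from the keys of one table and used to discard rows of another table before the exact match. It is an illustration under stated assumptions, not the authors' CBF implementation and not Spark code. The class `BloomFilter`, the helper `bloom_prefilter`, the filter size, the hashing scheme, and the toy tables are all hypothetical.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter; size and hash scheme are illustrative assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity

    def _positions(self, key):
        # Derive num_hashes positions by salting a SHA-256 digest of the key.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos] for pos in self._positions(key))


def bloom_prefilter(rows, key_of, bloom):
    """Keep only rows whose key may be present in the filter."""
    return [row for row in rows if bloom.might_contain(key_of(row))]


# Hypothetical toy tables (not TPC-H data).
customers = [(1, "BUILDING"), (2, "BUILDING")]   # (cust_key, segment)
orders = [(10, 1), (11, 2), (12, 3), (13, 4)]    # (order_key, cust_key)

# Build the filter from the smaller side.
bloom = BloomFilter()
for cust_key, _segment in customers:
    bloom.add(cust_key)

# Cheap pre-filter: may keep a few false positives, never drops a true match.
candidates = bloom_prefilter(orders, lambda row: row[1], bloom)

# The exact check removes any false positives that slipped through.
customer_keys = {cust_key for cust_key, _segment in customers}
joined = [row for row in candidates if row[1] in customer_keys]
```

Because a Bloom filter has no false negatives, every matching order survives the pre-filter; the exact check afterwards is still needed to remove possible false positives.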

Full Text