Revisiting Reuse in Main Memory Database Systems

Kayhan Dursun,Carsten Binnig,Tim Kraska,Ugur Cetintemel

doi:10.1145/3035918.3035957

Abstract

Reusing intermediates in databases to speed-up analytical query processing was studied in prior work. Existing solutions require intermediate results of individual operators to be materialized using materialization operators. However, inserting such materialization operations into a query plan not only incurs additional execution costs but also often eliminates important cache- and register-locality opportunities, resulting in even higher performance penalties. This paper studies a novel reuse model for intermediates, which caches internal physical data structures materialized during query processing (due to pipeline breakers) and externalizes them so that they become reusable for upcoming operations. We focus on hash tables, the most commonly used internal data structure in main memory databases to perform join and aggregation operations. As queries arrive, our reuse-aware optimizer reasons about the reuse opportunities for hash tables, employing cost models that take into account hash table statistics together with the CPU and data movement costs within the cache hierarchy. Experimental results, based on our prototype implementation, demonstrate performance gains of 2x for typical analytical workloads with no additional overhead for materializing intermediates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Revisiting Reuse in Main Memory Database Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Energy-Efficient Transaction Serialization for IoT Devices
Daniel Evans
Journal of Computer Science Research | VOL. 2
Daniel EvansDaniel Evans
29 May 2020
Journal of Computer Science Research | VOL. 2

A Compression-Based Design for Higher Throughput in a Lock-Free Hash Map
Pedro Moreno ... Ricardo Rocha
-
Pedro Moreno, et. al.Pedro Moreno ... Ricardo Rocha
01 Jan 2020
01 Jan 2020

INFELT STEP: An integrated and interoperable platform for collaborative CAD/CAPP/CAM/CNC machining systems based on STEP standard
Omid F Valilai ... Mahmoud Houshmand
International Journal of Computer Integrated Manufacturing | VOL. 23
Omid F Valilai, et. al.Omid F Valilai ... Mahmoud Houshmand
01 Dec 2010
International Journal of Computer Integrated Manufacturing | VOL. 23

ASH: A Modern Framework for Parallel Spatial Hashing in 3D Perception.
Wei Dong ... Yixing Lao
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Wei Dong, et. al.Wei Dong ... Yixing Lao
01 Jan 2021
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Revisiting Reuse in Main Memory Database Systems

Abstract

Talk to us

Similar Papers