Dynamic Compilation of Expressions in SQL Queries for PostgreSQL

E.Y. Sharygin, R.A. Buchatskiy, D.M. Melnik, R.A. Zhuykov, L.V. Skvortsov

doi:10.15514/ispras-2016-28(4)-13

Abstract

In recent years, as the performance and capacity of main and external memory have grown, the performance of database management systems (DBMSes) on certain kinds of queries has become increasingly determined by raw CPU speed. PostgreSQL currently uses an interpreter to execute SQL queries. This incurs overhead from indirect calls to handler functions and from runtime checks, which could be avoided if the query were compiled into native code on the fly, i.e. just-in-time (JIT) compiled: at run time the query itself is known, along with the specific table structure and the data types and built-in functions the query uses. This is especially important for complex queries whose performance is CPU-bound. We have developed a PostgreSQL extension that implements JIT compilation of SQL queries using the LLVM compiler infrastructure. In this paper we show how to implement JIT compilation to speed up the sequential scan operator (SeqScan) as well as expressions in WHERE clauses. We describe some important optimizations that are possible only with dynamic compilation, such as precomputing tuple attribute offsets only for the attributes used by the query. We also discuss the maintainability of our extension, namely automating the translation of PostgreSQL backend functions into LLVM IR so that the same source code serves both our JIT compiler and the existing interpreter. Currently, with LLVM JIT we achieve up to a 5x speedup on synthetic tests compared to the original PostgreSQL interpreter.
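The offset precomputation mentioned in the abstract can be illustrated with a deliberately simplified sketch. It assumes a toy row layout with fixed-width, non-null attributes and no alignment padding, which real PostgreSQL tuples do not guarantee; `ToySchema` and all function names below are hypothetical and are not taken from the extension.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical toy schema: fixed-width, non-null attributes, no padding. */
typedef struct {
    int    natts;
    size_t widths[8];   /* byte width of each attribute */
} ToySchema;

/* Interpreter-style access: re-walks the preceding attributes
 * for every tuple to find where attribute `attno` starts. */
int32_t fetch_int32_generic(const ToySchema *s, const uint8_t *tuple, int attno)
{
    size_t off = 0;
    for (int i = 0; i < attno; i++)
        off += s->widths[i];
    int32_t v;
    memcpy(&v, tuple + off, sizeof v);
    return v;
}

/* Done once per query, and only for attributes the query references. */
size_t precompute_offset(const ToySchema *s, int attno)
{
    size_t off = 0;
    for (int i = 0; i < attno; i++)
        off += s->widths[i];
    return off;
}

/* Per-tuple access with a known offset: a single load, no loop.
 * A JIT compiler could embed `off` as a constant in the generated code. */
int32_t fetch_int32_at(const uint8_t *tuple, size_t off)
{
    int32_t v;
    memcpy(&v, tuple + off, sizeof v);
    return v;
}
```

In this sketch the schema walk moves out of the per-tuple path: `precompute_offset` runs once when the query is prepared, and every scanned tuple then pays only for `fetch_int32_at`.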

Highlights

The same applies to the types of variables and constants, which PostgreSQL stores internally as 64-bit values (Datum): for each type, a function must be written that converts the 64-bit value to a value of the required type and back
As performance and capacity of main and external memory grow, performance of database management systems (DBMSes) on certain kinds of queries is more determined by raw CPU speed
In this paper we show how to implement JIT compilation to speed up sequential scan operator (SeqScan) as well as expressions in WHERE clauses
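The per-type conversion between a 64-bit Datum and a typed value, mentioned in the first highlight, might look roughly like the following. This is a sketch only: `ToyDatum` and the `toy_*` functions are hypothetical stand-ins, not PostgreSQL's actual definitions, and the encoding chosen here (zero-extended low 32 bits for integers, raw bit pattern for doubles) is an assumption made for illustration.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 64-bit container for any value, modeled on the Datum idea. */
typedef uint64_t ToyDatum;

_Static_assert(sizeof(double) == sizeof(ToyDatum),
               "this sketch assumes 8-byte doubles");

/* int32 <-> datum: keep the 32-bit pattern in the low half. */
ToyDatum toy_int32_to_datum(int32_t v)
{
    uint32_t bits;
    memcpy(&bits, &v, sizeof bits);
    return (ToyDatum)bits;
}

int32_t toy_datum_to_int32(ToyDatum d)
{
    uint32_t bits = (uint32_t)d;   /* drop the unused high half */
    int32_t v;
    memcpy(&v, &bits, sizeof v);
    return v;
}

/* double <-> datum: reinterpret the 8-byte pattern. */
ToyDatum toy_float8_to_datum(double v)
{
    ToyDatum d;
    memcpy(&d, &v, sizeof d);
    return d;
}

double toy_datum_to_float8(ToyDatum d)
{
    double v;
    memcpy(&v, &d, sizeof v);
    return v;
}
```

A code generator needs one such pair per supported type, because the generated code operates on typed values while the surrounding executor passes only the opaque 64-bit form.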

Summary

Introduction

Efforts to improve the performance of most relational DBMSes have traditionally focused on optimizing memory access at the cost of less efficient CPU use. Implementing the algebra of relational operators and the iterator model [1] in a DBMS simplifies both building and optimizing query plans and implementing each relational operator individually, but it also incurs significant overhead when the plan is executed. As main memory grows in capacity and its access characteristics improve, the overhead caused by inefficient CPU use becomes increasingly noticeable. One solution is dynamic query compilation, which makes it possible to obtain efficient machine code at run time, optimized for the structure of the specific query, the data types and functions it uses, and database parameters such as the size and schema of the tables involved, index types, and so on. This paper considers dynamic compilation of WHERE clause expressions and of the SeqScan sequential scan method for PostgreSQL [2] using the LLVM compiler infrastructure [3].
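The interpretation overhead described above can be made concrete with a small sketch. It contrasts a tree-walking evaluator that dispatches through function pointers with the straight-line function a dynamic compiler could produce for one specific predicate. The node layout, the function names, and the example predicate `a + b < 10` are all assumptions chosen for illustration; they do not reproduce PostgreSQL's executor structures.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical expression node: each node carries its own handler. */
typedef struct ExprNode ExprNode;
typedef int64_t (*EvalFn)(const ExprNode *node, const int64_t *row);

struct ExprNode {
    EvalFn          eval;       /* handler invoked through an indirect call */
    const ExprNode *left;
    const ExprNode *right;
    int64_t         constant;   /* used by constant nodes */
    int             column;     /* used by column reference nodes */
};

int64_t eval_const(const ExprNode *n, const int64_t *row)
{
    (void)row;
    return n->constant;
}

int64_t eval_column(const ExprNode *n, const int64_t *row)
{
    return row[n->column];
}

int64_t eval_add(const ExprNode *n, const int64_t *row)
{
    return n->left->eval(n->left, row) + n->right->eval(n->right, row);
}

int64_t eval_lt(const ExprNode *n, const int64_t *row)
{
    return n->left->eval(n->left, row) < n->right->eval(n->right, row);
}

/* The same predicate, WHERE a + b < 10 with a and b in columns 0 and 1,
 * written the way generated code could look: no tree, no indirect calls. */
bool where_specialized(const int64_t *row)
{
    return row[0] + row[1] < 10;
}
```

Evaluating the tree for one row costs five indirect calls here, while the specialized version is two loads, an add, and a compare; the gap widens with the number of rows scanned.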

Related Work

Query Processing Architecture in PostgreSQL

Implementing Dynamic Compilation in PostgreSQL

Dynamic Compilation of the SeqScan Scan Method

Moving Away from the Iterator Model

Constant Substitution and Further Specialization

Dynamic Compilation of WHERE Clause Expressions

Precompilation of Built-in Functions

Conclusion


Journal: Proceedings of the Institute for System Programming of the RAS	Publication Date: Jan 1, 2016
Citations: 1	License type: cc-by


Similar Papers

Dynamic Compilation of SQL Queries for PostgreSQL
R.A Buchatskiy ... E.Y Sharygin
Proceedings of the Institute for System Programming of the RAS | VOL. 28
01 Jan 2015

Machine Code Caching in a Dynamic SQL Query Compiler for PostgreSQL
Michael Vyacheslavovich Pantilimonov ... Roman Aleksandrovich Zhuykov
Proceedings of the Institute for System Programming of the RAS | VOL. 32
01 Jan 2020

Machine Code Caching in PostgreSQL Query JIT-Compiler
Michael Pantilimonov ... Ruben Buchatskiy
01 Sep 2019

AOT vs. JIT: impact of profile data on code quality
April W Wade ... Prasad A Kulkarni
ACM SIGPLAN Notices | VOL. 52
21 Jun 2017
