Flan: An Expressive and Efficient Datalog Compiler for Program Analysis

Supun Abeysinghe,Anxhelo Xhebraj,Tiark Rompf

doi:10.1145/3632928

Supun Abeysinghe, Anxhelo Xhebraj + Show 1 more

Open Access

https://doi.org/10.1145/3632928

Copy DOI

Abstract

Datalog has gained prominence in program analysis due to its expressiveness and ease of use. Its generic fixpoint resolution algorithm over relational domains simplifies the expression of many complex analyses. The performance and scalability issues of early Datalog approaches have been addressed by tools such as Soufflé through specialized code generation. Still, while pure Datalog is expressive enough to support a wide range of analyses, there is a growing need for extensions to accommodate increasingly complex analyses. This has led to the development of various extensions, such as Flix, Datafun, and Formulog, which enhance Datalog with features like arbitrary lattices and SMT constraints. Most of these extensions recognize the need for full interoperability between Datalog and a full-fledged programming language, a functionality that high-performance systems like Soufflé lack. Specifically, in most cases, they construct languages from scratch with first-class Datalog support, allowing greater flexibility. However, this flexibility often comes at the cost of performance due to the conflicting requirements of prioritizing modularity and abstraction over efficiency. Consequently, achieving both flexibility and compilation to highly-performant specialized code poses a significant challenge. In this work, we reconcile the competing demands of expressiveness and performance with Flan, a Datalog compiler fully embedded in Scala that leverages multi-stage programming to generate specialized code for enhanced performance. Our approach combines the flexibility of Flix with Soufflé’s performance, offering seamless integration with the host language that enables the addition of powerful extensions while generating specialized code for the entire computation. Flan’s simple operator interface allows the addition of an extensive set of features, including arbitrary aggregates, user-defined functions, and lattices, with multiple execution strategies such as binary and multi-way joins, supported by different indexing structures like specialized trees and hash tables, with minimal effort. We evaluate our system on a variety of benchmarks and compare it to established Datalog engines. Our results demonstrate competitive performance and speedups in the range of 1.4× to 12.5× compared to state-of-the-art systems for workloads of practical importance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the ACM on Programming Languages	Publication Date: Jan 5, 2024
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Flan: An Expressive and Efficient Datalog Compiler for Program Analysis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages

Lead the way for us

Similar Papers

Vector and matrix operations programmed with UDFs in a relational DBMS
Carlos Ordonez ... Javier García-García
-
Carlos Ordonez, et. al.Carlos Ordonez ... Javier García-García
01 Jan 2006
01 Jan 2006

Wormhole
Xingbo Wu ... Fan Ni
-
Xingbo Wu, et. al.Xingbo Wu ... Fan Ni
25 Mar 2019
25 Mar 2019

Flat combining and the synchronization-parallelism tradeoff
Danny Hendler ... Moran Tzafrir
-
Danny Hendler, et. al.Danny Hendler ... Moran Tzafrir
13 Jun 2010
13 Jun 2010

Tool Demonstration: Silver Extensible Compiler Frameworks and Modular Language Extensions for Java and C
Eric Wyk ... Eric Johnson
-
Eric Wyk, et. al.Eric Wyk ... Eric Johnson
01 Sep 2006
01 Sep 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Flan: An Expressive and Efficient Datalog Compiler for Program Analysis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages