Modular, compositional, and executable formal semantics for LLVM IR

Yannick Zakowski,Irene Yoon,Vadim Zaliva,Ilia Zaichuk,Steve Zdancewic,Calvin Beck

doi:10.1145/3473572

Abstract

This paper presents a novel formal semantics, mechanized in Coq, for a large, sequential subset of the LLVM IR. In contrast to previous approaches, which use relationally-specified operational semantics, this new semantics is based on monadic interpretation of interaction trees, a structure that provides a more compositional approach to defining language semantics while retaining the ability to extract an executable interpreter. Our semantics handles many of the LLVM IR's non-trivial language features and is constructed modularly in terms of event handlers, including those that deal with nondeterminism in the specification. We show how this semantics admits compositional reasoning principles derived from the interaction trees equational theory of weak bisimulation, which we extend here to better deal with nondeterminism, and we use them to prove that the extracted reference interpreter faithfully refines the semantic model. We validate the correctness of the semantics by evaluating it on unit tests and LLVM IR programs generated by HELIX.

Highlights

The CompCert [Leroy 2009] C compiler was pivotal to the history of verified compilation, paving the way to large-scale software verification of real-world programming languages [Ringer et al 2019]
We focus on the LLVM framework [Lattner and Adve 2004] because it is an attractive target for formal verification: it is a widely used, industrial-strength codebase; its intermediate representation (IR) provides a comparatively small and reasonably well-defined core language; and many of its analyses, program transformations, and optimizations, operate entirely at the level of the LLVM IR itself
We present VIR, a compositional, modular and executable formal semantics in Coq for a realistic sequential subset of LLVM IR

Summary

INTRODUCTION

The CompCert [Leroy 2009] C compiler was pivotal to the history of verified compilation, paving the way to large-scale software verification of real-world programming languages [Ringer et al 2019]. We draw on classic ideas about how to structure monadic interpreters [Steele 1994] and make heavy use of interaction trees [Xia et al 2020], a recent Coq formalism that provides (1) expressive monadic combinators for defining compositional semantics, (2) effect handlers for the modular interpretation of effectful programs, and (3) a coinductive implementation that can be extracted into an executable definitional interpreter These features allow for a strong separation of concerns: each syntactic sub-component can be given a selfcontained meaning, and each effect of the language can be defined in isolation via an effect handler.

VIR: A FORMALIZATION OF LLVM IR

Syntax

Dynamic Values

BACKGROUND

A MODULAR LLVM SEMANTICS

An Inventory of LLVM’s Events

Handling Events

Stitching the Semantics Together

VIR EQUIVALENCES AND REFINEMENT

ITree Equivalences and Refinement Relations

Interpretation into P

Equational Theory for VIR

VIR Refinements

Floyd-Hoare-Style Forward Relational Reasoning

Expressing Functional Properties of VIR: a Derived Unary Program Logic

CASE STUDY

A Sound ITree-Based Semantics for FHCOL

REFERENCE INTERPRETER VALIDATION

RELATED WORK AND DISCUSSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the ACM on programming languages	Publication Date: Aug 19, 2021
Citations: 11	License type: cc-by

R Discovery Prime

R Discovery Prime

Modular, compositional, and executable formal semantics for LLVM IR

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the ACM on programming languages

Lead the way for us

Similar Papers

K-LLVM: A Relatively Complete Semantics of LLVM IR.
...
-
, et. al. ...
01 Nov 2020
01 Nov 2020

LLMC: Verifying High-Performance Software
Freark I Van Der Berg
-
Freark I Van Der BergFreark I Van Der Berg
01 Jan 2020
01 Jan 2020

Compositional semantics: An introduction to the syntax/semantics interface by Pauline Jacobson
Robert Henderson
Language | VOL. 93
Robert HendersonRobert Henderson
01 Jan 2017
Language | VOL. 93

DFSGraph: Data Flow Semantic Model for Intermediate Representation Programs Based on Graph Network
Ke Tang ... Fudong Liu
Electronics | VOL. 11
Ke Tang, et. al.Ke Tang ... Fudong Liu
08 Oct 2022
Electronics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modular, compositional, and executable formal semantics for LLVM IR

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the ACM on programming languages