On the relation between context-free grammars and parsing expression grammars

Fabio Mascarenhas, Sérgio Medeiros, Roberto Ierusalimschy

doi:10.1016/j.scico.2014.01.012

Open Access

https://doi.org/10.1016/j.scico.2014.01.012

Journal: Science of Computer Programming
Publication Date: Jan 29, 2014
Citations: 32
License type: publisher-specific-oa

Abstract

Context-Free Grammars (CFGs) and Parsing Expression Grammars (PEGs) have several similarities and a few differences in both their syntax and semantics, but they are usually presented through formalisms that hinder a proper comparison. In this paper we present a new formalism for CFGs that highlights the similarities and differences between them. The new formalism borrows from PEGs the use of parsing expressions and the recognition-based semantics. We show how one way of removing non-determinism from this formalism yields a formalism with the semantics of PEGs. We also prove, based on these new formalisms, that LL(1) grammars define the same language whether interpreted as CFGs or as PEGs, and show that strong-LL(k), right-linear, and LL-regular grammars have simple language-preserving translations from CFGs to PEGs. Because these classes of CFGs can be automatically translated to equivalent PEGs, we can reuse classic top-down grammars in PEG-based tools.
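The LL(1) class mentioned in the abstract can be checked mechanically. The following is a minimal sketch of the textbook FIRST/FOLLOW computation, not code from the paper. The grammar encoding (a dict from non-terminal to a list of alternatives, each a list of symbols), the markers `EPS` and `END`, and all function names are assumptions made for illustration. It also assumes every non-terminal is reachable from the start symbol.

```python
EPS = ""   # assumed marker for the empty string inside FIRST sets
END = "$"  # assumed end-of-input marker (must not be a terminal of the grammar)

def first_of_seq(seq, first, grammar):
    """FIRST set of a sequence of symbols, given FIRST sets of non-terminals."""
    out = set()
    for sym in seq:
        f = first[sym] if sym in grammar else {sym}
        out |= f - {EPS}
        if EPS not in f:
            return out
    out.add(EPS)  # every symbol was nullable (or the sequence was empty)
    return out

def compute_first(grammar):
    """Fixed-point iteration for FIRST of every non-terminal."""
    first = {nt: set() for nt in grammar}
    changed = True
    while changed:
        changed = False
        for nt, alts in grammar.items():
            for alt in alts:
                new = first_of_seq(alt, first, grammar)
                if not new <= first[nt]:
                    first[nt] |= new
                    changed = True
    return first

def compute_follow(grammar, first, start):
    """Fixed-point iteration for FOLLOW of every non-terminal."""
    follow = {nt: set() for nt in grammar}
    follow[start].add(END)
    changed = True
    while changed:
        changed = False
        for nt, alts in grammar.items():
            for alt in alts:
                for idx, sym in enumerate(alt):
                    if sym not in grammar:
                        continue  # terminals have no FOLLOW set
                    rest = first_of_seq(alt[idx + 1:], first, grammar)
                    new = rest - {EPS}
                    if EPS in rest:
                        new |= follow[nt]
                    if not new <= follow[sym]:
                        follow[sym] |= new
                        changed = True
    return follow

def predict_sets(grammar, start):
    """One-symbol lookahead set for each alternative of each non-terminal."""
    first = compute_first(grammar)
    follow = compute_follow(grammar, first, start)
    result = {}
    for nt, alts in grammar.items():
        result[nt] = []
        for alt in alts:
            f = first_of_seq(alt, first, grammar)
            p = f - {EPS}
            if EPS in f:
                p |= follow[nt]
            result[nt].append(p)
    return result

def is_ll1(grammar, start):
    """True when the lookahead sets of sibling alternatives never overlap."""
    for sets in predict_sets(grammar, start).values():
        for a in range(len(sets)):
            for b in range(a + 1, len(sets)):
                if sets[a] & sets[b]:
                    return False
    return True

# S -> a S b | (empty): one symbol of lookahead picks the alternative.
BALANCED = {"S": [["a", "S", "b"], []]}
# S -> A c, A -> a | a b: both alternatives of A start with "a".
AMBIGUOUS_PREFIX = {"S": [["A", "c"]], "A": [["a"], ["a", "b"]]}
```

With these definitions, `is_ll1(BALANCED, "S")` holds and `is_ll1(AMBIGUOUS_PREFIX, "S")` does not. When the lookahead sets are disjoint, at most one alternative can apply at any input position, which gives an intuition for why the order of alternatives stops mattering for such grammars.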

Highlights

Context-Free Grammars (CFGs) are the formalism of choice for describing the syntax of programming languages
We presented a new formalism for context-free grammars that is based on recognizing strings instead of generating them
We adopted a subset of the syntax of parsing expression grammars, and the notion of letting a grammar recognize just part of an input string, to purposefully get a definition for CFGs that is closer to Parsing Expression Grammars (PEGs), yet defines the same class of languages as traditional CFGs
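The recognition-based view in these highlights can be illustrated with a small interpreter. This is a sketch under stated assumptions, not the paper's formal semantics: parsing expressions are encoded as tagged tuples, `cfg_ends` returns every position at which a non-deterministic match of a prefix can end, and `peg_end` returns the single position chosen by ordered choice. All names are hypothetical, both functions loop forever on left-recursive grammars, and "accepts" here is taken to mean that the whole input is consumed.

```python
from typing import Dict, Optional, Set, Tuple

# Assumed encoding of parsing expressions:
# ("eps",) | ("chr", c) | ("var", name) | ("seq", e1, e2) | ("alt", e1, e2)
Expr = Tuple
Grammar = Dict[str, Expr]

def cfg_ends(g: Grammar, e: Expr, s: str, i: int) -> Set[int]:
    """Non-deterministic matching: all positions where a match of e can end."""
    tag = e[0]
    if tag == "eps":
        return {i}
    if tag == "chr":
        return {i + len(e[1])} if s.startswith(e[1], i) else set()
    if tag == "var":
        return cfg_ends(g, g[e[1]], s, i)
    if tag == "seq":
        return {k for j in cfg_ends(g, e[1], s, i)
                  for k in cfg_ends(g, e[2], s, j)}
    if tag == "alt":
        # Unordered choice: keep the results of both alternatives.
        return cfg_ends(g, e[1], s, i) | cfg_ends(g, e[2], s, i)
    raise ValueError(f"unknown expression tag: {tag}")

def peg_end(g: Grammar, e: Expr, s: str, i: int) -> Optional[int]:
    """Deterministic matching: the one end position, or None on failure."""
    tag = e[0]
    if tag == "eps":
        return i
    if tag == "chr":
        return i + len(e[1]) if s.startswith(e[1], i) else None
    if tag == "var":
        return peg_end(g, g[e[1]], s, i)
    if tag == "seq":
        j = peg_end(g, e[1], s, i)
        return None if j is None else peg_end(g, e[2], s, j)
    if tag == "alt":
        # Ordered choice: commit to the first alternative that succeeds.
        j = peg_end(g, e[1], s, i)
        return j if j is not None else peg_end(g, e[2], s, i)
    raise ValueError(f"unknown expression tag: {tag}")

def cfg_accepts(g: Grammar, start: str, s: str) -> bool:
    return len(s) in cfg_ends(g, ("var", start), s, 0)

def peg_accepts(g: Grammar, start: str, s: str) -> bool:
    return peg_end(g, ("var", start), s, 0) == len(s)

# S <- A c,  A <- a / a b
G: Grammar = {
    "S": ("seq", ("var", "A"), ("chr", "c")),
    "A": ("alt", ("chr", "a"), ("seq", ("chr", "a"), ("chr", "b"))),
}
```

On the input `abc`, `cfg_accepts(G, "S", "abc")` is true because `A` may end after either `a` or `ab`, while `peg_accepts(G, "S", "abc")` is false because ordered choice commits to `a` and never retries `ab`. Both interpretations accept `ac`.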

Summary

Introduction

Context-Free Grammars (CFGs) are the formalism of choice for describing the syntax of programming languages. We show that we can transform any LL-regular grammar into a PEG that recognizes the same language: we first prove that right-linear grammars for languages with the prefix property, a property that is easy to achieve, have the same language whether interpreted as CFGs or as PEGs, and then use this result to build lookahead expressions for the alternatives of each non-terminal, based on the regular partition into which each alternative falls. LL(1) grammars are a proper subset of strong-LL(k) grammars, which are in turn a proper subset of LL-regular grammars, so the LL-regular transformation also works on grammars belonging to these simpler classes; however, the simpler classes have more straightforward transformations that merit separate treatment. Given that these classes of top-down CFGs can be automatically translated into equivalent PEGs, we can reuse classic top-down grammars in PEG-based tools.
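The idea of guarding each alternative with a lookahead expression can be sketched with a few PEG combinators. The combinator names (`lit`, `seq`, `choice`, `and_pred`) and the hand-written guards are assumptions for illustration; the paper derives its lookahead expressions systematically, which this sketch does not attempt. The example grammar is S -> A c, A -> a | a b, which needs two symbols of lookahead to choose between the alternatives of A.

```python
# Each parser maps (input, position) to the end position, or None on failure.

def lit(t):
    """Match the literal string t."""
    return lambda s, i: i + len(t) if s.startswith(t, i) else None

def seq(*parsers):
    """Match the parsers one after another."""
    def run(s, i):
        for p in parsers:
            i = p(s, i)
            if i is None:
                return None
        return i
    return run

def choice(*parsers):
    """PEG ordered choice: the first alternative that succeeds wins."""
    def run(s, i):
        for p in parsers:
            j = p(s, i)
            if j is not None:
                return j
        return None
    return run

def and_pred(p):
    """PEG and-predicate: succeed if p matches, but consume no input."""
    return lambda s, i: i if p(s, i) is not None else None

# Naive reading of the CFG as a PEG: A <- a / a b, S <- A c
naive_a = choice(lit("a"), seq(lit("a"), lit("b")))
naive_s = seq(naive_a, lit("c"))

# Guarded version: A <- &(a c) a / &(a b) a b, S <- A c
guarded_a = choice(
    seq(and_pred(seq(lit("a"), lit("c"))), lit("a")),
    seq(and_pred(seq(lit("a"), lit("b"))), lit("a"), lit("b")),
)
guarded_s = seq(guarded_a, lit("c"))
```

Here `naive_s("abc", 0)` fails because the first alternative of A succeeds on `a` and the following `c` does not match, whereas `guarded_s("abc", 0)` returns 3: the guard `&(a c)` rejects the first alternative up front, so the second one is tried. Both versions return 2 on `ac`.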

From CFGs to PEGs

From CFGs to PE-CFGs

1: Natural semantics of the abstract syntax of parsing expressions.

Correspondence between CFGs and PE-CFGs

From PE-CFGs to PEGs

Correspondence with Ford’s Definition

Right-linear and LL-regular Grammars

Related Work

Conclusions

Similar Papers

A semantic framework for PEGs
Sérgio Queiroz De Medeiros ... Carlos Olarte
15 Nov 2020

Lexical Parsing Expression Recognition Schemata
Markus Lumpe
01 Sep 2015

Parsing expression grammars
Bryan Ford
ACM SIGPLAN Notices | VOL. 39
01 Jan 2004
