Code Generation by Example Using Symbolic Machine Learning

Kevin Lano,Qiaomu Xue

doi:10.1007/s42979-022-01573-4

Kevin Lano, Qiaomu Xue

Open Access

PDF Available

https://doi.org/10.1007/s42979-022-01573-4

Copy DOI

Export

Save

Cite

Journal: SN Computer Science	Publication Date: Jan 17, 2023
Citations: 7	License type: open-access

Affiliation: King's College London

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Code generation is a key technique for model-driven engineering (MDE) approaches of software construction. Code generation enables the synthesis of applications in executable programming languages from high-level specifications in UML or in a domain-specific language. Specialised code generation languages and tools have been defined; however, the task of manually constructing a code generator remains a substantial undertaking, requiring a high degree of expertise in both the source and target languages, and in the code generation language. In this paper, we apply novel symbolic machine learning techniques for learning tree-to-tree mappings of software syntax trees, to automate the development of code generators from source–target example pairs. We evaluate the approach on several code generation tasks, and compare the approach to other code generator construction approaches. The results show that the approach can effectively automate the synthesis of code generators from examples, with relatively small manual effort required compared to existing code generation construction approaches. We also identified that it can be adapted to learn software abstraction and translation algorithms. The paper demonstrates that a symbolic machine learning approach can be applied to assist in the development of code generators and other tools manipulating software syntax trees.

Full Text