Automated Program Refinement: Guide and Verify Code Large Language Model with Refinement Calculus

Yufan Cai, Zhe Hou, David Sanan, Xiaokun Luan, Yun Lin, Jun Sun, Jin Song Dong

doi:10.1145/3704905

Open Access

https://doi.org/10.1145/3704905

Abstract

Code-centric Large Language Models (LLMs) have reshaped software engineering through low-barrier tools such as Copilot that generate code with little effort. However, LLM-generated code carries no correctness guarantee: the models suffer from hallucination, so their output is risky to use unchecked. In addition, the end-to-end path from specification to code through an LLM is an opaque, uncontrolled black box, which makes the generated code hard for users to understand and trust. Addressing these challenges is both necessary and critical. Program refinement, in contrast, transforms high-level specification statements into executable code while preserving correctness, but traditional refinement tools are designed primarily for formal methods experts and lack automation and extensibility. We apply program refinement to guide the LLM and validate the code it generates, while turning refinement into a more accessible and flexible framework. To initiate this vision, we propose Refine4LLM, an approach that aims to: (1) formally refine the specifications; (2) automatically prompt and guide the LLM using refinement calculus; (3) interact with the LLM to generate the code; (4) verify that the generated code satisfies the constraints, thus guaranteeing its correctness; and (5) learn and build more advanced refinement laws to extend the refinement calculus. We evaluated Refine4LLM against state-of-the-art baselines on program refinement and LLM benchmarks. The experimental results show that Refine4LLM can efficiently generate more robust code and reduce the time for refinement and verification.
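To make the idea of checking generated code against a specification statement concrete, here is a minimal sketch. It is not Refine4LLM and does not reproduce the paper's refinement laws or verifier. It assumes a specification is a precondition/postcondition pair, uses hand-written functions as stand-ins for LLM-generated candidates, and swaps formal verification for bounded checking over a finite set of inputs, which gives evidence rather than a correctness guarantee. All names (`Spec`, `satisfies`, `first_accepted`, `ISQRT_SPEC`, `halve`, `linear_isqrt`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class Spec:
    """Hypothetical specification statement: a precondition on the input
    and a postcondition relating the input to the output."""
    pre: Callable[[int], bool]
    post: Callable[[int, int], bool]


def satisfies(candidate: Callable[[int], int], spec: Spec,
              inputs: Iterable[int]) -> bool:
    """Bounded check (NOT a proof): for every input that meets the
    precondition, the candidate's output must meet the postcondition."""
    for x in inputs:
        if not spec.pre(x):
            continue  # inputs outside the precondition are unconstrained
        try:
            y = candidate(x)
        except Exception:
            return False  # a crash on a valid input counts as a violation
        if not spec.post(x, y):
            return False
    return True


def first_accepted(candidates: Iterable[Callable[[int], int]], spec: Spec,
                   inputs: Iterable[int]) -> Optional[Callable[[int], int]]:
    """Return the first candidate that passes the bounded check, else None."""
    sample = list(inputs)
    for candidate in candidates:
        if satisfies(candidate, spec, sample):
            return candidate
    return None


# Example specification: integer square root.
# pre: n >= 0; post: r >= 0 and r*r <= n < (r+1)*(r+1)
ISQRT_SPEC = Spec(
    pre=lambda n: n >= 0,
    post=lambda n, r: r >= 0 and r * r <= n < (r + 1) * (r + 1),
)


def halve(n: int) -> int:
    """Stand-in for a plausible but wrong generated candidate."""
    return n // 2


def linear_isqrt(n: int) -> int:
    """Stand-in for a correct generated candidate."""
    r = 0
    while (r + 1) * (r + 1) <= n:
        r += 1
    return r


if __name__ == "__main__":
    chosen = first_accepted([halve, linear_isqrt], ISQRT_SPEC, range(200))
    print(chosen.__name__ if chosen else "no candidate accepted")
```

In this sketch a rejected candidate is simply skipped. A guided workflow of the kind the abstract describes would instead feed the violated constraint back into the next prompt, and would replace the bounded check with a formal verifier.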

More From: Proceedings of the ACM on Programming Languages

Similar Papers

Can LLM Replace Stack Overflow? A Study on Robustness and Reliability of Large Language Model Code Generation
Li Zhong ... Zilong Wang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

The performance of the LSTM-based code generated by Large Language Models (LLMs) in forecasting time series data
Saroj Gopali ... Akbar Siami Namin
Natural Language Processing Journal | VOL. 9
27 Nov 2024

A tutorial on open-source large language models for behavioral science
Zak Hussain ... Dirk U Wulff
Behavior Research Methods | VOL. 56
15 Aug 2024

754 Prediction of 30-day All-Cause Readmission of Neurosurgery Patients Using Large Language Models
Lavender Jiang ... Cordelia Marcela Orillac
Neurosurgery | VOL. 70
01 Apr 2024
