Conservative Extension Research Articles

Large language models (LLMs) have reshaped the landscape of program synthesis. However, contemporary LLM-based code completion systems often hallucinate broken code because they lack appropriate code context, particularly when working with definitions that are neither in the training data nor near the cursor. This paper demonstrates that tighter integration with the type and binding structure of the programming language in use, as exposed by its language server, can help address this contextualization problem in a token-efficient manner. In short, we contend that AIs need IDEs, too! In particular, we integrate LLM code generation into the Hazel live program sketching environment. The Hazel Language Server is able to identify the type and typing context of the hole that the programmer is filling, with Hazel's total syntax and type error correction ensuring that a meaningful program sketch is available whenever the developer requests a completion. This allows the system to prompt the LLM with codebase-wide contextual information that is not lexically local to the cursor, nor necessarily in the same file, but that is likely to be semantically local to the developer's goal. Completions synthesized by the LLM are then iteratively refined via further dialog with the language server, which provides error localization and error messages. To evaluate these techniques, we introduce MVUBench, a dataset of model-view-update (MVU) web applications with accompanying unit tests that have been written from scratch to avoid data contamination, and that can easily be ported to new languages because they do not have large external library dependencies. These applications serve as challenge problems due to their extensive reliance on application-specific data structures. Through an ablation study, we examine the impact of contextualization with type definitions, function headers, and error messages, individually and in combination. We find that contextualization with type definitions is particularly impactful. After introducing our ideas in the context of Hazel, a low-resource language, we duplicate our techniques and port MVUBench to TypeScript in order to validate the applicability of these methods to higher-resource mainstream languages. Finally, we outline ChatLSP, a conservative extension to the Language Server Protocol (LSP) that language servers can implement to expose capabilities that AI code completion systems of various designs can use to incorporate static context when generating prompts for an LLM.
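
The workflow this abstract describes (gather static context for a typed hole, prompt the model, then refine the completion using error messages) can be sketched roughly as follows. This is a minimal illustration, not the Hazel Language Server or ChatLSP API: `HoleContext`, `build_prompt`, `complete_hole`, `fake_generate`, and `fake_check` are all hypothetical names, and the Hazel-like snippets in the strings are illustrative only.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class HoleContext:
    """Static facts a language server might report for a typed hole (hypothetical shape)."""
    expected_type: str
    type_definitions: Dict[str, str]  # type name -> definition text
    function_headers: List[str]       # signatures of relevant functions in scope


def build_prompt(sketch: str, ctx: HoleContext, errors: List[str]) -> str:
    """Assemble a prompt from the program sketch plus static context and prior errors."""
    parts = ["# Program sketch", sketch,
             "# Expected type of the hole", ctx.expected_type,
             "# Relevant type definitions"]
    parts += [ctx.type_definitions[name] for name in sorted(ctx.type_definitions)]
    parts += ["# Relevant function headers"] + ctx.function_headers
    if errors:
        parts += ["# Errors in the previous completion"] + errors
    return "\n".join(parts)


def complete_hole(sketch: str, ctx: HoleContext,
                  generate: Callable[[str], str],
                  check: Callable[[str], List[str]],
                  max_rounds: int = 3) -> str:
    """Ask for a completion, then re-prompt with static errors until none remain."""
    errors: List[str] = []
    completion = ""
    for _ in range(max_rounds):
        completion = generate(build_prompt(sketch, ctx, errors))
        errors = check(completion)
        if not errors:
            break
    return completion


def fake_generate(prompt: str) -> str:
    # Toy stand-in for an LLM: it only corrects itself once it is shown an error.
    if "Errors in the previous completion" in prompt:
        return "AddTodo(text)"
    return "AddItem(text)"


def fake_check(completion: str) -> List[str]:
    # Toy stand-in for the language server's static checks.
    if completion.startswith("AddItem"):
        return ["Constructor AddItem is not defined; expected a value of type Action"]
    return []


ctx = HoleContext(
    expected_type="Action",
    type_definitions={"Action": "type Action = AddTodo(String) | RemoveTodo(Int)"},
    function_headers=["update : (Model, Action) -> Model"],
)
result = complete_hole("let action : Action = ?", ctx, fake_generate, fake_check)
```

In this toy run the first completion uses an undefined constructor, the checker reports it, and the second round returns `AddTodo(text)`. The point is only the shape of the loop: the prompt carries type definitions and headers that need not be near the cursor, and errors feed back into the next prompt.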

Failed Error Propagation greatly reduces the effectiveness of software testing by masking faults present in the code. It occurs when the System Under Test executes a faulty statement and the state of the system is affected by the fault, but the expected output is still observed. It is therefore essential to assess its impact on the testing process. Squeeziness has been shown to be a useful measure to assess the likelihood of fault masking in deterministic systems. The main goal of this paper is to define a new Squeeziness notion that can be used in a scenario where we may have non-deterministic behaviours. The new notion should be a conservative extension of the previous one. In addition, it is necessary to evaluate whether the new notion appropriately estimates the likelihood that a component of a system introduces Failed Error Propagation. We defined our black-box scenario where non-deterministic behaviours might appear. Next, we presented a new Squeeziness notion that can be used in this scenario. Finally, we carried out different experiments to evaluate the usefulness of our proposal as an appropriate estimation of the likelihood of Failed Error Propagation. We found a high correlation between our new Squeeziness notion and the likelihood of Failed Error Propagation in non-deterministic systems. We also found that the extra computation time with respect to the deterministic version of Squeeziness was negligible. Our new Squeeziness notion is a good measure to estimate the likelihood of Failed Error Propagation being introduced by a component of a system (potentially) showing non-deterministic behaviours. Since it is a conservative extension of the original notion, and the extra time needed to compute it compared with the former notion is very small, we conclude that the new notion can be safely used to assess the likelihood of fault masking in deterministic systems.
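
For orientation, the deterministic notion that this abstract builds on can be sketched as the information lost by a function: the entropy of its inputs minus the entropy of its outputs. The sketch below assumes that definition and a uniform distribution over a finite input domain. It does not implement the paper's non-deterministic notion, and the names `entropy` and `squeeziness` are illustrative.

```python
import math
from collections import Counter
from typing import Callable, Hashable, Iterable


def entropy(counts: Iterable[int]) -> float:
    """Shannon entropy, in bits, of the distribution given by raw counts."""
    counts = list(counts)
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def squeeziness(f: Callable[[Hashable], Hashable], domain: Iterable[Hashable]) -> float:
    """Entropy lost by f, assuming each listed input is equally likely.

    Assumed definition: H(inputs) - H(outputs) for a deterministic f.
    """
    inputs = list(domain)
    input_entropy = entropy(Counter(inputs).values())
    output_entropy = entropy(Counter(f(x) for x in inputs).values())
    return input_entropy - output_entropy


# Three functions over the same 8-element domain (3 bits of input entropy).
identity_loss = squeeziness(lambda x: x, range(8))      # injective: nothing is lost
parity_loss = squeeziness(lambda x: x % 2, range(8))    # keeps 1 bit, loses 2
constant_loss = squeeziness(lambda x: 0, range(8))      # loses all 3 bits
```

Under these assumptions, a function that collapses many inputs onto few outputs has high squeeziness, which is the intuition behind using it to estimate how likely a corrupted internal state is to be masked before it reaches the output.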

Articles published on Conservative Extension

Sequential rough set: a conservative extension of Pawlak’s classical rough set

The Riḥlahs and Road Trips of Modern Iraqi Literature: Ṣafāʾ Khulūṣī’s Abū Nuwās fī Amrīkā

A Case for First-Class Environments

Iris-MSWasm: Elucidating and Mechanising the Security Invariants of Memory-Safe WebAssembly

Statically Contextualizing Large Language Models with Typed Holes

Nonconservative extensions by propositional quantifiers and modal incompleteness

Multi-level nonstandard analysis and the axiom of choice

Denying Infinity: Pragmatism in Abraham Robinson’s Philosophy of Mathematics

On Combining Intuitionistic and S4 Modal Logic

Admissible extensions of subtheories of second order arithmetic

On provability logics of Niebergall arithmetic

Axiomatization multisets: a comparative analysis

Residuated Basic Logic

The Cardinal Squaring Principle and an Alternative Axiomatization of NFU

BENIGREEN: Blockchain-Based Energy-Efficient Privacy-Preserving Scheme for Green IoT

Dual counterpart intuitionistic logic

Some techniques for reasoning automatically on co-inductive data structures

Squeeziness for non-deterministic systems

A Gradual Probabilistic Lambda Calculus

Quantifying over information change with common knowledge
