Improve Code Quality Research Articles

The ability to automatically generate code, i.e., program synthesis, is one of the most important applications of artificial intelligence (AI). Currently, two AI techniques are leading the way: large language models (LLMs) and genetic programming (GP) methods, each with its own strengths and weaknesses. While LLMs have shown success in program synthesis from a task description, they often struggle to generate correct code due to ambiguity in task specifications, complex programming syntax, and a lack of reliability in the generated code. Furthermore, their generative nature limits their ability to fix erroneous code through iterative LLM prompting. Grammar-guided genetic programming (G3P), one of the top GP methods, has been shown capable of evolving programs that fit a defined Backus–Naur-form (BNF) grammar based on a set of input/output tests that help guide the search process, while ensuring that the generated code does not include calls to untrustworthy libraries or poorly structured snippets. However, G3P still faces issues generating code for complex tasks. A recent study attempting to combine both approaches (G3P and LLMs) by seeding an LLM-generated program into the initial population of the G3P has shown promising results. However, that approach rapidly loses the seeded information over the evolutionary process, which hinders its performance. In this work, we propose combining an LLM (specifically ChatGPT) with a many-objective G3P (MaOG3P) framework in two parts: (i) provide the LLM-generated code as a seed to the evolutionary process following a grammar-mapping phase that creates an avenue for program evolution and error correction; and (ii) leverage many-objective similarity measures towards the LLM-generated code to guide the search process throughout the evolution. The idea behind using the similarity measures is that the LLM-generated code is likely to be close to the correct fitting code. Our approach compels any generated program to adhere to the BNF grammar, ultimately mitigating security risks and improving code quality. Experiments on a well-known and widely used program synthesis dataset show that our approach successfully improves the synthesis of grammar-fitting code for several tasks.

Since its introduction in November 2022, ChatGPT has rapidly gained popularity due to its remarkable ability in language understanding and human-like responses. ChatGPT, based on the GPT-3.5 architecture, has shown great promise for revolutionizing various research fields, including code generation. However, the reliability and quality of code generated by ChatGPT remain unexplored, raising concerns about potential risks associated with the widespread use of ChatGPT-driven code generation. In this article, we systematically study the quality of 4,066 ChatGPT-generated programs implemented in two popular programming languages, i.e., Java and Python, for 2,033 programming tasks. The goal of this work is threefold. First, we analyze the correctness of ChatGPT on code generation tasks and uncover the factors that influence its effectiveness, including task difficulty, programming language, the time at which tasks were introduced, and program size. Second, we identify and characterize potential issues with the quality of ChatGPT-generated code. Last, we provide insights into how these issues can be mitigated. Experiments highlight that out of 4,066 programs generated by ChatGPT, 2,756 programs are deemed correct, 1,082 programs provide wrong outputs, and 177 programs contain compilation or runtime errors. We further analyze other characteristics of the generated code, such as code style and maintainability, using static analysis tools, and find that 1,930 ChatGPT-generated code snippets suffer from maintainability issues. Subsequently, we investigate ChatGPT’s self-repairing ability and its interaction with static analysis tools to fix the errors uncovered in the previous step. Experiments suggest that ChatGPT can partially address these challenges, improving code quality by more than 20%, but there are still limitations and opportunities for improvement. Overall, our study provides valuable insights into the current limitations of ChatGPT and offers a roadmap for future research and development efforts to enhance the code generation capabilities of artificial intelligence models such as ChatGPT.

Articles published on Improve Code Quality

The Scalable Detection and Resolution of Data Clumps Using a Modular Pipeline with ChatGPT

AI-Driven Innovations in Software Engineering: A Review of Current Practices and Future Directions

Integrating smart contracts into the modeling paradigm to harness the potential of models

Design and Research on Evaluation System of Computer Programming Code Quality

The Role of Artificial Intelligence in Modern Software Engineering

FHIR-Based Arden Syntax Compiler for Clinical Decision Support

Transforming Software Development Through Generative AI: A Systematic Analysis of Automated Development Practices

AI Copilot for the Modern Developer: Leveraging GenAI in Software Development

Enhancing software engineering practices with generative AI: A framework for automated code synthesis and refactoring

The Origin and Opportunities of Developers’ Perceived Code Accountability in Open Source AI Software Development

MORCoRA: Multi-Objective Refactoring Recommendation Considering Review Availability

Advanced test automation techniques for DevOps: Bridging the gap between test-driven development and continuous deployment in agile environments

Enhancing Program Synthesis with Large Language Models Using Many-Objective Grammar-Guided Genetic Programming

The Role of Machine Learning in Software Development

Refining ChatGPT-Generated Code: Characterizing and Mitigating Code Quality Issues

Testability-driven development: An improvement to the TDD efficiency

Automating modern code review processes with code similarity measurement

Research on the Specifics of Applying Artificial Intelligence in Software Engineering

Strategies for implementing or strengthening the DevOps approach in organizations: Analysis and examples

Application of Artificial Intelligence in Programming Education Within a Blended Learning Environment
