Abstract

Automatic Program Repair (APR) techniques have shown the potential to reduce debugging costs and improve software quality by automatically generating patches that fix bugs. However, they often generate many overfitting patches, which pass a specific test suite but do not fix the bug correctly. This paper proposes MIPI, a novel approach to reducing the number of overfitting patches generated by APR. We leverage recent advances in deep learning to exploit the similarity between the patched method's name (which often encodes the developer's intention for the code) and the semantic meaning of the method's body (which represents the actual implemented behavior) to identify and remove overfitting patches generated by APR tools. Experiments with a large dataset of patches for QuixBugs and Defects4J programs show the promise of our approach. Specifically, out of 1,191 patches generated by 23 existing APR tools, MIPI successfully filters out 254 (32%) of the 797 overfitting patches with a precision of 90% while preserving 93% of the correct patches. MIPI is more precise and less damaging to APR than existing heuristic patch assessment techniques, and it achieves a higher recall than automated testing-based techniques that do not have access to the test oracle. In addition, MIPI is highly complementary to existing automated patch assessment techniques.

Highlights

  • Software is becoming ubiquitous in every aspect of our daily lives, but it often contains bugs

  • Recent studies have shown that a major portion of the plausible patches generated by APR tools are incorrect; such patches are known as overfitting patches

  • We propose a novel patch correctness assessment technique that exploits the developer's intention embedded in the method name


Summary

INTRODUCTION

Software is becoming ubiquitous in every aspect of our daily lives, but it often contains bugs. APR tools often generate many plausible patches that modify the program at a non-buggy location [33]; such patches are probably incorrect even though they are very similar to the original program. To alleviate this issue, we need to reflect the developers' intention behind the original code. Unlike similarity-based approaches, our approach uses the developer's intention enclosed in the meaning of descriptive code elements (e.g., method names), instead of the original code itself, as the origin coordinate for evaluating the correctness of patches. Code understanding models, such as Code2Vec [39], show impressive results in predicting method names and generating text descriptions for code snippets across different projects. Motivated by these successes, we propose leveraging recent advances in deep learning to automatically identify incorrect patches in APR.
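As a rough illustration of this idea (not the authors' actual implementation), the Python sketch below scores a patch by comparing the patched method's declared name against a name representation inferred from the patched body by a Code2Vec-style model. The NameInferenceModel interface, its embedding methods, and the 0.5 threshold are all hypothetical placeholders introduced only for this sketch.

from typing import List
import math

class NameInferenceModel:
    """Placeholder for a trained code-understanding model (e.g., Code2Vec)."""

    def embed_name(self, method_name: str) -> List[float]:
        # A real model maps a method name to a dense semantic vector.
        raise NotImplementedError

    def infer_name_embedding(self, method_body: str) -> List[float]:
        # A real model predicts a name representation from the method body
        # (e.g., from its AST paths).
        raise NotImplementedError

def cosine(u: List[float], v: List[float]) -> float:
    # Standard cosine similarity, guarding against zero-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def looks_overfitting(model: NameInferenceModel,
                      method_name: str,
                      patched_body: str,
                      threshold: float = 0.5) -> bool:
    """Flag a patch whose implemented behavior diverges from the name's intent."""
    declared = model.embed_name(method_name)
    inferred = model.infer_name_embedding(patched_body)
    # Low name/body similarity suggests the patch drifted from the
    # developer's intention and may be overfitting.
    return cosine(declared, inferred) < threshold

Per the paper's outline, the name/body similarity signal feeds a patch correctness classifier rather than a fixed cutoff; the threshold above only keeps the sketch concrete.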

IDENTIFY THE MEANING OF CODE SNIPPETS
PATCH CORRECTNESS CLASSIFIER
DATASET
RESULTS OF RQ1
RESULTS OF RQ2
RQ3: HOW EFFECTIVE IS OUR APPROACH IN IDENTIFYING INCORRECT PATCHES?
RESULTS OF RQ4
METHOD
RESULTS OF RQ5
CONCLUSION