Predicting Patch Correctness Based on the Similarity of Failing Test Cases

Haoye Tian,Yinghua Li,Andrew Habib,Jacques Klein,Abdoul Kader Kaboré,Tegawendé F Bissyandé,Weiguo Pian,Kui Liu

doi:10.1145/3511096

Abstract

How do we know a generated patch is correct? This is a key challenging question that automated program repair (APR) systems struggle to address given the incompleteness of available test suites. Our intuition is that we can triage correct patches by checking whether each generated patch implements code changes (i.e., behavior) that are relevant to the bug it addresses. Such a bug is commonly specified by a failing test case. Towards predicting patch correctness in APR, we propose a novel yet simple hypothesis on how the link between the patch behavior and failing test specifications can be drawn: similar failing test cases should require similar patches . We then propose BATS , an unsupervised learning-based approach to predict patch correctness by checking patch B ehavior A gainst failing T est S pecification. BATS exploits deep representation learning models for code and patches: For a given failing test case, the yielded embedding is used to compute similarity metrics in the search for historical similar test cases to identify the associated applied patches, which are then used as a proxy for assessing the correctness of the APR-generated patches. Experimentally, we first validate our hypothesis by assessing whether ground-truth developer patches cluster together in the same way that their associated failing test cases are clustered. Then, after collecting a large dataset of 1,278 plausible patches (written by developers or generated by 32 APR tools), we use BATS to predict correct patches: BATS achieves AUC between 0.557 to 0.718 and recall between 0.562 and 0.854 in identifying correct patches. Our approach outperforms state-of-the-art techniques for identifying correct patches without the need for large labeled patch datasets—as is the case with machine learning-based approaches. While BATS is constrained by the availability of similar test cases, we show that it can still be complementary to existing approaches: When combined with a recent approach that relies on supervised learning, BATS improves the overall recall in detecting correct patches. We finally show that BATS is complementary to the state-of-the-art PATCH-SIM dynamic approach for identifying correct patches generated by APR tools.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Predicting Patch Correctness Based on the Similarity of Failing Test Cases

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Journal: ACM Transactions on Software Engineering and Methodology	Publication Date: Aug 22, 2022
Citations: 13

Similar Papers

More Reliable Test Suites for Dynamic APR by using Counterexamples
Amirfarhad Nilizadeh ... Xuan-Bach D Le
-
Amirfarhad Nilizadeh, et. al.Amirfarhad Nilizadeh ... Xuan-Bach D Le
01 Oct 2021
01 Oct 2021

VFix: Value-Flow-Guided Precise Program Repair for Null Pointer Dereferences
Xuezheng Xu ... Jingling Xue
-
Xuezheng Xu, et. al.Xuezheng Xu ... Jingling Xue
01 May 2019
01 May 2019

Exploring True Test Overfitting in Dynamic Automated Program Repair using Formal Methods
Amirfarhad Nilizadeh ... Gary T Leavens
-
Amirfarhad Nilizadeh, et. al.Amirfarhad Nilizadeh ... Gary T Leavens
01 Apr 2021
01 Apr 2021

On the acceptance by code reviewers of candidate security patches suggested by Automated Program Repair tools
Aurora Papotti ... Fabio Massacci
Empirical Software Engineering | VOL. 29
Aurora Papotti, et. al.Aurora Papotti ... Fabio Massacci
03 Aug 2024
Empirical Software Engineering | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predicting Patch Correctness Based on the Similarity of Failing Test Cases

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology