Predictive Comment Updating With Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach

Bo Lin,Shangwen Wang,Xiaoguang Mao,Xin Xia,Zhongxin Liu

doi:10.1109/tse.2022.3185458

Abstract

Just-in-time comment update is a promising way to reduce the burden of developers during software maintenance and evolution. Existing approaches can be divided into two categories: the heuristic-based approach and the deep-learning-based approach. The heuristic-based approach is restricted to a specific type of comment updates (i.e., code-indicative updates), but performs well on such type. The effectiveness of deep-learning-based approach is limited but it can handle diverse comment updates. Considering the complementary advantages of existing approaches, an intuitive idea is to combine them for better performance. To investigate this idea, we first conduct a pre-study experiment which shows that to construct an effective comment updater by combining heuristic-based and deep-learning-based approaches, we need to tackle two main challenges: 1) the heuristic-based approach may bring side effects to cases which cannot be updated by it; and 2) the current deep-learning-based approach is with limited effectiveness. Then, we propose a novel two-phase approach named Toper to cope with these two challenges and effectively perform comment updates. In the first phase, Toper integrates nine distinctive features identified through our large-scale empirical analysis into a predictive model, which can predict whether the contents of the comment updates can be found in the corresponding code changes, namely, the comment updates are code-indicative updates. If so, the updates are then generated by an off-the-shelf heuristic-based approach; otherwise, Toper leverages a deep learning model, which we specially designed for non-code-indicative updates, to infer the new comment based on the old comment and code change. Motivated by our manual observation on the limitation of existing approaches on non-code-indicative updates, our deep learning model adopts the Abstract Syntax Tree path technique, which can capture the program structure information for effectively embedding code changes. Our evaluation shows that our approach outperforms the state-of-the-art by around 20% with respect to the number of correct comments it generates. Via in-depth analysis, we illustrate the rationale of each design decision as well as point out potential directions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Software Engineering	Publication Date: Apr 1, 2023
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Predictive Comment Updating With Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Software Engineering

Lead the way for us

Similar Papers

Fine-grained code changes and bugs: Improving bug prediction

-

01 Jan 2012
01 Jan 2012

Large-scale intent analysis for identifying large-review-effort code changes
Song Wang ... Nachiappan Nagappan
Information and Software Technology | VOL. 130
Song Wang, et. al.Song Wang ... Nachiappan Nagappan
09 Sep 2020
Information and Software Technology | VOL. 130

HatCUP
Hongquan Zhu ... Xincheng He
-
Hongquan Zhu, et. al.Hongquan Zhu ... Xincheng He
16 May 2022
16 May 2022

Improving Just-In-Time Comment Updating via AST Edit Sequence
Jiawen Huang ... Huiqun Yu
International Journal of Software Engineering and Knowledge Engineering | VOL. 32
Jiawen Huang, et. al.Jiawen Huang ... Huiqun Yu
01 Oct 2022
International Journal of Software Engineering and Knowledge Engineering | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predictive Comment Updating With Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Software Engineering