Abstract
Source code summarization involves creating brief natural-language descriptions of source code. These descriptions are a key component of software documentation such as JavaDocs. Automatic code summarization is a prized target of software engineering research, due to the high value summaries have for programmers and the simultaneously high cost of writing and maintaining documentation by hand. Current work is almost entirely based on machine models trained on large datasets: examples of code paired with summaries of that code are used to train, for example, an encoder-decoder neural model. The model's output predictions are then evaluated against a set of reference summaries, where the input is code the model has not seen and each prediction is compared to a reference. The means by which a prediction is compared to a reference is essentially word overlap, calculated via a metric such as BLEU or ROUGE. The problem with word overlap is that not all words in a sentence have the same importance, and many words have synonyms. The result is that the calculated similarity may not match the similarity perceived by human readers. In this paper, we conduct an experiment to measure the degree to which various word-overlap metrics correlate with human-rated similarity of predicted and reference summaries. We evaluate alternatives based on current work in semantic similarity metrics and offer recommendations for the evaluation of source code summarization.
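To make the abstract's core observation concrete, the following is a minimal illustrative sketch (not the paper's evaluation code, and the function name and example summaries are hypothetical): a toy unigram-overlap score in the spirit of BLEU/ROUGE, showing how a prediction that uses synonyms can receive a low word-overlap score even though a human reader would judge it very similar to the reference.

```python
def unigram_overlap(prediction: str, reference: str) -> float:
    """Fraction of predicted words that also appear in the reference.

    A simplified stand-in for word-overlap metrics such as BLEU/ROUGE;
    real metrics also consider n-grams, brevity, and recall.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = set(reference.lower().split())
    if not pred_tokens:
        return 0.0
    matches = sum(1 for token in pred_tokens if token in ref_tokens)
    return matches / len(pred_tokens)


# Hypothetical predicted and reference summaries with the same meaning
# but different word choices ("deletes"/"removes", "element"/"node").
reference = "removes the given node from the linked list"
prediction = "deletes the specified element from the list"

# Only 4 of the 7 predicted words match literally (~0.57), even though
# human readers would rate the two summaries as nearly equivalent.
print(f"unigram overlap = {unigram_overlap(prediction, reference):.2f}")
```

Semantic similarity metrics aim to close this gap by comparing meaning (for example, via word or sentence embeddings) rather than literal token matches, which is the class of alternatives evaluated in this paper.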