Abstract
Source code documentation often contains summaries of the source code written by its authors. Recently, automatic source code summarization tools have emerged that generate such summaries without author intervention. These summaries are designed to help readers understand the high-level concepts of the source code. Unfortunately, there is no agreed-upon understanding of what makes a "good summary." This paper presents an empirical study examining summaries of source code written by authors, readers, and automatic source code summarization tools. The study measures the textual similarity between source code and its summaries using Short Text Semantic Similarity metrics. We found that readers use source code in their summaries more than authors do. Additionally, the study finds that the accuracy of a human-written summary can be estimated from the textual similarity of that summary to the source code.
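As a rough illustration of the kind of comparison described above (not the metrics used in the study), the following minimal sketch computes a simple lexical similarity between a code fragment and a candidate summary using cosine similarity over TF-IDF vectors; the example strings and variable names are hypothetical, and a true Short Text Semantic Similarity metric would account for word-level semantics rather than shared tokens alone.

```python
# Illustrative sketch only: a simple lexical proxy for comparing a summary to
# source code. The paper's actual Short Text Semantic Similarity metrics are
# not reproduced here; the inputs below are hypothetical examples.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

code_text = "def read_file(path): return open(path).read()"
summary = "Reads the contents of a file at the given path."

# Vectorize both short texts in a shared vocabulary, then compare them.
vectors = TfidfVectorizer().fit_transform([code_text, summary])
similarity = cosine_similarity(vectors[0], vectors[1])[0][0]
print(f"Lexical similarity between code and summary: {similarity:.3f}")
```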