Identification of high-level concept clones in source code

A Marcus,J.I Maletic

doi:10.1109/ase.2001.989796

Abstract

Source code duplication occurs frequently within large software systems. Pieces of source code, functions, and data types are often duplicated in part or in whole, for a variety of reasons. Programmers may simply be reusing a piece of code via copy and paste or they may be re-inventing the wheel. Previous research on the detection of clones is mainly focused on identifying pieces of code with similar (or nearly similar) structure. Our approach is to examine the source code text (comments and identifiers) and identify implementations of similar high-level concepts (e.g., abstract data types). The approach uses an information retrieval technique (i.e., latent semantic indexing) to statically analyze the software system and determine semantic similarities between source code documents (i.e., functions, files, or code segments). These similarity measures are used to drive the clone detection process. The intention of our approach is to enhance and augment existing clone detection methods that are based on structural analysis. This synergistic use of methods will improve the quality of clone detection. A set of experiments is presented that demonstrate the usage of semantic similarity measure to identify clones within a version of NCSA Mosaic.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identification of high-level concept clones in source code

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining
Feng Zhang ... Qingtian Zeng
Scientific Programming | VOL. 2020
Feng Zhang, et. al.Feng Zhang ... Qingtian Zeng
07 Jul 2020
Scientific Programming | VOL. 2020

Automatic Code Review by Learning the Revision of Source Code
Shu-Ting Shi ... David Lo
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Shu-Ting Shi, et. al.Shu-Ting Shi ... David Lo
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

Automatically inferring concern code from program investigation activities
M.P Robillard ... G.C Murphy
-
M.P Robillard, et. al.M.P Robillard ... G.C Murphy
06 Oct 2003
06 Oct 2003

Software Clone Management Towards Industrial Application (Dagstuhl Seminar 12071)
...
-
, et. al. ...
01 Jan 2012
Software Clone Management Towards Industrial Application (Dagstuhl Seminar 12071)
...

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of high-level concept clones in source code

Abstract

Talk to us

Similar Papers