CLCD-I: Cross-Language Clone Detection by Using Deep Learning with InferCode

Mohammad A Yahya,Dae-Kyoo Kim

doi:10.3390/computers12010012

Abstract

Source code clones are common in software development as part of reuse practice. However, they are also often a source of errors compromising software maintainability. The existing work on code clone detection mainly focuses on clones in a single programming language. However, nowadays software is increasingly developed on a multilanguage platform on which code is reused across different programming languages. Detecting code clones in such a platform is challenging and has not been studied much. In this paper, we present CLCD-I, a deep neural network-based approach for detecting cross-language code clones by using InferCode which is an embedding technique for source code. The design of our model is twofold: (a) taking as input InferCode embeddings of source code in two different programming languages and (b) forwarding them to a Siamese architecture for comparative processing. We compare the performance of CLCD-I with LSTM autoencoders and the existing approaches on cross-language code clone detection. The evaluation shows the CLCD-I outperforms LSTM autoencoders by 30% on average and the existing approaches by 15% on average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers	Publication Date: Jan 4, 2023
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CLCD-I: Cross-Language Clone Detection by Using Deep Learning with InferCode

Abstract

Talk to us

Similar Papers

More From: Computers

Lead the way for us

Similar Papers

You Look so Different: Finding Structural Clones and Subclones in Java Source Code
Wolfram Amme ... Thomas S Heinze
-
Wolfram Amme, et. al.Wolfram Amme ... Thomas S Heinze
01 Sep 2021
01 Sep 2021

Insights into Deep Learning and Non-Deep Learning Techniques for Code Clone Detection
Ajinkya Kunjir
-
Ajinkya KunjirAjinkya Kunjir
08 May 2024
08 May 2024

An efficient new multi-language clone detection approach from large source code
Saif Ur Rehman ... Kamran Khan
-
Saif Ur Rehman, et. al.Saif Ur Rehman ... Kamran Khan
01 Oct 2012
01 Oct 2012

Supporting clone analysis with tag cloud visualization
Manamu Sano ... Yuki Yamanaka
-
Manamu Sano, et. al.Manamu Sano ... Yuki Yamanaka
16 Nov 2014
16 Nov 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CLCD-I: Cross-Language Clone Detection by Using Deep Learning with InferCode

Abstract

Talk to us

Similar Papers

More From: Computers