Source code plagiarism detection with low-level structural representation and information retrieval

Oscar Karnalim

doi:10.1080/1206212x.2019.1589944

Abstract

Low-level approach is a recent way for source code plagiarism detection. Instead of relying on source code tokens, it relies on low-level structural representation resulted from compiling given source codes. However, to date, existing low-level approaches are unsuitable for handling large-sized source codes; their comparison takes either quadratic or cubic time complexity. This paper presents a low-level approach with a more-efficient comparison; the comparison only takes linear time complexity with the help of Cosine Correlation in Vector Space Model. According to our evaluation, three findings can be deducted. First, proposed approach can be more efficient and effective than the state-of-the-art approach. Second, low-level structural representation can be more effective and efficient than source code token sequence as a medium for source code plagiarism detection. Third, subsequence length (which is related to n in n-gram) can be inversely proportional to effectiveness without significant impact on efficiency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Source code plagiarism detection with low-level structural representation and information retrieval

Abstract

Talk to us

Similar Papers

More From: International Journal of Computers and Applications

Lead the way for us

Journal: International Journal of Computers and Applications	Publication Date: Mar 19, 2019
Citations: 7

Similar Papers

Academic Source Code Plagiarism Detection by Measuring Program Behavioral Similarity
Hayden Cheers ... Shamus P Smith
IEEE Access | VOL. 9
Hayden Cheers, et. al.Hayden Cheers ... Shamus P Smith
01 Jan 2020
IEEE Access | VOL. 9

Detecting Source Code Plagiarism on .NET Programming Languages using Low-level Representation and Adaptive Local Alignment
Faqih Salban Rabbani ... Oscar Karnalim
Journal of information and organizational sciences | VOL. 41
Faqih Salban Rabbani, et. al.Faqih Salban Rabbani ... Oscar Karnalim
16 Jun 2017
Journal of information and organizational sciences | VOL. 41

Source Code Plagiarism Detection in Academia with Information Retrieval: Dataset and the Observation
Oscar Karnalim ... Hapnes Toba
Informatics in Education | VOL. 18
Oscar Karnalim, et. al.Oscar Karnalim ... Hapnes Toba
16 Oct 2019
Informatics in Education | VOL. 18

DCU@FIRE-2014
Debasis Ganguly ... Gareth J F Jones
-
Debasis Ganguly, et. al.Debasis Ganguly ... Gareth J F Jones
01 Jan 2015
DCU@FIRE-2014
Debasis Ganguly ... Gareth J F Jones

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Source code plagiarism detection with low-level structural representation and information retrieval

Abstract

Talk to us

Similar Papers

More From: International Journal of Computers and Applications