Detection of Source Code Plagiarism Utilizing an Approach Based on Machine Learning

Raddam Sami Mehsen,Hiren D Joshi

doi:10.47839/ijc.23.1.3438

Abstract

Academic institutions, which often publish papers and journals, are ideal testing grounds for the efficacy of counterfeit detection methods. Plagiarism occurs when someone uses the words of another writer without giving that writer proper credit. The proliferation of freeware text editors and the increasing availability of scientific materials online have made the detection of plagiarism a pressing concern; however, the detection of plagiarism in the source code presents a particularly difficult problem. Plagiarism detection algorithms for identification systems and software source code have been the subject of numerous academic investigations. The proposed method combines TF-IDF transformations with K-means clustering to achieve a 99.2% accuracy rate when detecting instances of plagiarism in the source code. This is because it groups similar lines of code together. On the other hand, in comparison to the outcomes produced by the random forest algorithm, the ones that it generates are significantly better. The performance of the MOSS system that was already in place was inferior to that of the system that was used for 90% and 80% of the training set. When contrasting the results, some parameters for evaluation that are considered include precision, recall, and F-measure. The proposed system is implemented in Jupyter Notebook 7 and Python. Also, graphic user interface is designed and implemented to give user friendly experience to the users.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detection of Source Code Plagiarism Utilizing an Approach Based on Machine Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Computing

Lead the way for us

Journal: International Journal of Computing	Publication Date: Apr 1, 2024
License type: cc-by

Similar Papers

A state of art on source code plagiarism detection
Mayank Agrawal ... Dilip Kumar Sharma
-
Mayank Agrawal, et. al.Mayank Agrawal ... Dilip Kumar Sharma
01 Oct 2016
01 Oct 2016

EPlag: A two layer source code plagiarism detection system
Omer Ajmal ... Tenvir Ali
-
Omer Ajmal, et. al.Omer Ajmal ... Tenvir Ali
01 Sep 2013
01 Sep 2013

CPDP: a robust technique for plagiarism detection in source code
...
-
, et. al. ...
19 May 2013
19 May 2013

CPDP: A robust technique for plagiarism detection in source code
Basavaraju Muddu ... Vasudev Bhat
-
Basavaraju Muddu, et. al.Basavaraju Muddu ... Vasudev Bhat
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detection of Source Code Plagiarism Utilizing an Approach Based on Machine Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Computing