A Comparison of Source Code Plagiarism Detection Engines

Thomas Lancaster, Fintan Culwin

doi:10.1080/08993400412331363843


Journal: Computer Science Education
Publication Date: Jun 1, 2004
Citations: 104

Affiliation: Birmingham City University, London South Bank University

#Common Substrings #Student Submissions


Abstract

Automated techniques for finding plagiarism in student source code submissions have been in use for over 20 years, and there are many available engines and services. This paper reviews the literature on the major modern detection engines, providing a comparison of them based upon the metrics and techniques they deploy. Generally, the most common and effective techniques are seen to involve tokenising student submissions and then searching pairs of submissions for long common substrings, an example of what is defined to be a paired structural metric. Computing academics are recommended to use one of the two Web-based detection engines, MOSS and JPlag. It is shown that, whilst detection is well established, there are still places where further research would be useful, particularly where visual support of the investigation process is possible.
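
The tokenise-then-compare idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the algorithm used by MOSS or JPlag; it is a plain longest-common-substring search over token sequences, written under the assumption that submissions are Python source. The function names (tokenise, longest_common_run, similarity) and the normalisation by the shorter submission are hypothetical choices made for illustration only.

```python
import io
import keyword
import tokenize

# Token types that carry no structural information for this sketch.
_SKIP = {tokenize.COMMENT, tokenize.NL, tokenize.NEWLINE,
         tokenize.INDENT, tokenize.DEDENT, tokenize.ENDMARKER}


def tokenise(source):
    """Reduce Python source to coarse tokens (hypothetical helper).

    Identifiers, numbers and strings collapse to ID, NUM and STR, so
    renaming variables or changing literals does not hide a copy.
    """
    out = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in _SKIP:
            continue
        if tok.type == tokenize.NAME:
            out.append(tok.string if keyword.iskeyword(tok.string) else "ID")
        elif tok.type == tokenize.NUMBER:
            out.append("NUM")
        elif tok.type == tokenize.STRING:
            out.append("STR")
        else:
            out.append(tok.string)  # operators and punctuation kept as-is
    return out


def longest_common_run(a, b):
    """Length of the longest contiguous run shared by sequences a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            if x == y:
                cur[j] = prev[j - 1] + 1  # extend the run ending at j-1
                best = max(best, cur[j])
        prev = cur
    return best


def similarity(src_a, src_b):
    """Paired score in [0, 1]: longest shared token run over the shorter
    submission's token count (an assumed normalisation)."""
    a, b = tokenise(src_a), tokenise(src_b)
    if not a or not b:
        return 0.0
    return longest_common_run(a, b) / min(len(a), len(b))
```

Because the score is computed for a pair of submissions from their token structure, it is one simple instance of what the abstract calls a paired structural metric. Two submissions that differ only in identifier names would score 1.0 under this sketch.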
