Abstract

Computer-assisted or automatic plagiarism detection can help humans check whether the author of a paper has committed plagiarism. The Department of Electrical Engineering, Universitas Indonesia, has been developing a cross-language automatic plagiarism detection system in which the test paper is written in Indonesian and the reference paper is written in English. A more accurate automatic detection system is needed to prevent plagiarism, especially in academic papers. The system is based on the Latent Semantic Analysis (LSA) algorithm, with a Self-Organizing Map (SOM) added to classify the LSA output. Two of the SOM features are extracted from the singular value matrix produced by LSA: the Frobenius norm and the cosine similarity. Together with the percentage of technical terms, these features are used as SOM inputs for classification into 10, 5, 3, and 2 classes. Using 5 classes in LSA can give equal accuracy across all classes, with the highest accuracy reaching 83.09%. With LSA-SOM, the best accuracy is 83.53% on training data and 80.47% on testing data, obtained in the 2-class configuration with 3 features: percentage of technical terms, Frobenius norm, and pad.
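As a rough illustration of the two LSA-derived features named in the abstract, the sketch below computes a Frobenius norm and a cosine similarity from the singular values of two small term matrices using NumPy. The abstract does not say how the matrices are built or exactly how the features are defined, so the function names (`top_singular_values`, `lsa_features`), the rank `k`, and the choice to compare the two singular value vectors directly are all assumptions made for this example, not the authors' implementation.

```python
import numpy as np


def top_singular_values(term_matrix, k):
    """Return the k largest singular values of a term matrix (hypothetical helper)."""
    s = np.linalg.svd(np.asarray(term_matrix, dtype=float), compute_uv=False)
    return s[:k]  # NumPy returns singular values in descending order


def lsa_features(test_matrix, ref_matrix, k=2):
    """Assumed feature definitions: norms and cosine of the singular value vectors."""
    s_test = top_singular_values(test_matrix, k)
    s_ref = top_singular_values(ref_matrix, k)

    # Compare vectors of equal length in case one matrix has lower dimension.
    n = min(len(s_test), len(s_ref))
    s_test, s_ref = s_test[:n], s_ref[:n]

    # The Frobenius norm of a diagonal singular value matrix equals
    # the Euclidean norm of its diagonal entries.
    frob_test = float(np.linalg.norm(s_test))
    frob_ref = float(np.linalg.norm(s_ref))

    # Cosine similarity between the two singular value vectors.
    denom = frob_test * frob_ref
    cosine = float(s_test @ s_ref) / denom if denom > 0 else 0.0

    return {"frobenius_test": frob_test, "frobenius_ref": frob_ref, "cosine": cosine}


if __name__ == "__main__":
    # Toy matrices, not data from the paper.
    test_paper = [[3, 0], [0, 4]]
    reference_paper = [[1, 0], [0, 0]]
    print(lsa_features(test_paper, reference_paper))
```

In a pipeline like the one described, values of this kind would be combined with the percentage of technical terms to form the input vector passed to the SOM classifier.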

