Role Term-Based Semantic Similarity Technique for Idea Plagiarism Detection

Ahmed Hamza Osman,Hani Moetque

doi:10.14569/ijacsa.2018.090861

Abstract

Most of the text mining systems are based on statistical analysis of term frequency. The statistical analysis of term (phrase or word) frequency captures the importance of the term within a document, but the techniques that had been proposed by now still need to be improved in terms of their ability to detect the plagiarized parts, especially for capturing the importance of the term within a sentence. Two terms can have a same frequency in their documents, but one term pays more to the meaning of its sentences than the other term. In this paper, we want to discriminate between the important term and unimportant term in the meaning of the sentences in order to adopt for idea plagiarism detection. This paper introduces an idea plagiarism detection based on semantic meaning frequency of important terms in the sentences. The suggested method analyses and compares text based on a semantic allocation for each term inside the sentence. SRL offers significant advantages when generating arguments for each sentence semantically. Promising experimental has been applied on the CS11 dataset and results revealed that the proposed technique's performance surpasses its recent peer methods of plagiarism detection in terms of Recall, Precision and F-measure.

Highlights

Given the bigness of the online, plagiarism, or the intended use of somebody else’s original data while not acknowledge its supply, has been a heavy drawback in areas like Literature, Science, and Education
Several works had been done in text plagiarism detection based on the lexical and syntactic structure of the writing and failed to detect the semantic and idea plagiarism
Most of these methods are created for verbatim duplicates, and similarity performance is decreased when dealing with plagiarism with heavy cases [2], due to paraphrasing and semantic similarity cases

Summary

Introduction

Given the bigness of the online, plagiarism, or the intended use of somebody else’s original data while not acknowledge its supply, has been a heavy drawback in areas like Literature, Science, and Education. The challenge is exacerbated when the suspected text generated semantically, which is known as idea plagiarism It is not solely the extra problem of manually capturing the concept or idea performed, the people’s lack of information concerning writing ethical issues and text paraphrasing. Several works had been done in text plagiarism detection based on the lexical and syntactic structure of the writing and failed to detect the semantic and idea plagiarism. Most of these methods are created for verbatim duplicates, and similarity performance is decreased when dealing with plagiarism with heavy cases [2], due to paraphrasing and semantic similarity cases. Velásquez and et al [8]; Weber-Wulff [9])

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2018
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Role Term-Based Semantic Similarity Technique for Idea Plagiarism Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

Towards practical genre classification of web documents
George Ferizis ... Peter Bailey
-
George Ferizis, et. al.George Ferizis ... Peter Bailey
23 May 2006
23 May 2006

"Loved ones are not 'visitors' in a patient's life"-The importance of including loved ones in the patient's hospital stay: An international Twitter study of #HospitalsTalkToLovedOnes in times of COVID-19.
Mojca Hriberšek ... Ronita De
Frontiers in Public Health | VOL. 11
Mojca Hriberšek, et. al.Mojca Hriberšek ... Ronita De
26 Jan 2023
Frontiers in Public Health | VOL. 11

Non-Stationary Frequency Analysis of Future Extreme Rainfall using CMIP5 GCMs over the Korean Peninsula
Minsu Jeong ... Sunkwon Yoon
Journal of the Korean Society of Hazard Mitigation | VOL. 18
Minsu Jeong, et. al.Minsu Jeong ... Sunkwon Yoon
30 Apr 2018
Journal of the Korean Society of Hazard Mitigation | VOL. 18

Arabic English Cross-Lingual Plagiarism Detection Based on Keyphrases Extraction, Monolingual and Machine Learning Approach
Mohammed Albared ... Muneer A S Hazaa
Asian Journal of Research in Computer Science | VOL. -
Mohammed Albared, et. al.Mohammed Albared ... Muneer A S Hazaa
13 Feb 2019
Asian Journal of Research in Computer Science | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Role Term-Based Semantic Similarity Technique for Idea Plagiarism Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications