An Experimental Study with Fuzzy-Wuzzy (Partial Ratio) for Identifying the Similarity between English and French Languages for Plagiarism Detection

Peluru Janardhana Rao,Kunjam Nageswara Rao,Sitaratnam Gokuruboyina

doi:10.14569/ijacsa.2022.0131047

Peluru Janardhana Rao, Kunjam Nageswara Rao + Show 1 more

Open Access

https://doi.org/10.14569/ijacsa.2022.0131047

Copy DOI

Abstract

With the rapid growth of digital libraries and language translation tools, it is easy to translate text documents from one language to other, which results in cross-language plagiarism. It is more challenging to identify plagiarism among documents in different languages. The main aim of this paper is to translate the French documents into English to detect plagiarism and to extract bilingual lexicons. The parallel corpus is used to compare multilingual text, a collection of similar sentences and sentences that complement each other. A comparative study is presented in this paper, the sentences similarity in bilingual content is found out by using the proposed Fuzzy-Wuzzy (Partial Ratio) based string similarity technique and three various techniques like Levenshtein Distance, Spacy and Fuzzy-Wuzzy (Ratio) similarity techniques in the literature. The string similarity method based on Fuzzy-Wuzzy (Partial Ratio) outperforms in terms of accuracy compared to Spacy, and Fuzzy-Wuzzy (Ratio) techniques for identifying language similarity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

An Experimental Study with Fuzzy-Wuzzy (Partial Ratio) for Identifying the Similarity between English and French Languages for Plagiarism Detection

Abstract

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

A String Similarity Evaluation for Healthcare Ontologies Alignment to HL7 FHIR Resources
Athanasios Kiourtis ... Dimosthenis Kyriazis
-
Athanasios Kiourtis, et. al.Athanasios Kiourtis ... Dimosthenis Kyriazis
01 Jan 2019
01 Jan 2019

Research on string similarity algorithm based on Levenshtein Distance
Shengnan Zhang ... Yan Hu
-
Shengnan Zhang, et. al.Shengnan Zhang ... Yan Hu
01 Mar 2017
01 Mar 2017

Application of modified Levenshtein distance for classification of noisy business document images
Oleg Slavin ... Elena Andreeva
-
Oleg Slavin, et. al.Oleg Slavin ... Elena Andreeva
05 Mar 2022
05 Mar 2022

A New String Edit Distance and Applications
Taylor Petty ... Jan Hannig
Algorithms | VOL. 15
Taylor Petty, et. al.Taylor Petty ... Jan Hannig
12 Jul 2022
Algorithms | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Experimental Study with Fuzzy-Wuzzy (Partial Ratio) for Identifying the Similarity between English and French Languages for Plagiarism Detection

Abstract

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications