Q-grams-imp: an improved q-grams algorithm aimed at edit similarity join

Yunxia Liu,Zhaobin Liu

doi:10.1504/ijcse.2016.10008631

Abstract

Similarity join is more and more important in many applications and has attracted wide-spread attention from scholars and communities. Similarity join has been used in many applications, such as spell checking, copy detection, entity linking, pattern recognition and so on. Actually, in many web and enterprise scenarios, where typos and misspellings often occur, we need to find an efficient algorithm to handle these situations. In this paper, we propose an improved algorithm on q-grams called q-grams-imp that is aimed at solving edit similarity join. We use this algorithm in order to reduce the number of tokens and thus reduce space costs, so it is fit best for same size strings. But for different sizes of strings, we need to handle these strings in order to fit for the algorithm. Finally, we conclude and get the results that our proposed algorithm is better than the traditional method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Q-grams-imp: an improved q-grams algorithm aimed at edit similarity join

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Science and Engineering

Lead the way for us

Similar Papers

Q-grams-imp: an improved q-grams algorithm aimed at edit similarity join
Yunxia Liu ... Zhaobin Liu
International Journal of Computational Science and Engineering | VOL. 18
Yunxia Liu, et. al.Yunxia Liu ... Zhaobin Liu
01 Jan 2019
International Journal of Computational Science and Engineering | VOL. 18

Dt-Duiveltje
J.J Zuidema ... J Weber
Toegepaste Taalwetenschap in Artikelen | VOL. 35
J.J Zuidema, et. al.J.J Zuidema ... J Weber
01 Jan 1989
Toegepaste Taalwetenschap in Artikelen | VOL. 35

CNN Feature-Based Image Copy Detection with Contextual Hash Embedding
Zhili Zhou ... Yuecheng Su
Mathematics | VOL. 8
Zhili Zhou, et. al.Zhili Zhou ... Yuecheng Su
17 Jul 2020
Mathematics | VOL. 8

Neighbouring Proximity - An Key Impact Factor of Deep Machine Learning
Hongyuan Shi ... Yunke Li
-
Hongyuan Shi, et. al.Hongyuan Shi ... Yunke Li
01 Jul 2018
01 Jul 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Q-grams-imp: an improved q-grams algorithm aimed at edit similarity join

Abstract

Talk to us

Similar Papers

More From: International Journal of Computational Science and Engineering