A privacy preserving efficient protocol for semantic similarity join using long string attributes

Bilal Hawashin,Farshad Fotouhi,Traian Marius Truta

doi:10.1145/1971690.1971696

Abstract

During the similarity join process, one or more sources may not allow sharing the whole data with other sources. In this case, privacy preserved similarity join is required. We showed in our previous work [4] that using long attributes, such as paper abstracts, movie summaries, product descriptions, and user feedbacks, could improve the similarity join accuracy under supervised learning. However, the existing secure protocols for similarity join methods can not be used to join tables using these long attributes. Moreover, the majority of the existing privacy-preserving protocols did not consider the semantic similarities during the similarity join process. In this paper, we introduce a secure efficient protocol to semantically join tables when the join attributes are long attributes. Furthermore, instead of using machine learning methods, which are not always applicable, we use similarity thresholds to decide matched pairs. Results show that our protocol can efficiently join tables using the long attributes by considering the semantic relationships among the long string values. Therefore, it improves the overall secure similarity join performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A privacy preserving efficient protocol for semantic similarity join using long string attributes

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Diffusion Maps: A Superior Semantic Method to Improve Similarity Join Performance
Bilal Hawashin ... Farshad Fotouhi
-
Bilal Hawashin, et. al.Bilal Hawashin ... Farshad Fotouhi
01 Dec 2010
01 Dec 2010

A General Framework for Building Applications with Short and Sparse Documents
...
-
, et. al. ...
17 Jun 2014
17 Jun 2014

Semantic similarity relations and computation in schema integration
Wei Song William ... Janis A Bubenko
Data & Knowledge Engineering | VOL. 19
Wei Song William, et. al.Wei Song William ... Janis A Bubenko
01 May 1996
Data & Knowledge Engineering | VOL. 19

Decision letter: Early language exposure affects neural mechanisms of semantic representations
Jamie Reilly ... Floris P de Lange
-
Jamie Reilly, et. al.Jamie Reilly ... Floris P de Lange
23 Jan 2023
23 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A privacy preserving efficient protocol for semantic similarity join using long string attributes

Abstract

Talk to us

Similar Papers