Semantic Schema Matching without Shared Instances

Jeffrey Partyka, Latifur Khan, Bhavani Thuraisingham

doi:10.1109/icsc.2009.64

Abstract

Semantic heterogeneity across data sources remains a widespread and relevant problem requiring innovative solutions. Our approach to resolving semantic disparities among distinct data sources aligns their constituent tables by first choosing attributes for comparison. We then examine their instances and calculate a similarity value between them known as entropy-based distribution (EBD). One method of calculating EBD applies a state-of-the-art instance matching strategy based on N-grams in the data. However, this method often fails because it relies on shared instance data to determine similarity. The result is an overestimation of semantic similarity between unrelated attributes and an underestimation of semantic similarity between related attributes. Our method resolves this using clustering and a measure known as Normalized Google Distance. The EBD is then calculated among all clusters by treating each cluster as a type. We demonstrate the effectiveness of our approach over the traditional N-gram approach on multi-jurisdictional datasets.
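
The abstract names Normalized Google Distance (NGD) but does not state its formula. As a minimal sketch, the function below follows the standard NGD definition from the literature, computed from search hit counts. It is not taken from the paper: the function name, the parameter names, and the example counts are hypothetical, and the paper's clustering step is not shown.

```python
import math


def normalized_google_distance(fx, fy, fxy, n):
    """Standard NGD from hit counts (illustrative helper, not the authors' code).

    fx, fy -- number of pages containing term x, term y (hypothetical inputs)
    fxy    -- number of pages containing both terms
    n      -- total number of indexed pages; assumed larger than fx and fy
    """
    if fx <= 0 or fy <= 0 or fxy <= 0:
        # Terms that are unseen or never co-occur are treated as maximally distant.
        return float("inf")
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    # NGD = (max(log f(x), log f(y)) - log f(x, y)) / (log N - min(log f(x), log f(y)))
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))


# Made-up counts: two terms that always co-occur have distance 0.
same = normalized_google_distance(1000, 1000, 1000, 1_000_000)
# Made-up counts: terms that co-occur on only a tenth of their pages.
apart = normalized_google_distance(100, 100, 10, 10_000)
```

Smaller values indicate terms that tend to appear together, which is what lets instance values be grouped by meaning even when two attributes share no literal instances.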

Similar Papers

Enhanced geographically typed semantic schema matching
Jeffrey Partyka ... Shashi Shekhar
Journal of Web Semantics | VOL. 9
03 Dec 2010

Geospatial Schema Matching with High-Quality Cluster Assurance and Location Mining from Social Network
Latifur Khan ... Satyen Abrol
01 Dec 2010

Enhanced Geographically-Typed Semantic Schema Matching
Jeffrey Partyka ... Bhavani Thuraisingham
SSRN Electronic Journal
01 Jan 2010

Geographically-typed semantic schema matching
Jeffrey Partyka ... Latifur Khan
04 Nov 2009
