Two-phase schema matching in real world relational databases

Nikolaos Bozovic,Vasilis Vassalos

doi:10.1109/icdew.2008.4498334

Abstract

We propose a new approach to the problem of schema matching in relational databases that merges the hybrid and composite approach of combining multiple individual matching techniques. In particular, we propose assigning individual matchers to two categories, “strong” matchers that provide apriori higher quality matches, and “weak” matchers that may be more sensitive to the inputs and are less reliable but can still help generate some matches. Matching is correspondingly done in two phases, with strong “matches” being produced by strong matchers being combined using a simple voting combiner, and weak matchers providing additional evidence for attributes left unmatched (again using a voting combiner). We observe that, while many recent advances in schema matching [2][5][7][11] use composite schema matching and rely on the existence of training schemas to train combiners, in many real-world situations it is not feasible to employ learning techniques because of the unavailability of training data (i.e., schemas or instance data.) We hypothesize that “weak” matchers can often hurt overall accuracy if used in a “single-phase” composite matcher that does not employ learning techniques. We implement our two-stage approach in the ASID system and evaluate it using real life schemas. The experiments validate our hypothesis regarding the negative effect of “weak” matchers and also show ASID performs comparably to state of the art systems while requiring no training schemas. We also demonstrate the benefits of a simple documentation-based matcher. Our experimental data included schemas ranging from 20 to 120 attributes. Note that schemas with 120 attributes are as large or larger than other published evaluations of relational schema matching.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two-phase schema matching in real world relational databases

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Survey of Schema Matching Research using Database Schemas and Instances
Ali A ... Azlin Nordin
International Journal of Advanced Computer Science and Applications | VOL. 8
Ali A, et. al.Ali A ... Azlin Nordin
01 Jan 2017
International Journal of Advanced Computer Science and Applications | VOL. 8

Matching schemas of heterogeneous relational databases
Yaser Karasneh ... Hamidah Ibrahim
-
Yaser Karasneh, et. al.Yaser Karasneh ... Hamidah Ibrahim
01 Aug 2009
01 Aug 2009

Ontology based schema matching and mapping approach for structured databases
Su Su Hlaing
-
Su Su HlaingSu Su Hlaing
24 Nov 2009
24 Nov 2009

An evolutionary approach to complex schema matching
Moisés Gomes De Carvalho ... Altigran S Da Silva
Information Systems | VOL. 38
Moisés Gomes De Carvalho, et. al.Moisés Gomes De Carvalho ... Altigran S Da Silva
22 Oct 2012
Information Systems | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-phase schema matching in real world relational databases

Abstract

Talk to us

Similar Papers