Uninterpreted Schema Matching with Embedded Value Mapping under Opaque Column Names and Data Values

Anuj Jaiswal, David J. Miller, Prasenjit Mitra

doi:10.1109/tkde.2009.69

Abstract

Schema matching and value mapping across two heterogeneous information sources are critical tasks in applications involving data integration, data warehousing, and federation of databases. Before data can be integrated from multiple tables, the columns and the values appearing in the tables must be matched. The complexity of the problem grows quickly with the number of data attributes (columns) to be matched and with the multiple semantics that data values can carry. Traditional research has tackled schema matching and value mapping independently. We propose a novel method that optimizes embedded value mappings to enhance schema matching in the presence of opaque data values and column names. In this approach, the fitness objective for matching a pair of attributes from two schemas depends on the value mapping function for each of the two attributes. Suitable fitness objectives include the Euclidean distance measure, which we use in our experimental study, as well as relative (cross) entropy. We propose a heuristic local descent optimization strategy that uses sorting and two-opt switching to jointly optimize value mappings and attribute matches. Our experiments show that the proposed technique outperforms earlier uninterpreted schema matching methods and thus should form a useful addition to a suite of (semi-)automated tools for resolving structural heterogeneity.
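
To make the idea in the abstract concrete, the following is a minimal Python sketch and not the authors' algorithm. It assumes each column is summarized only by the relative frequencies of its values, that sorting those frequencies and aligning them rank by rank serves as the embedded value mapping, and that both tables have the same number of columns. The function names (`sorted_freqs`, `column_distance`, `two_opt_match`) and the toy tables are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations
import math


def sorted_freqs(column):
    """Relative frequencies of a column's values, largest first."""
    counts = Counter(column)
    total = len(column)
    return sorted((c / total for c in counts.values()), reverse=True)


def column_distance(col_a, col_b):
    """Euclidean distance between the sorted frequency vectors.

    Assumption: aligning the vectors rank by rank stands in for a value
    mapping (k-th most frequent value of col_a -> k-th of col_b).
    """
    p, q = sorted_freqs(col_a), sorted_freqs(col_b)
    n = max(len(p), len(q))
    p += [0.0] * (n - len(p))  # pad the shorter vector with zeros
    q += [0.0] * (n - len(q))
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


def two_opt_match(table_a, table_b):
    """Local descent over one-to-one column matches via pairwise swaps.

    Assumption: both tables (dicts of column name -> list of values)
    have the same number of columns.
    """
    names_a, names_b = list(table_a), list(table_b)
    cost = {
        (a, b): column_distance(table_a[a], table_b[b])
        for a in names_a
        for b in names_b
    }
    match = list(names_b)  # match[i] is the partner of names_a[i]
    improved = True
    while improved:
        improved = False
        for i, j in combinations(range(len(names_a)), 2):
            current = cost[names_a[i], match[i]] + cost[names_a[j], match[j]]
            swapped = cost[names_a[i], match[j]] + cost[names_a[j], match[i]]
            if swapped < current - 1e-12:  # accept only strict improvements
                match[i], match[j] = match[j], match[i]
                improved = True
    return dict(zip(names_a, match))


# Toy tables with opaque names and values (hypothetical data).
table_a = {"x1": ["a", "a", "a", "b"], "x2": ["u", "v", "u", "v"]}
table_b = {"y1": [1, 2, 1, 2], "y2": [9, 9, 9, 7]}
result = two_opt_match(table_a, table_b)
print(result)
```

Because every accepted swap strictly lowers the total cost, the loop terminates, although like any local descent it may stop at a local optimum rather than the best overall matching.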

Journal: IEEE Transactions on Knowledge and Data Engineering
Publication Date: Feb 1, 2010
Citations: 42

Similar Papers

Schema matching and embedded value mapping for databases with opaque column names and mixed continuous and discrete-valued data fields
Anuj Jaiswal ... Prasenjit Mitra
ACM Transactions on Database Systems | VOL. 38
01 Apr 2013

Coping with Uncertainty in Schema Matching: Bayesian Networks and Agent-Based Modeling Approach
Hicham Assoudi ... Hakim Lounis
01 Jan 2015

Discovering Semantic Matches between Opaque Database Schemas
Wei Chen
01 Oct 2011

A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values
Boris Rabinovich ... Mark Last
Journal of Database Management | VOL. 25
01 Oct 2014
