Consistent Queries Over Databases with Integrity Constraints

Sergio Greco,Ester Zumpano

doi:10.4018/9781930708389.ch006

Abstract

Integrating data from different sources consists of two main steps, the first in which the various relations are merged together, and the second in which some tuples are removed (or inserted) from the resulting database in order to satisfy integrity constraints. There are several ways to integrate databases or possibly distributed information sources, but whatever integration architecture we choose, the heterogeneity of the sources to be integrated causes subtle problems. In particular, the database obtained from the integration process may be inconsistent with respect to integrity constraints, that is, one or more integrity constraints are not satisfied. Integrity constraints represent an important source of information about the real world. They are usually used to define constraints on data (functional dependencies, inclusion dependencies, etc.) and have, nowadays, a wide applicability in several contexts such as semantic query optimization, cooperative query answering, database integration, and view update. Since the satisfaction of integrity constraints cannot generally be guaranteed, if the database is obtained from the integration of different information sources, in the evaluation of queries, we must compute answers that are consistent with the integrity constraints. The following example shows a case of inconsistency. Example 1: Consider the following database schema consisting of the single binary relation Teaches (Course, Professor) where the attribute Course is a key for the relation. Assume there are two different instances for the relations Teaches, D1={(c1,p1),(c2,p2)} and D2={(c1,p1),(c2,p3)}. The two instances satisfy the constraint that Course is a key, but from their union we derive a relation that does not satisfy the constraint since there are two distinct tuples with the same value for the attribute Course. In the integration of two conflicting databases simple solutions could be based on the definition of preference criteria such as a partial order on the source information or a majority criterion (Lin & Mendelzon, 1996). However, these solutions are not generally satisfactory, and more useful solutions are those based on (1) the computation of “repairs” for the database, and (2) the computation of consistent answers (Arenas, Bertossi, & Chomicki, 1999). The computation of repairs is based on the definition of minimal sets of insertion and deletion operations so that the resulting database satisfies all constraints. The computation of consistent answers is based on the identification of tuples satisfying integrity constraints and on the selection of tuples matching the goal. For instance, for the integrated database of Example 1, we have two alternative repairs consisting in the deletion of one of the tuples (c2,p2) and (c2,p3). The consistent answer to a query over the relation Teaches contains the unique tuple (c1,p1) so that we do not know which professor teaches course c2. Therefore, it is very important, in the presence of inconsistent data, not only to compute the set of consistent answers, but also to know which facts are unknown and if there are possible repairs for the database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Consistent Queries Over Databases with Integrity Constraints

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Consistent Queries over Databases with Integrity Constraints
Luciano Caroprese ... Ester Zumpano
-
Luciano Caroprese, et. al.Luciano Caroprese ... Ester Zumpano
01 Jan 2009
01 Jan 2009

Consistent Queries over Databases with Integrity Constraints
Luciano Caroprese ... Cristian Molinaro
-
Luciano Caroprese, et. al.Luciano Caroprese ... Cristian Molinaro
01 Jan 2009
01 Jan 2009

Consistent Queries Over Databases with Integrity Constraints
Sergio Greco ... Ester Zumpano
-
Sergio Greco, et. al.Sergio Greco ... Ester Zumpano
01 Jan 2002
01 Jan 2002

Logic-based approach to semantic query optimization
Upen S Chakravarthy ... John Grant
ACM Transactions on Database Systems | VOL. 15
Upen S Chakravarthy, et. al.Upen S Chakravarthy ... John Grant
01 Jun 1990
ACM Transactions on Database Systems | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Consistent Queries Over Databases with Integrity Constraints

Abstract

Talk to us

Similar Papers