Unary and n-ary inclusion dependency discovery in relational databases

Fabien De Marchi,Stéphane Lopes,Jean-Marc Petit

doi:10.1007/s10844-007-0048-x

Abstract

Foreign keys form one of the most fundamental constraints for relational databases. Since they are not always defined in existing databases, the discovery of foreign keys turns out to be an important and challenging task. The underlying problem is known to be the inclusion dependency (IND) inference problem. In this paper, data-mining algorithms are devised for IND inference in a given database. We propose a two-step approach. In the first step, unary INDs are discovered thanks to a new preprocessing stage which leads to a new algorithm and to an efficient implementation. In the second step, n-ary IND inference is achieved. This step fits in the framework of levelwise algorithms used in many data-mining algorithms. Since real-world databases can suffer from some data inconsistencies, approximate INDs, i.e. INDs which almost hold, are considered. We show how they can be safely integrated into our unary and n-ary discovery algorithms. An implementation of these algorithms has been achieved and tested against both synthetic and real-life databases. Up to our knowledge, no other algorithm does exist to solve this data-mining problem.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unary and n-ary inclusion dependency discovery in relational databases

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems

Lead the way for us

Journal: Journal of Intelligent Information Systems	Publication Date: Jan 26, 2008
Citations: 95

Similar Papers

Semantic sampling of existing databases through informative Armstrong databases
Fabien De Marchi ... Jean-Marc Petit
Information Systems | VOL. 32
Fabien De Marchi, et. al.Fabien De Marchi ... Jean-Marc Petit
19 Jan 2006
Information Systems | VOL. 32

Efficient Algorithms for Mining Inclusion Dependencies
Fabien De Marchi ... Stéphane Lopes
-
Fabien De Marchi, et. al.Fabien De Marchi ... Stéphane Lopes
01 Jan 2002
01 Jan 2002

Algorithmes pour la découverte des dépendances d'inclusion dans les bases de données relationnelles
Fabien De Marchi
Ingénierie des systèmes d'information | VOL. 9
Fabien De MarchiFabien De Marchi
24 Aug 2004
Ingénierie des systèmes d'information | VOL. 9

Discovering Foreign Keys on Web Tables with the Crowd
Xiaoyu Wu ... Ning Wang
Computing and Informatics | VOL. 38
Xiaoyu Wu, et. al.Xiaoyu Wu ... Ning Wang
01 Jan 2019
Computing and Informatics | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unary and n-ary inclusion dependency discovery in relational databases

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems