A probabilistic database is defined in a previous article [R. Cavallo and M. Pittarelli, Proc. 13th Int. Conf. on Very Large Databases (VLDB); 1987; see Ref. 9] as a collection of probability distributions over Cartesian products of finite variable domains. the concept is extended here to accommodate interval-valued probabilities. Algebraic operations for both real- and interval-valued probabilities databases analogous to those for relational databases are defined. Techniques for making inferences regarding joint distributions on subsets of the variables over which a probabilistic database is defined are developed. These are illustrated through application to a problem of decision analysis under partial uncertainty. Connections between the probabilistic database formalism and other forms of data representation are discussed.
Read full abstract