Multiple Distributed Data Sources Research Articles

Distributed Data Mining (DDM) has been proposed as a means to deal with the analysis of distributed data, where DDM discovers patterns and implements prediction based on multiple distributed data sources. However, DDM faces several problems in terms of autonomy, privacy, performance and implementation. DDM requires homogeneity regarding environment, control, administration and the classification algorithm(s), and such that requirements are too strict and inflexible in many applications. In this paper, we propose the employment of a Multi-Agent System (MAS) to be combined with DDM (MAS-DDM). MAS is a mechanism for creating goal-oriented autonomous agents within shared environments with communication and coordination facilities. We shall show that MAS-DDM is both desirable and beneficial. In MAS-DDM, agents could communicate their beliefs (calculated classification) by covering private and non-sharable data, and other agents decide whether the use of such beliefs in classifying instances and adjusting their prior assumptions about each class of data. In MAS-DDM, we will develop and use a modified Naive Bayesian algorithm because (1) Naive Bayesian has been shown to be the most used algorithm to deal with uncertain data, and (2) to show that even if all agents in MAS-DDM use the same algorithm, MAS-DDM preforms better than DDM approaches with non-communicating processes. Point (2) provide an evidence that the exchange of information between agents helps in increasing the accuracy of the classification task significantly.

Read full abstract

We present incremental view maintenance algorithms for a data warehouse derived from multiple distributed autonomous data sources. We begin with a detailed framework for analyzing view maintenance algorithms for multiple data sources with concurrent updates. Earlier approaches for view maintenance in the presence of concurrent updates typically require two types of messages: one to compute the view change due to the initial update and the other to compensate the view change due to interfering concurrent updates. The algorithms developed in this paper instead perform the compensation locally by using the information that is already available at the data warehouse. The first algorithm, termed SWEEP, ensures complete consistency of the view at the data warehouse in the presence of concurrent updates. Previous algorithms for incremental view maintenance either required a quiescent state at the data warehouse or required an exponential number of messages in terms of the data sources. In contrast, this algorithm does not require that the data warehouse be in a quiescent state for incorporating the new views and also the message complexity is linear in the number of data sources. The second algorithm, termed Nested SWEEP, attempts to compute a composite view change for multiple updates that occur concurrently while maintaining strong consistency.

Read full abstract

Multiple Distributed Data Sources Research Articles

Articles published on Multiple Distributed Data Sources

Multi-Agent System Combined With Distributed Data Mining for Mutual Collaboration Classification

How a BI-wise Responsible Integrated Management System May Support Food Traceability

Learning from Distributed Data Sources Using Random Vector Functional-Link Networks

Assortment of Materialized View: A Comparative Survey in Data Warehouse Environment

Service-Oriented Collaborative Framework for High-Performance Data Transfer in Grids

Report on the 5 th international workshop on the design and management of data warehouses (DMDW'03)

Performance comparison of three alternatives of distributed multidatabase systems: A global query perspective

Efficient view maintenance at data warehouses

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multiple Distributed Data Sources Research Articles

Articles published on Multiple Distributed Data Sources

Multi-Agent System Combined With Distributed Data Mining for Mutual Collaboration Classification

How a BI-wise Responsible Integrated Management System May Support Food Traceability

Learning from Distributed Data Sources Using Random Vector Functional-Link Networks

Assortment of Materialized View: A Comparative Survey in Data Warehouse Environment

Service-Oriented Collaborative Framework for High-Performance Data Transfer in Grids

Report on the 5 th international workshop on the design and management of data warehouses (DMDW'03)

Performance comparison of three alternatives of distributed multidatabase systems: A global query perspective

Efficient view maintenance at data warehouses