Abstract

Distributed databases and data transformation mechanisms are highly relevant to Business Intelligence and Data Analytics. Most enterprise systems today have multiple and distributed databases, and the growing dissemination of Edge-Oriented architectures is driving this trend across many industries and business domains. The Entity-Relationship (ER) model is fundamental to modeling complex enterprise systems, but it has shortcomings. In particular, the ER model cannot represent data transport between different locations (or databases) of a system, nor can it conceptually express data transformation operations, such as aggregate and line functions, that are standard in data analytics. Therefore, we propose ER+, an extension of the ER model, where data distribution, data transport, data transformation, and information generation for distributed operational and analytical systems can be visually identified. The new ER+ representation has another important benefit, it provides the basis for transforming conceptual models into physical implementations of distributed data-oriented systems. This work introduces new concepts in ER modeling and illustrates their application. The TPC-H use case is also used to demonstrate the the practicality of the ER+ model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call