Context Of XML Research Articles

XML data sources are gaining popularity across a wide family of Business Intelligence (BI) and On-Line Analytical Processing (OLAP) applications, owing to XML's ability to represent and manage semi-structured and complex multidimensional data. As a consequence, many XML data warehouse models have been proposed in recent years to handle the heterogeneity and complexity of multidimensional data in ways that traditional relational data warehouse approaches cannot. However, XML-native database systems currently suffer from limited performance, both in the volume of data they can manage and in query response time. Recent research has therefore focused on fragmentation techniques, which can overcome these limitations. Derived horizontal fragmentation is already used in relational data warehouses and can be adapted to the XML context. However, classical fragmentation algorithms offer no control over the number of fragments they produce, a number that plays a critical role in data warehouses and, even more so, in distributed data warehouse architectures. Motivated by this challenge, in this paper we propose using the K-means clustering algorithm to fragment very large XML data warehouses effectively and efficiently while fixing the number of fragments through the parameter K. We complete our analytical contribution with a comprehensive experimental assessment in which we compare the efficiency of the proposed XML data warehouse fragmentation technique against that of classical derived horizontal fragmentation algorithms adapted to XML data warehouses.
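The idea of bounding the number of fragments through K can be illustrated with a small sketch. It is not the authors' procedure: it assumes, purely for illustration, that each fact is described by a binary vector recording which workload selection predicates it satisfies, and that facts are clustered directly. The names `kmeans_labels`, `fragment`, and the sample `facts` are hypothetical.

```python
import random
from collections import defaultdict


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_labels(vectors, k, iters=50, seed=0):
    """Plain K-means (Lloyd's algorithm); returns one cluster label per vector."""
    rng = random.Random(seed)
    # Initial centroids: k vectors sampled from the input (requires k <= len(vectors)).
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = None
    for _ in range(iters):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def fragment(facts, k):
    """Group fact ids into at most k fragments, one per non-empty cluster."""
    ids = list(facts)
    labels = kmeans_labels([facts[i] for i in ids], k)
    fragments = defaultdict(list)
    for fact_id, lab in zip(ids, labels):
        fragments[lab].append(fact_id)
    return dict(fragments)


# Hypothetical facts: one 0/1 entry per workload predicate (assumed encoding).
facts = {
    "f1": [1, 1, 0, 0],
    "f2": [1, 1, 0, 0],
    "f3": [0, 0, 1, 1],
    "f4": [0, 0, 1, 1],
    "f5": [1, 0, 0, 0],
    "f6": [0, 0, 0, 1],
}
fragments = fragment(facts, k=2)
print(fragments)
```

Whatever the data looks like, the result holds at most K fragments, because every fact receives exactly one of K cluster labels. That bound is the property the abstract emphasises.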

XML is an ordered data model, and XQuery expressions return results that have a well-defined order. However, little work has been done to date on how order is supported in XML query processing. In this paper we study the issues related to handling order in the XML context, namely the challenges imposed by the XML data model, the variety of order requirements of the XQuery language, and the need to maintain order in the presence of updates to the XML data. We propose an efficient solution that addresses all these issues. Our solution is based on a key encoding for XML nodes that serves as node identity and at the same time encodes order. We design rules for encoding the order of processed XML nodes based on the XML algebraic query execution model and the node key encoding. These rules do not require any actual sorting of intermediate results during execution. Our approach enables efficient order-sensitive incremental view maintenance, as it makes most XML algebra operators distributive with respect to bag union. We prove the correctness of our order encoding approach. Our approach is implemented and integrated with Rainbow, an XML data management system developed at WPI. We have tested the efficiency of our approach using queries that have different order requirements. We have also measured the relative cost of the different components of our order solution in different types of queries. In general, the overhead of maintaining order in our approach is very small relative to the query processing time.
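To make the notion of a key that doubles as node identity and order concrete, here is a minimal sketch. It uses Dewey-style path keys as a stand-in, because the abstract does not specify the actual encoding. The tree layout and the names `assign_keys` and `document_order` are assumptions made for illustration.

```python
from typing import List, Tuple

Key = Tuple[int, ...]
Node = Tuple[str, list]  # (element name, list of child nodes)


def assign_keys(node: Node, key: Key = (1,)) -> List[Tuple[Key, str]]:
    """Give every node a key made of sibling positions along its root path."""
    name, children = node
    keyed = [(key, name)]
    for position, child in enumerate(children, start=1):
        keyed.extend(assign_keys(child, key + (position,)))
    return keyed


def document_order(nodes: List[Tuple[Key, str]]) -> List[Tuple[Key, str]]:
    """Recover document order by comparing keys alone (tuple comparison)."""
    return sorted(nodes, key=lambda pair: pair[0])


# Hypothetical document used only for illustration.
tree = (
    "book",
    [
        ("title", []),
        ("chapter", [("section", []), ("section", [])]),
        ("chapter", []),
    ],
)
keyed = assign_keys(tree)

# Two unordered partial results, combined by bag union (list concatenation);
# a single final sort on the keys restores document order.
part_a = [keyed[4], keyed[1]]
part_b = [keyed[3]]
print(document_order(part_a + part_b))
```

Because a parent's key is a prefix of each of its children's keys, and a prefix compares lower than its extensions, comparing keys reproduces document order without consulting the tree. Intermediate results can therefore stay unsorted and be merged freely, with order recovered from the keys at the end. A scheme meant to absorb updates would also need room to insert between siblings, which this sketch leaves out.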

Articles published on Context Of XML

Benchmarking JSON Document Stores in Practice

A Survey to View Update Problem

Bidirectionalizing graph transformations

Reasoning about XML with temporal logics and automata

Fragmenting very large XML data warehouses via K-means clustering algorithm

Information preserving XML schema embedding

Technical context and cultural consequences of XML

An information-theoretic approach to normal forms for relational and XML data

Efficiently supporting order in XML query processing

Indexing in an XML context

TIMBER: A native XML database

Active rules for XML: A new paradigm for E-services
