Abstract

In information integration systems, XML has become an important format for representing and exchanging information. Selecting the data sources that are useful for a query is crucial to efficient query processing in such systems. This paper focuses on data source selection for XML data sources in an information integration system. For queries with both structural and value constraints, two kinds of indices are presented for data source selection: a constraint index and a structural index. The former is grouped by values and captures the structure related to each value in a group. The latter summarises all the paths in the XML data sources. To reduce the size of the indices, index compacting and node selection strategies are presented. Based on these index structures, efficient data source selection methods are designed. Extensive experiments demonstrate the efficiency and effectiveness of the index structures and data source selection strategies presented in this paper.
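As a rough illustration of the idea only, and not the index design of this paper, the following minimal sketch builds a path summary per source and a value-keyed lookup, then uses them to pick candidate sources for a query. The function names `build_indices` and `select_sources`, the input shape (a dict from source id to XML string), and the simplification of the constraint index to a flat `(value, path)` map are all assumptions made for this example. Index compacting and node selection are not modelled.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_indices(sources):
    """Build two toy indices from `sources`, a dict of source id -> XML string.

    Both index layouts are illustrative assumptions, not the paper's design.
    """
    structural = defaultdict(set)  # label path -> ids of sources containing it
    constraint = defaultdict(set)  # (text value, label path) -> ids of sources

    def walk(node, prefix, source_id):
        path = prefix + "/" + node.tag
        structural[path].add(source_id)
        text = (node.text or "").strip()
        if text:
            # Record where each value occurs, so value constraints can be checked.
            constraint[(text, path)].add(source_id)
        for child in node:
            walk(child, path, source_id)

    for source_id, xml_text in sources.items():
        walk(ET.fromstring(xml_text), "", source_id)
    return structural, constraint


def select_sources(structural, constraint, path, value=None):
    """Return ids of sources that may answer a query on `path` (and `value`)."""
    if value is None:
        return set(structural.get(path, set()))
    return set(constraint.get((value, path), set()))


# Hypothetical data sources used only for this example.
sources = {
    "s1": "<lib><book><title>XML</title></book></lib>",
    "s2": "<lib><book><title>SQL</title></book></lib>",
    "s3": "<lib><cd><title>XML</title></cd></lib>",
}
structural, constraint = build_indices(sources)
print(select_sources(structural, constraint, "/lib/book/title"))
print(select_sources(structural, constraint, "/lib/book/title", "XML"))
```

In this toy setting, a query with only a structural constraint is answered from the path summary, while a query that also carries a value constraint is narrowed by the value-keyed lookup, so sources that cannot contribute are never contacted.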
