Abstract
In this paper, we address the problem of storing and retrieving XML data over structured Peer-to-Peer (P2P) networks. These are becoming popular because of their access efficiency. An open problem with such networks is represented by the kinds of queries they can handle. In fact, file name lookups, used in popular unstructured networks, are not suitable for many new data formats. Keyword-based searches are also not appropriate for XML data, which must be identified by the whole path leading to an element, rather than by the sole element name. We discuss the extensions needed to properly identify XML data in structured P2P networks. A global document is split into various fragments, which are locally stored within the peers according to their own themes. Each fragment is enhanced with a set of few lightweight path expressions that have the convenient side effect of yielding a decentralized catalog. Since a mediated global schema would not be a reasonable assumption in an highly dynamic P2P network, we show that XPath query evaluation only relying on this catalog effectively biases the search towards particular peers. Our approach does not suffer the network and data size limits of previous proposals and is scalable for large P2P networks. To validate our ideas, we have devised XP2P, namely XPath for P2P, on which we have conducted a comprehensive experimental study using various XML datasets.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.