Full-Text and Structural Indexing of XML Documents on B+-Tree

T Shimizu

doi:10.1093/ietisy/e89-d.1.237

Abstract

XML query processing is one of the most active areas of database research. Although the main focus of past research has been the processing of structural XML queries, there are growing demands for a full-text search for XML documents. In this paper, we propose XICS (XML Indices for Content and Structural search), which aims at high-speed processing of both full-text and structural queries in XML documents. An important design principle of our indices is the use of a B+-tree. To represent the structural information of XML trees, each node in the XML tree is labeled with an identifier. The identifier contains an integer number representing the path information from the root node. XICS consist of two types of indices, the COB-tree (COntent B+-tree) and the STB-tree (STructure B+-tree). The search keys of the COB-tree are a pair of text fragments in the XML document and the identifiers of the leaf nodes that contain the text, whereas the search keys of the STB-tree are the node identifiers. By using a node identifier in the search keys, we can retrieve only the entries that match the path information in the query. The STB-tree can filter nodes using structural conditions in queries, while the COB-tree can filter nodes using text conditions. We have implemented a COB-tree and an STB-tree using GiST and examined index size and query processing time. Our experimental results show the efficiency of XICS in query processing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Full-Text and Structural Indexing of XML Documents on B+-Tree

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems

Lead the way for us

Journal: IEICE Transactions on Information and Systems	Publication Date: Jan 1, 2006
Citations: 18

Similar Papers

Full-Text and Structural XML Indexing on B + -Tree
Toshiyuki Shimizu ... Masatoshi Yoshikawa
-
Toshiyuki Shimizu, et. al.Toshiyuki Shimizu ... Masatoshi Yoshikawa
01 Jan 2004
01 Jan 2004

Efficient structural join processing algorithms
Kaiyang Liu
-
Kaiyang LiuKaiyang Liu
23 Dec 2014
23 Dec 2014

Dynamic interval-based labeling scheme for efficient XML query and update processing
Jung-Hee Yun ... Chin-Wan Chung
The Journal of Systems & Software | VOL. 81
Jung-Hee Yun, et. al.Jung-Hee Yun ... Chin-Wan Chung
07 Jun 2007
The Journal of Systems & Software | VOL. 81

Containment join size estimation
Wei Wang ... Hongjun Lu
-
Wei Wang, et. al.Wei Wang ... Hongjun Lu
09 Jun 2003
09 Jun 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Full-Text and Structural Indexing of XML Documents on B+-Tree

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems