Automata Approach to XML Data Indexing

Eliška Šestáková,Jan Janoušek

doi:10.3390/info9010012

Abstract

The internal structure of XML documents can be viewed as a tree. Trees are among the fundamental and well-studied data structures in computer science. They express a hierarchical structure and are widely used in many applications. This paper focuses on the problem of processing tree data structures; particularly, it studies the XML index problem. Although there exist many state-of-the-art methods, the XML index problem still belongs to the active research areas. However, existing methods usually lack clear references to a systematic approach to the standard theory of formal languages and automata. Therefore, we present some new methods solving the XML index problem using the automata theory. These methods are simple and allow one to efficiently process a small subset of XPath. Thus, having an XML data structure, our methods can be used efficiently as auxiliary data structures that enable answering a particular set of queries, e.g., XPath queries using any combination of the child and descendant-or-self axes. Given an XML tree model with n nodes, the searching phase uses the index, reads an input query of size m, finds the answer in time O ( m ) and does not depend on the size of the original XML document.

Highlights

Extensible Markup Language (XML), which became a World Wide Web Consortium (W3C)Recommendation in 1998, still belongs to the main methods of exchanging data over the Internet.It plays an important role in many aspects of software development, often to simplify data storage and sharing
Having an XML data structure, our methods can be used efficiently as auxiliary data structures that enable answering a particular set of queries, e.g., XPath queries using any combination of the child and descendant-or-self axes
We propose three indexing methods that are all based on finite state automata

Summary

Introduction

Recommendation in 1998, still belongs to the main methods of exchanging data over the Internet. It plays an important role in many aspects of software development, often to simplify data storage and sharing. Efficient storing and querying of XML data are key tasks that have been extensively studied during the past few years [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15]. We can preprocess the data subject and construct an index

Methods

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information	Publication Date: Jan 6, 2018
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Automata Approach to XML Data Indexing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

A Comprehensive Analysis of Stack and Queue Data Structures and Their Uses
S Rajasekaran ... Mastan Vali Shaik
-
S Rajasekaran, et. al.S Rajasekaran ... Mastan Vali Shaik
30 Jun 2023
30 Jun 2023

Dynamic external hashing
Zhewei Wei ... Qin Zhang
-
Zhewei Wei, et. al.Zhewei Wei ... Qin Zhang
11 Aug 2009
11 Aug 2009

Classic and new data structure problems in external memory
Zhewei Wei
-
Zhewei WeiZhewei Wei
23 Dec 2014
23 Dec 2014

Incremental XPath evaluation
Henrik Björklund ... Wim Martens
ACM Transactions on Database Systems | VOL. 35
Henrik Björklund, et. al.Henrik Björklund ... Wim Martens
12 Oct 2010
ACM Transactions on Database Systems | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automata Approach to XML Data Indexing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information