Efficient keyword search for smallest LCAs in XML databases

Yu Xu,Yannis Papakonstantinou

doi:10.1145/1066157.1066217

Abstract

Keyword search is a proven, user-friendly way to query HTML documents in the World Wide Web. We propose keyword search in XML documents, modeled as labeled trees, and describe corresponding efficient algorithms. The proposed keyword search returns the set of trees containing all keywords, where a tree is designated as smallest if it contains no tree that also contains all keywords. Our core contribution, the Indexed Lookup Eager algorithm, exploits key properties of trees in order to outperform prior algorithms by orders of magnitude when the query contains keywords with significantly different frequencies. The Scan Eager variant is tuned for the case where the keywords have similar frequencies. We analytically and experimentally evaluate two variants of the Eager algorithm, along with the Stack algorithm [13]. We also present the XKSearch system, which utilizes the Indexed Lookup Eager, Scan Eager and Stack algorithms and a demo of which on DBLP data is available at http://www.db.ucsd.edu/projects/xksearch. Finally, we extend the Indexed Lookup Eager algorithm to answer Lowest Common Ancestor (LCA) queries.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient keyword search for smallest LCAs in XML databases

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Foundation of Keyword Search in XML
Weidong Yang ... Hao Zhu
-
Weidong Yang, et. al.Weidong Yang ... Hao Zhu
01 Jan 2013
01 Jan 2013

Efficient LCA based keyword search in XML data
Yu Xu ... Yannis Papakonstantinou
-
Yu Xu, et. al.Yu Xu ... Yannis Papakonstantinou
25 Mar 2008
25 Mar 2008

Efficient LCA based keyword search in XML data
Yu Xu ... Yannis Papakonstantinou
-
Yu Xu, et. al.Yu Xu ... Yannis Papakonstantinou
01 Jan 2008
01 Jan 2008

Efficient LCA based keyword search in xml data
Yu Xu ... Yannis Papakonstantinou
-
Yu Xu, et. al.Yu Xu ... Yannis Papakonstantinou
06 Nov 2007
06 Nov 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient keyword search for smallest LCAs in XML databases

Abstract

Talk to us

Similar Papers