Computing Structural Statistics by Keywords in Databases

Lu Qin,Lijun Chang,Jeffrey Xu Yu

doi:10.1109/tkde.2012.78

Abstract

Keyword search in RDBs has been extensively studied in recent years. The existing studies focused on finding all or top-k interconnected tuple-structures that contain keywords. In reality, the number of such interconnected tuple-structures for a keyword query can be large. It becomes very difficult for users to obtain any valuable information more than individual interconnected tuple-structures. Also, it becomes challenging to provide a similar mechanism like group-&-aggregate for those interconnected tuple-structures. In this paper, we study computing structural statistics keyword queries by extending the group-&-aggregate framework. We consider an RDB as a large directed graph where nodes represent tuples, and edges represent the links among tuples. Instead of using tuples as a member in a group, we consider rooted subgraphs. Such a rooted subgraph represents an interconnected tuple-structure among tuples and some of the tuples contain keywords. The dimensions of the rooted subgraphs are determined by dimensional keywords in a data driven fashion. Two rooted subgraphs are grouped into the same group if they are isomorphic based on the dimensions or in other words the dimensional keywords. The scores of the rooted subgraphs are computed by a user-given score function if the rooted subgraphs contain some of general keywords. Here, the general keywords are used to compute scores rather than determining dimensions. The aggregates are computed using an sql aggregate function for every group based on the scores computed. We give our motivation using a real data set. We propose new approaches to compute structural statistics keyword queries, perform extensive performance studies using two large real data sets and a large synthetic data set, and confirm the effectiveness and efficiency of our approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Computing Structural Statistics by Keywords in Databases

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Oct 1, 2012
Citations: 2

Similar Papers

Computing structural statistics by keywords in databases
Lu Qin ... Jeffrey Xu Yu
-
Lu Qin, et. al.Lu Qin ... Jeffrey Xu Yu
01 Apr 2011
01 Apr 2011

Ten thousand SQLs
Lu Qin ... Lijun Chang
Proceedings of the VLDB Endowment | VOL. 3
Lu Qin, et. al.Lu Qin ... Lijun Chang
01 Sep 2010
Proceedings of the VLDB Endowment | VOL. 3

Efficient Duplication Free and Minimal Keyword Search in Graphs
Mehdi Kargar ... Xiaohui Yu
IEEE Transactions on Knowledge and Data Engineering | VOL. 26
Mehdi Kargar, et. al.Mehdi Kargar ... Xiaohui Yu
01 Jul 2014
IEEE Transactions on Knowledge and Data Engineering | VOL. 26

Keyword search in databases
Lu Qin ... Jeffrey Xu Yu
-
Lu Qin, et. al.Lu Qin ... Jeffrey Xu Yu
29 Jun 2009
29 Jun 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Computing Structural Statistics by Keywords in Databases

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering