Recent years have witnessed the generation of a massive amount of spatial–textual data. In view of this, a new type of query coined spatial keyword query has been proposed to deal with the location-based services with additional keyword constraint. This paper studies one of the most popular spatial keyword queries called Top-k Spatial Keyword Query(TkSKQ). Explicitly speaking, given a set of objects, a TkSKQ finds the k objects that are closest to the querier with each of these k objects satisfying all the keywords specified by the query. This kind of query is of paramount importance in a variety of application domains such as location-based recommendation and advertisement.The state-of-art algorithm for processing a TkSKQ is highly sensitive to the number of query keywords specified in the query such that its performance degrades significantly with an increase in the number of keywords. To remedy this drawback, this paper proposes a novel mechanism that utilizes an additional keyword list to enhance the efficiency of the existing solution. Based on this indexing technique, our algorithm needs only traverse a single quadtree when processing a TkSKQ. Moreover, we study how to prioritize the keywords in the vocabulary so as to optimize the performance of our technique. Furthermore, we deal with a generalized version of the TkSKQ problem, called HkSKQ. A similar technique can also be useful for solving HkSKQ. Experimental results on both synthetic and real data reveal the superiority of our proposed scheme.
Read full abstract