Efficient error-tolerant query autocompletion

Chuan Xiao, Koji Tsuda, Wei Wang, Kunihiko Sadakane, Jianbin Qin, Yoshiharu Ishikawa

doi:10.14778/2536336.2536339

Abstract

Query autocompletion is an important feature that saves users the keystrokes of typing an entire query. In this paper, we study query autocompletion that tolerates errors in users' input using edit distance constraints. Previous approaches index data strings in a trie and continuously maintain all prefixes of data strings whose edit distance from the query is within the threshold. The major inherent problem is that the number of such prefixes is huge for the first few characters of the query and is exponential in the alphabet size. This results in slow query response even if the entire query approximately matches only a few prefixes. We propose a novel neighborhood generation-based algorithm, IncNGTrie, which can achieve up to two orders of magnitude speedup over existing methods for the error-tolerant query autocompletion problem. Our algorithm maintains only a small set of active nodes, thus saving both the space and the time needed to process the query. We also study efficient duplicate removal, which is a core problem in fetching query answers. In addition, we propose optimization techniques to reduce our index size and discuss several extensions to our method. The efficiency of our method is demonstrated against existing methods through extensive experiments on real datasets.
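
To make the problem setting concrete, the following is a minimal sketch of the trie-based baseline idea the abstract attributes to previous approaches: a data string is returned when some prefix of it lies within edit distance tau of the query. It is not IncNGTrie and not the authors' code. The names TrieNode, build_trie, collect, and autocomplete are hypothetical, and the sketch recomputes everything per query rather than maintaining matching prefixes incrementally as the user types.

```python
from typing import Dict, List, Set


class TrieNode:
    """One trie node; is_end marks the end of a complete data string."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_end = False


def build_trie(strings: List[str]) -> TrieNode:
    """Index the data strings in a character trie."""
    root = TrieNode()
    for s in strings:
        node = root
        for ch in s:
            node = node.children.setdefault(ch, TrieNode())
        node.is_end = True
    return root


def collect(node: TrieNode, prefix: str, out: Set[str]) -> None:
    """Add every data string stored at or below `node` to `out`."""
    if node.is_end:
        out.add(prefix)
    for ch, child in node.children.items():
        collect(child, prefix + ch, out)


def autocomplete(root: TrieNode, query: str, tau: int) -> List[str]:
    """Return data strings having a prefix within edit distance tau of query."""
    results: Set[str] = set()
    # row[i] = edit distance between query[:i] and the current trie prefix.
    first_row = list(range(len(query) + 1))

    def visit(node: TrieNode, prefix: str, row: List[int]) -> None:
        if row[-1] <= tau:
            # The whole query matches this prefix: every completion qualifies.
            collect(node, prefix, results)
            return
        if min(row) > tau:
            # Row minima never decrease along a path, so this branch is dead.
            return
        for ch, child in node.children.items():
            new_row = [row[0] + 1]
            for i in range(1, len(query) + 1):
                cost = 0 if query[i - 1] == ch else 1
                new_row.append(min(new_row[i - 1] + 1,  # skip a query char
                                   row[i] + 1,          # skip the trie char
                                   row[i - 1] + cost))  # match or substitute
            visit(child, prefix + ch, new_row)

    visit(root, "", first_row)
    return sorted(results)


if __name__ == "__main__":
    trie = build_trie(["apple", "apply", "ample", "banana"])
    print(autocomplete(trie, "appl", 0))
    print(autocomplete(trie, "aple", 1))
```

With a short query, almost every shallow trie node survives the pruning test, which is the blow-up in candidate prefixes that the abstract identifies as the bottleneck of this style of approach.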

Journal: Proceedings of the VLDB Endowment
Publication Date: Apr 1, 2013
Citations: 66

Similar Papers

Efficient query autocompletion with edit distance-based error tolerance
Jianbin Qin ... Sheng Hu
The VLDB Journal | VOL. 29
14 Dec 2019

A Hierarchical Index Structure for Region-Aware Spatial Keyword Search with Edit Distance Constraint
Junye Yang ... Chunxiao Xing
01 Jan 2019

Analyzing User's Sequential Behavior in Query Auto-Completion via Markov Processes
Liangda Li ... Ricardo Baeza-Yates
09 Aug 2015

Efficient structure similarity searches: a partition-based approach
Xiang Zhao ... Chuan Xiao
The VLDB Journal | VOL. 27
24 Oct 2017
