Efficient GPU-Based Query Processing with Pruned List Caching in Search Engines

Dongdong Wang,Wenqing Yu,Xiaoguang Liu,Rebecca J Stones,Junjie Ren,Gang Wang,Mingming Ren

doi:10.1109/icpads.2017.00038

Dongdong Wang, Wenqing Yu + Show 5 more

https://doi.org/10.1109/icpads.2017.00038

Copy DOI

Export

Save

Cite

Publication Date: Dec 1, 2017

Citations: 2

Affiliation: Nankai University, Baidu (China)

Abstract
Full-Text
Similar Papers

Abstract

Listen

There are two inherent obstacles to effectively using Graphics Processing Units (GPUs) for query processing in search engines: (a) the highly restricted GPU memory space, and (b) the CPU-GPU transfer latency. Previously, Ao et al. presented a GPU method for lists intersection, an essential component in AND-based query processing. However, this work assumes the whole inverted index can be stored in GPU memory and does not address document ranking. In this paper, we describe and analyze a GPU query processing method which incorporates both lists intersection and top-k ranking. We introduce a parameterized pruned posting list GPU caching method where the parameter determines how much GPU memory is used for caching. This method allows list caching for large inverted indexes using the limited GPU memory, thereby making a qualitative improvement over previous work. We also give a mathematical model which can identify an approximately optimal choice of the parameter. Experimental results indicate that this GPU approach under the pruned list caching policy achieves better query throughput than its CPU counterpart, even when the inverted index size is much larger than the GPU memory space.

Full Text