Efficient parallel lists intersection and index compression algorithms using graphics processing units

Naiyong Ao, Sheng Lin, Gang Wang, Douglas S Stones, Di Wu, Fan Zhang, Xiaoguang Liu, Jing Liu

doi:10.14778/2002974.2002975

Abstract

Major web search engines answer thousands of queries per second requesting information about billions of web pages. The data sizes and query loads are growing at an exponential rate. To manage the heavy workload, we consider techniques for utilizing a Graphics Processing Unit (GPU). We investigate new approaches to improve two important operations of search engines -- lists intersection and index compression. For lists intersection, we develop techniques for efficient implementation of the binary search algorithm for parallel computation. We inspect some representative real-world datasets and find that a sufficiently long inverted list has an overall linear rate of increase. Based on this observation, we propose Linear Regression and Hash Segmentation techniques for contracting the search range. For index compression, the traditional d-gap based compression schemata are not well-suited for parallel computation, so we propose a Linear Regression Compression schema which has an inherent parallel structure. We further discuss how to efficiently intersect the compressed lists on a GPU. Our experimental results show significant improvements in the query processing throughput on several datasets.
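To illustrate the idea of contracting the search range with linear regression, here is a minimal CPU sketch in Python. It is not the authors' GPU implementation, which the abstract does not detail. The function names (fit_line, lr_search, intersect) and the choice to store a maximum index error alongside the fitted line are assumptions made for this illustration. The loop over the shorter list stands in for what would be one thread per element on a GPU.

```python
import math
from bisect import bisect_left


def fit_line(lst):
    """Least-squares fit mapping value -> index for a sorted list.

    Returns (slope, intercept, max_err), where max_err is the largest
    absolute gap between an element's true index and its predicted
    index, rounded up to an integer.
    """
    n = len(lst)
    mean_x = sum(lst) / n
    mean_y = (n - 1) / 2
    sxx = sum((x - mean_x) ** 2 for x in lst)
    sxy = sum((x - mean_x) * (i - mean_y) for i, x in enumerate(lst))
    slope = sxy / sxx if sxx else 0.0
    intercept = mean_y - slope * mean_x
    err = max(abs(i - (slope * x + intercept)) for i, x in enumerate(lst))
    return slope, intercept, math.ceil(err)


def lr_search(lst, model, target):
    """Binary search restricted to the window predicted by the model."""
    slope, intercept, max_err = model
    n = len(lst)
    pred = slope * target + intercept
    # If target is present, its index lies within max_err of pred.
    lo = min(max(0, math.floor(pred) - max_err), n)
    hi = max(lo, min(n, math.ceil(pred) + max_err + 1))
    j = bisect_left(lst, target, lo, hi)
    return j < hi and lst[j] == target


def intersect(short, long):
    """Intersect two sorted lists by probing the longer one.

    Each probe is independent, so on a GPU every element of `short`
    could be handled by its own thread (an assumption of this sketch).
    """
    if not long:
        return []
    model = fit_line(long)
    return [t for t in short if lr_search(long, model, t)]
```

The closer the long list is to a straight line, the smaller max_err becomes and the fewer elements each probe has to inspect. For a perfectly linear list the window shrinks to one or two candidates.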



Journal: Proceedings of the VLDB Endowment
Publication Date: May 1, 2011
Citations: 107

Similar Papers

Using graphics processors for high-performance IR query processing
Shuai Ding ... Hao Yan
21 Apr 2008

Efficient GPU-Based Query Processing with Pruned List Caching in Search Engines
Dongdong Wang ... Junjie Ren
01 Dec 2017

Fast lists intersection with Bloom filter using graphics processing units
Fan Zhang ... Xiaoguang Liu
21 Mar 2011

Embedded multicore computing and applications
Frédéric Magoulès ... Jia Hu
Concurrency and Computation: Practice and Experience | Vol. 28
02 Aug 2016
