In-cache query co-processing on coupled CPU-GPU architectures

Jiong He, Shuhao Zhang, Bingsheng He

doi:10.14778/2735496.2735497


Open Access

https://doi.org/10.14778/2735496.2735497


Abstract

Recently, processor designs have emerged in which the CPU and the GPU (Graphics Processing Unit) are integrated on a single chip and share the Last Level Cache (LLC). However, the main memory bandwidth of such coupled CPU-GPU architectures can be much lower than that of a discrete GPU. As a result, current GPU query co-processing paradigms can suffer severely from memory stalls. In this paper, we propose a novel in-cache query co-processing paradigm for main memory On-Line Analytical Processing (OLAP) databases on coupled CPU-GPU architectures. Specifically, we adapt CPU-assisted prefetching to minimize cache misses in GPU query co-processing and CPU-assisted decompression to improve query execution performance. Furthermore, we develop a cost-model-guided adaptation mechanism for distributing the workload of prefetching, decompression, and query execution between the CPU and the GPU. We implement a system prototype and evaluate it on two recent AMD APUs, the A8 and the A10. The experimental results show that 1) in-cache query co-processing can improve the performance of the state-of-the-art GPU co-processing paradigm by up to 30% and 33% on A8 and A10, respectively, and 2) our workload distribution adaptation mechanism can improve query performance by up to 36% and 40% on A8 and A10, respectively.
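The abstract does not give the cost model itself, so the following is only a minimal sketch of what cost-model-guided workload distribution could look like for a single step. It assumes that the CPU and the GPU work on disjoint shares of the input in parallel, that each device has a constant per-tuple cost, and that the step finishes when the slower device finishes. The names `step_time`, `best_cpu_ratio`, `cpu_cost`, `gpu_cost`, and `granularity` are hypothetical and do not come from the paper.

```python
def step_time(n_tuples, cpu_ratio, cpu_cost, gpu_cost):
    """Estimated elapsed time of one step (assumed model, not the paper's).

    cpu_ratio is the fraction of tuples given to the CPU; the rest go to
    the GPU. Both devices are assumed to run concurrently, so the step
    takes as long as the slower of the two.
    """
    cpu_time = cpu_ratio * n_tuples * cpu_cost
    gpu_time = (1.0 - cpu_ratio) * n_tuples * gpu_cost
    return max(cpu_time, gpu_time)


def best_cpu_ratio(n_tuples, cpu_cost, gpu_cost, granularity=100):
    """Pick the CPU share with the lowest estimated time by grid search."""
    candidates = [i / granularity for i in range(granularity + 1)]
    return min(
        candidates,
        key=lambda r: step_time(n_tuples, r, cpu_cost, gpu_cost),
    )


if __name__ == "__main__":
    # Hypothetical unit costs: the CPU is assumed 3x slower per tuple.
    ratio = best_cpu_ratio(1000, cpu_cost=3.0, gpu_cost=1.0)
    print("CPU share:", ratio)
    print("Estimated time:", step_time(1000, ratio, 3.0, 1.0))
```

Under these assumptions the search balances the two devices so that neither sits idle. A system like the one described would apply such a choice separately to prefetching, decompression, and query execution, with costs calibrated on the target hardware.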


Journal: Proceedings of the VLDB Endowment
Publication Date: Dec 1, 2014
Citations: 103
License type: cc-by-nc-nd


Similar Papers

Reducing Inter-Application Interferences in Integrated CPU-GPU Heterogeneous Architecture
Hao Wen ... Wei Zhang
01 Oct 2018

Morpheus: Extending the Last Level Cache Capacity in GPU Systems Using Idle GPU Core Resources
Sina Darabi ... Negar Akbarzadeh
01 Oct 2022

Improving execution efficiency of just-in-time compilation based query processing on GPUs
Johns Paul ... Chiew Tong Lau
Proceedings of the VLDB Endowment | VOL. 14
01 Oct 2020

Performance-Energy Considerations for Shared Cache Management in a Heterogeneous Multicore Processor
Anup Holey ... Vineeth Mekkat
ACM Transactions on Architecture and Code Optimization | VOL. 12
09 Mar 2015
