Performance Modeling in CUDA Streams - A Means for High-Throughput Data Processing.

Hao Li,Di Yu,Anand Kumar,Yi-Cheng Tu

doi:10.1109/bigdata.2014.7004245

Abstract

Push-based database management system (DBMS) is a new type of data processing software that streams large volume of data to concurrent query operators. The high data rate of such systems requires large computing power provided by the query engine. In our previous work, we built a push-based DBMS named G-SDMS to harness the unrivaled computational capabilities of modern GPUs. A major design goal of G-SDMS is to support concurrent processing of heterogenous query processing operations and enable resource allocation among such operations. Understanding the performance of operations as a result of resource consumption is thus a premise in the design of G-SDMS. With NVIDIA's CUDA framework as the system implementation platform, we present our recent work on performance modeling of CUDA kernels running concurrently under a runtime mechanism named CUDA stream. Specifically, we explore the connection between performance and resource occupancy of compute-bound kernels and develop a model that can predict the performance of such kernels. Furthermore, we provide an in-depth anatomy of the CUDA stream mechanism and summarize the main kernel scheduling disciplines in it. Our models and derived scheduling disciplines are verified by extensive experiments using synthetic and real-world CUDA kernels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Modeling in CUDA Streams - A Means for High-Throughput Data Processing.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data

Lead the way for us

Journal: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data	Publication Date: Oct 1, 2014
Citations: 52

Similar Papers

Base64 Encoding on Heterogeneous Computing Platforms
Zheming Jin ... Hal Finkel
-
Zheming Jin, et. al.Zheming Jin ... Hal Finkel
01 Jul 2019
01 Jul 2019

RankBoost Acceleration on both NVIDIA CUDA and ATI Stream Platforms
Bo Wang ... Ruirui Li
-
Bo Wang, et. al.Bo Wang ... Ruirui Li
01 Jan 2009
01 Jan 2009

Concurrent query processing in a GPU-based database system.
Hao Li ... Bo Zeng
PloS one | VOL. 14
Hao Li, et. al.Hao Li ... Bo Zeng
16 Apr 2019
PloS one | VOL. 14

Adaptive Query Processing on Raw Data Files

-

01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Modeling in CUDA Streams - A Means for High-Throughput Data Processing.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data