Two-Level Task Scheduling for Irregular Applications on GPU Platform

Jing Li,Xiaobing Feng,Yuan Wu,Lei Liu,Chunlin Wu

doi:10.1007/s10766-015-0387-0

Abstract

With a data parallel design, GPUs depend on uniform work distribution to expose their full potential. Therefore, irregular applications suffer from serious performance degradation as it is highly challenging to schedule irregular tasks on a GPU: It requires understandings of GPU architecture and irregular applications to devise a scheduling most suitable in this context, not to mention error-prone concurrent programming. This paper proposes a two-level scheduling to distribute irregular tasks and enable resource sharing on GPUs, by managing tasks and threads hierarchically. Meanwhile, we manage to group cache friendly tasks for more data reuse in L1 cache. We further extend our scheduling to handle nested irregularities. Besides, we devise a programming framework to facilitate the task scheduling for application programmers. The experimental results show that our approach effectively improves performance of six irregular applications on a typical platform, yielding a harmonic-mean speedup of $$2.1\times $$2.1× at a small schedule cost, and does not burden programmers with lots of work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two-Level Task Scheduling for Irregular Applications on GPU Platform

Abstract

Talk to us

Similar Papers

More From: International Journal of Parallel Programming

Lead the way for us

Similar Papers

The Implementation and Optimization of Irregular Application Task Models Based on the Cell BE Processor
Jilin Zhang ... Yuyu Yin
International Journal of Digital Content Technology and its Applications | VOL. 6
Jilin Zhang , et. al.Jilin Zhang ... Yuyu Yin
29 Feb 2012
International Journal of Digital Content Technology and its Applications | VOL. 6

A Replication Software Architecture(RSA) for Supporting Irregular Applications on Wide-Area Distributed Computing Environments
Jaechun No ... Chang Won Park
-
Jaechun No, et. al.Jaechun No ... Chang Won Park
01 Jan 2007
01 Jan 2007

Design of low power L2 cache architecture using partial way tag information
A Divya Jebaseeli ... M Kiruba
-
A Divya Jebaseeli, et. al.A Divya Jebaseeli ... M Kiruba
01 Mar 2014
01 Mar 2014

An Energy-Efficient L2 Cache Architecture Using Way Tag Information Under Write-Through Policy
Jianwei Dai ... Lei Wang
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 21
Jianwei Dai, et. al.Jianwei Dai ... Lei Wang
01 Jan 2013
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Level Task Scheduling for Irregular Applications on GPU Platform

Abstract

Talk to us

Similar Papers

More From: International Journal of Parallel Programming