Mobile GPU shader processor based on non-blocking Coarse Grained Reconfigurable Arrays architecture

Kwontaek Kwon, Seokyoon Jung, Sungjin Son, Sangoak Woo, Soojung Ryu, Jeongae Park, Jeongsoo Park

doi:10.1109/fpt.2013.6718353

Abstract

Processors based on coarse-grained reconfigurable arrays (CGRAs) provide high performance and energy efficiency as well as programmability, because the datapath connecting the ALU arrays can be reconfigured. A CGRA-based processor executes loop kernels whose schedule must be fixed at compile time. This restriction makes CGRAs inefficient when accessing external memories or caches whose access time varies greatly. It is therefore challenging to build a high-performance, energy-efficient CGRA-based mobile GPU, because GPU shader execution usually involves massive numbers of texture memory accesses, which consist of accesses to the texture cache and to external texture memory. In this paper, we present a Non-blocking Coarse Grained Reconfigurable Arrays (NBCGRA) architecture which can handle varying-latency operations efficiently. We also propose an improved CGRA-based GPU shader processor architecture built on it. A retry buffer enables threads to re-execute later, when the required memory access completes. With a non-blocking texture cache, the shader core can execute without stalls even in the case of cache misses. Together, these components greatly improve CGRA core throughput despite longer memory access latencies. Evaluation results show that our NBCGRA-based shader processor performs efficiently despite extreme variation in texture cache access latencies and reduces shader execution cycles by up to 68% with minimal hardware cost overhead.
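The retry-buffer idea in the abstract can be illustrated with a toy cycle model. This is only a sketch under stated assumptions, not the authors' design: the function names, the one-thread-per-cycle issue model, the single texture access per thread, and the latency values are all hypothetical. It compares a core that stalls for the full latency of every access against a core that parks a missing thread in a retry buffer and keeps issuing other threads.

```python
from collections import deque

HIT_LATENCY = 1  # assumed latency that the compile-time schedule is built for


def blocking_cycles(latencies):
    """Blocking core: every access occupies the core for its full latency."""
    return sum(latencies)


def retry_buffer_cycles(latencies):
    """Non-blocking core with a retry buffer (toy model).

    Each cycle the core either re-executes one parked thread whose data
    has arrived, or issues one new thread. A thread whose access exceeds
    HIT_LATENCY is parked in the retry buffer instead of stalling the core.
    """
    to_issue = deque(enumerate(latencies))  # (thread_id, access latency)
    retry_buffer = []                       # (ready_cycle, thread_id)
    done = 0
    cycle = 0
    while done < len(latencies):
        ready = [entry for entry in retry_buffer if entry[0] <= cycle]
        if ready:
            # Re-execute the parked thread whose data arrived earliest.
            retry_buffer.remove(min(ready))
            done += 1
        elif to_issue:
            thread_id, latency = to_issue.popleft()
            if latency <= HIT_LATENCY:
                done += 1  # cache hit: the fixed schedule holds
            else:
                retry_buffer.append((cycle + latency, thread_id))
        # Otherwise the core idles this cycle, waiting on outstanding misses.
        cycle += 1
    return cycle


if __name__ == "__main__":
    # Hypothetical mix of hits (1 cycle) and misses (20 cycles).
    mix = [1, 20, 1, 1, 20, 1]
    print("blocking:", blocking_cycles(mix))
    print("retry buffer:", retry_buffer_cycles(mix))
```

In this model the miss latencies overlap with each other and with the remaining hits, which is the effect the abstract attributes to the retry buffer and the non-blocking texture cache. The numbers it produces are illustrative only and are unrelated to the paper's reported results.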
