Performance of level 3 BLAS kernels in a dynamically partitioned data-flow environment

P Berger,S Gruszka,I Gottlieb,Y Singer

doi:10.1016/0956-0521(95)00050-x

Abstract

The Dynamically Partitioned Data-Flow (DPDF) model is based on an original analysis concept of the data dependency graph at the instruction level. Instead of a breadth first analysis, as in a classical Data-Flow Model, we execute instructions along data-dependent paths. As a consequence, data locality can be exploited by reusing results between the execution of consecutive instructions. In addition, the different paths are not statically defined but arise from a dynamical partitioning of the graph. This model presents the advantage to support very small cost dynamic scheduling and multitasking strategies. In order to study the efficiency of this new model, a first architecture has been defined. This architecture is currently limited to a single processor with one serial processing unit but four graph analyzing units (called prefetch units). Each of these prefetch units is able to build dynamically its own execution path inside the Data-Flow graph of an application. The efficiency of this architecture is studied on a numerical benchmark composed of a subset of the Livermore loops and of three routines of the Level 3 BLAS (GEMM, SYRK and TRSM). Our goal in these experimentations is to demonstrate the ability of the four prefetch units to feed the ALU.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance of level 3 BLAS kernels in a dynamically partitioned data-flow environment

Abstract

Talk to us

Similar Papers

More From: Computing Systems in Engineering

Lead the way for us

Similar Papers

Instruction level redundant number computations for fast data intensive processing in asynchronous processors
Jeong-Gun Lee ... Dong-Ik Lee
Journal of Systems Architecture | VOL. 51
Jeong-Gun Lee, et. al.Jeong-Gun Lee ... Dong-Ik Lee
07 Jan 2005
Journal of Systems Architecture | VOL. 51

A comparative analysis of the semi-persistent and dynamic scheduling schemes in NR-V2X mode 2
Luca Lusvarghi ... Maria Luisa Merani
Vehicular Communications | VOL. 42
Luca Lusvarghi, et. al.Luca Lusvarghi ... Maria Luisa Merani
01 Jun 2023
Vehicular Communications | VOL. 42

Research on Network Control System Using Improved EDF Dynamic Scheduling Algorithm
Zai Ping Chen ... Hong Qiao Xu
Advanced Materials Research | VOL. 403-408
Zai Ping Chen, et. al.Zai Ping Chen ... Hong Qiao Xu
01 Nov 2011
Advanced Materials Research | VOL. 403-408

Conceptual graphs as a universal knowledge representation
John F Sowa
Computers & Mathematics with Applications | VOL. 23
John F SowaJohn F Sowa
01 Jan 1992
Computers & Mathematics with Applications | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of level 3 BLAS kernels in a dynamically partitioned data-flow environment

Abstract

Talk to us

Similar Papers

More From: Computing Systems in Engineering