WHERE DOES THE SPEEDUP GO: QUANTITATIVE MODELING OF PERFORMANCE LOSSES IN SHARED-MEMORY PROGRAMS

Seon Wook Kim,Rudolf Eigenmann

doi:10.1142/s0129626400000226

Abstract

Even fully parallel shared-memory program sections may perform significantly below the ideal speedup of P on P processors. Relatively little quantitative information is available about the sources of such inefficiencies. In this paper we present a speedup component model that is able to fully account for sources of performance loss in parallel program sections. The model categorizes the gap between measured and ideal speedup into the four components memory stalls, processor stalls, code overhead, and thread management overhead. These model components are measured based on hardware counters and timers, with which programs are instrumented automatically by our compiler. The speedup component model allows us, for the first time, to quantitatively state the reasons for less-than-optimal program performance, on a program section basis. The overhead components are chosen such that they can be associated directly with software and hardware techniques that may improve performance. Although general, our model is especially suited for the analysis of loop-oriented programs, such as those written in the OpenMP API. We have applied this model to compare three parallel code generation schemes for the Polaris parallelizing compiler. It helps us answer questions such as, what sources of inefficiencies are present in compiler-parallelized programs. To discuss the question we have also implemented an alternative, thread-based code generation method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

WHERE DOES THE SPEEDUP GO: QUANTITATIVE MODELING OF PERFORMANCE LOSSES IN SHARED-MEMORY PROGRAMS

Abstract

Talk to us

Similar Papers

More From: Parallel Processing Letters

Lead the way for us

Journal: Parallel Processing Letters	Publication Date: Jun 1, 2000
Citations: 6

Similar Papers

Some Ideas On Parallel Functional Programming
Paul Roe
-
Paul RoePaul Roe
01 Jan 1990
01 Jan 1990

On the exploitation of value prediction and producer identification to reduce barrier synchronization time
K.Z Ibrahim ... G.T Byrd
-
K.Z Ibrahim, et. al.K.Z Ibrahim ... G.T Byrd
23 Apr 2001
23 Apr 2001

Parallel Numeric Algorithms On Faster Computers

Scalable Computing Practice and Experience | VOL. 5

03 Jan 2001
Scalable Computing Practice and Experience | VOL. 5

Efficient Evaluation of Visual Queries Using Deductive Databases
Dimitra Vista ... Peter T Wood
-
Dimitra Vista, et. al.Dimitra Vista ... Peter T Wood
01 Jan 1995
01 Jan 1995

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

WHERE DOES THE SPEEDUP GO: QUANTITATIVE MODELING OF PERFORMANCE LOSSES IN SHARED-MEMORY PROGRAMS

Abstract

Talk to us

Similar Papers

More From: Parallel Processing Letters