Improving instruction-level parallelism by loop unrolling and dynamic memory disambiguation

Jack W Davidson ,Sanjay Jinturkar

doi:10.5555/225160.225184

Abstract

Exploitation of instruction-level parallelism is an effective mechanism for improving the performance of modern super-scalar/VLIW processors. Various software techniques can be applied to increase instruction-level parallelism. This paper describes and evaluates a software technique, dynamic memory disambiguation, that permits loops containing loads and stores to be scheduled more aggressively, thereby exposing more instruction-level parallelism. The results of our evaluation show that when dynamic memory disambiguation is applied in conjunction with loop unrolling, register renaming, and static memory disambiguation, the ILP of memory-intensive benchmarks can be increased by as much as 300 percent over loops where dynamic memory disambiguation is not performed. Our measurements also indicate that for the programs that benefit the most from these optimizations, the register usage does not exceed the number of registers on mast high-performance processors.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving instruction-level parallelism by loop unrolling and dynamic memory disambiguation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improving instruction-level parallelism by loop unrolling and dynamic memory disambiguation
J.W Davidson ... S Jinturkar
-
J.W Davidson, et. al.J.W Davidson ... S Jinturkar
01 Nov 1995
01 Nov 1995

Modulo scheduling with multiple initiation intervals
N.J Warter-Perez ... N Partamian
-
N.J Warter-Perez, et. al.N.J Warter-Perez ... N Partamian
01 Nov 1995
01 Nov 1995

On the Boosting of Instruction Scheduling by Renaming
... Ted C Yang
The Journal of Supercomputing | VOL. 19
, et. al. ... Ted C Yang
01 Jan 2001
The Journal of Supercomputing | VOL. 19

Making graphs reducible with controlled node splitting
Johan Janssen ... Henk Corporaal
ACM Transactions on Programming Languages and Systems | VOL. 19
Johan Janssen, et. al.Johan Janssen ... Henk Corporaal
01 Nov 1997
ACM Transactions on Programming Languages and Systems | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving instruction-level parallelism by loop unrolling and dynamic memory disambiguation

Abstract

Talk to us

Similar Papers