Increasing the Applicability of Scalar Replacement

Byoungro So, Mary Hall

doi:10.1007/978-3-540-24723-4_13

Abstract

This paper describes an algorithm for scalar replacement, which replaces repeated accesses to an array element with a scalar temporary. The element is then accessed from a register rather than memory, eliminating unnecessary memory accesses. A previous approach to this problem combines scalar replacement with a loop transformation called unroll-and-jam, whereby outer loops in a nest are unrolled and the resulting duplicate inner loop bodies are fused together. The effect of unroll-and-jam is to bring opportunities for scalar replacement into inner loop bodies. In this paper, we describe an alternative approach that can exploit reuse opportunities across multiple loops in a nest without requiring unroll-and-jam. We also use this technique to eliminate unnecessary writes back to memory. The approach is particularly well suited to architectures with large register files and efficient mechanisms for register-to-register transfer. In our experimental results mapping 5 multimedia kernels to an FPGA platform, assuming 32 registers, we observe a 58 to 90 percent reduction in memory accesses and a speedup of 2.34 to 7.31 over the original programs.

Keywords (added by machine, not by the authors): Memory Access, Outer Loop, Loop Nest, Loop Body, Dependence Vector
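As a rough illustration of the basic transformation the abstract describes, the sketch below applies scalar replacement by hand to a small FIR-style loop nest. The kernel, the function names `fir_original` and `fir_scalar_replaced`, and the parameter names are assumptions made for this example and are not taken from the paper; the sketch also assumes that `d` does not alias `a` or `c`, and that `a` holds at least `n + m - 1` elements. It shows only the classic inner-loop case, not the paper's algorithm for exploiting reuse across multiple loops.

```c
#include <stddef.h>

/* Original loop nest (illustrative kernel, not from the paper):
 * d[j] is loaded from and stored to memory on every inner iteration. */
void fir_original(const int *a, const int *c, int *d, size_t n, size_t m)
{
    for (size_t j = 0; j < n; j++)
        for (size_t i = 0; i < m; i++)
            d[j] = d[j] + a[i + j] * c[i];
}

/* Hand-applied scalar replacement, assuming d does not alias a or c:
 * the repeated accesses to d[j] go through the scalar `acc`, which a
 * compiler can keep in a register for the whole inner loop. */
void fir_scalar_replaced(const int *a, const int *c, int *d, size_t n, size_t m)
{
    for (size_t j = 0; j < n; j++) {
        int acc = d[j];                 /* single load of d[j] */
        for (size_t i = 0; i < m; i++)
            acc += a[i + j] * c[i];     /* no access to d[j] in here */
        d[j] = acc;                     /* single write back to memory */
    }
}
```

In this sketch the accesses to `d[j]` drop from one load and one store per inner iteration to one load and one store per outer iteration, which is also a small instance of removing unnecessary writes back to memory. The elements of `c`, and the overlapping elements of `a`, are reused across iterations of the outer loop; that is the kind of reuse the abstract says the paper targets without unroll-and-jam.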

Similar Papers

Scalar replacement in the presence of conditional control flow
Steve Carr ... Ken Kennedy
Software: Practice and Experience | VOL. 24
01 Jan 1993

Energy-efficient mechanisms for managing thread context in throughput processors
Mark Gebhart ... David Tarjan
ACM SIGARCH Computer Architecture News | VOL. 39
04 Jun 2011

Early Register Release for Out-of-Order Processors with Register Windows
Eduardo Quinones ... Joan-Manuel Parcerisa
01 Sep 2007
